• kautau@lemmy.world
    link
    fedilink
    arrow-up
    2
    ·
    5 days ago

    I think it really depends on how accurate you want / what language you are interpreting. https://github.com/openai/whisper has multiple variations on their model, but they all pretty much require VRAM/graphics capability (or likely NPUs as they become more commonplace).