• BlameThePeacock@lemmy.ca
    5 months ago

    This particular implementation doesn’t really apply to those situations. Existing technologies can already be pre-trained on specific voices, and they could use those instead, since the target is known in advance. The main “improvement” from this system is that you can train it on any target subject, even with background noise, in only a few seconds.

    It’s most useful in the scenarios they’ve outlined in their study, like using it with a friend you ran into on the bus, your tour guide, etc.