this post was submitted on 01 Apr 2025
Technology
Interesting article. I agree with his analysis, but I'm not sure (yet) that I agree with his conclusions. My brain needs to think about it in the background for a bit (that's just the way mine works).
TLDR: we should expect conversational interfaces to be an addition to the workflows we currently use.
Computer upvote this post. I mean comment. No, I meant the comment. Computer remove the upvote from the post. Computer upvote the comment.
Computer compose reply.
Dear Aunt, let's set so double the killer delete select all
In my own real-world usage I estimate a comprehension rate of about 92% with voice agents. I'm no linguist, but I'd guess that you'd need at least 98% comprehension for a conversation not to feel frustrating. I'm also instantly irritated if my computer lags and nothing happens when I click on something, or if I go to use someone else's computer and they have double-clicking enabled for some reason (why?!), so my tolerance is probably on the low end.
Anyway, I thought this was an insightful read. The key for me is that the bar is pretty high now for man-machine interfaces, so any implementation of newer tech in this realm needs to be both thoughtful and as bug-free as possible.
For me it feels more like 9.2% most of the time, and that is just the voice-to-text part, not even the interpretation of the resulting text as a command.
Does feel like that, I agree, but if you spoke to someone who randomly completely misunderstood 8 out of every 100 words you said and had next to zero dead reckoning ability to figure out what that missing word was, I think you'd feel pretty frustrated.
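To put rough numbers on the 92% vs. 98% figures above, here is a back-of-the-envelope sketch in Python. It assumes each word is recognized independently with the same accuracy, which real speech recognizers almost certainly violate, and the 10-word sentence length and the function name `sentence_success_rate` are just picked for illustration.

```python
def sentence_success_rate(word_accuracy: float, words_per_sentence: int) -> float:
    """Chance that every word in a sentence is recognized correctly,
    assuming (unrealistically) that word errors are independent."""
    return word_accuracy ** words_per_sentence


if __name__ == "__main__":
    # 0.92 and 0.98 are the estimates from the comment above;
    # the 10-word sentence length is an assumption.
    for accuracy in (0.92, 0.98):
        rate = sentence_success_rate(accuracy, 10)
        print(f"{accuracy:.0%} per word -> {rate:.1%} of 10-word sentences error-free")
```

Under those assumptions, 92% per-word accuracy leaves only about 43% of 10-word sentences free of errors, while 98% gets you to roughly 82%. That would go some way towards explaining why a seemingly small gap in word accuracy feels so large in conversation.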
I thought about it some more since I wrote my comment and I am genuinely unsure any voice recognition system I have ever used managed to transcribe a full sentence to text successfully without making at least one mistake.
On the other hand, with a keyboard I am reasonably sure I run into problems such as network filesystems being unable to reach the server, or broken hard drives, more often than I mistype a command I commonly use. Granted, part of that is thanks to tab completion, but that is part of the issue with voice input: there is no easy way to correct what it got wrong.
In English? Do you have an accent? Dragon is one of the better ones, and it seems to do remarkably well with many accents. Google seems to have one of the worst I've come across.
Both in English and in my native German. I probably do have an accent in English, but that is difficult to judge for myself. Certainly nothing that prevents other people from understanding me, though.