Home Assistant sells units with dual microphones that aren’t too expensive and work relatively well. But the local voice recognition wasn’t great when I last tried it.
Similar to the other user’s response, I use the calendar integration, then add the things on the calendar (say, putting the recycling out to be collected). Then I have an automation that will read out a reminder at the time it is scheduled for in the calendar.
So the evening before recycling pickup every fortnight, it pipes up and says “Reminder: Recycling” or whatever.
Works pretty well for these regular recurring things. I haven’t tried using it for one-off reminders, and you can’t say “ok nabu, remind me to wish Steve a happy birthday on the 27th of February” or anything like that. Still, I’m pretty happy.
I seem to remember it took a bit of fiddling to get the notification working. I’m happy to look up and post what I have in my automation if needed.
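For anyone who wants the logic spelled out, here is a minimal Python sketch of the idea: a recurring calendar event, and an announcement that fires when the event's start time falls inside the current check window. This is a plain Python model, not Home Assistant configuration and not my actual automation; `CalendarEvent`, `fortnightly_events` and `due_announcements` are made-up names for illustration.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class CalendarEvent:
    start: datetime   # when the reminder should be read out
    summary: str      # what the calendar entry is called


def fortnightly_events(first_start: datetime, summary: str, count: int) -> list[CalendarEvent]:
    """Build `count` events, two weeks apart, starting at `first_start`."""
    return [
        CalendarEvent(first_start + timedelta(weeks=2 * i), summary)
        for i in range(count)
    ]


def due_announcements(events: list[CalendarEvent],
                      window_start: datetime,
                      window_end: datetime) -> list[str]:
    """Return the spoken text for every event starting inside the window."""
    return [
        f"Reminder: {event.summary}"
        for event in events
        if window_start <= event.start < window_end
    ]
```

The event is scheduled for the evening before pickup, so the announcement time is simply the event's start time.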
In Home Assistant, in the settings, if you go to Voice Assistants then click the … on your assistant and click Debug, you can see what it thought you said (and what it did).
Setting a timer on an up-to-date Home Assistant will repeat back what it set, e.g. if I say “Set a timer for 2 minutes” it will say “Timer set for 2 minutes”. It says “Done” when running some Home Assistant task/automation, so it’s probably not understanding you correctly (which is what the debug option is good for). I use the cloud voice recognition as I couldn’t get the local version to understand my accent when I tried it (a year ago). It’s through Azure but is proxied by Home Assistant so they don’t know it’s you.
The wake word responds to me, but not my girlfriend’s voice.
My wife swears it’s sexist; she has a bit of trouble too. In the integration options you can raise the wake word sensitivity, but that does increase false activations. I have it on the most sensitive setting and she can activate it first time most of the time.
I agree that it’s not production ready and they know that too, hence the name. But in relation to your points, I plugged in an external speaker, as the built-in one isn’t really that great.
For the wake word, at some point they did an update to add a sensitivity setting so you can make it more sensitive. You could also try donating your voice to the training: https://ohf-voice.github.io/wake-word-collective/
But all in all you’re spot on with the challenges. I’d add a couple more.
With OpenAI I find it can outperform other voice assistants in certain areas. Without it, you come up against weird issues, like my wife always says “set timer 2 minutes” and it runs off to OpenAI to work out what that means. If you say “set a timer for 2 minutes” it understands immediately.
What I wish for is the ability to rewrite requests. Local voice recognition can’t understand my accent so I use the proxied Azure speech to text via Home Assistant Cloud, and it regularly thinks I’m saying “Cortana” (I’m NEVER saying Cortana!).
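To show the kind of rewriting I mean, here is a rough Python sketch that normalises a transcript before it gets matched against known commands. It is purely illustrative and not an existing Home Assistant feature; `REWRITE_RULES` and `rewrite_request` are hypothetical names, and the single rule just covers the timer phrasing mentioned above.

```python
import re

# Each rule is (pattern, replacement). Hypothetical example rule:
# turn the terse "set timer 2 minutes" into the phrasing that is
# understood immediately, "set a timer for 2 minutes".
REWRITE_RULES = [
    (
        re.compile(r"^set timer (\d+) (seconds?|minutes?|hours?)$", re.IGNORECASE),
        r"set a timer for \1 \2",
    ),
]


def rewrite_request(text: str) -> str:
    """Return the transcript rewritten by the first matching rule, else unchanged."""
    text = text.strip()
    for pattern, replacement in REWRITE_RULES:
        rewritten, count = pattern.subn(replacement, text)
        if count:
            return rewritten
    return text
```

A misheard word could be handled the same way with another rule, as long as you know what it should have been.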
Oh and I wish it could do streaming voice recognition instead of waiting for you to finish talking then waiting for a pause before trying anything. My in-laws have a Google Home and if you say something like “set a timer for 2 minutes” it immediately responds because it was converting to text as it went, and knew that nothing more was coming after a command like that. HAVP has perhaps a 1 second delay between finishing speaking and replying, assuming it doesn’t need another 5 seconds to go to OpenAI. And you have to be quiet in that 1 second otherwise it thinks you’re still talking (a problem in a busy room).
In my experience it’s not quite the same. Using WebDAV through the distro account, it seems to be fully online, and any folder or file access contacts the server.
The virtual file experience is more of a hybrid. All the folders actually exist on disk, as well as shells for every file. If you try to open a virtual file, in the background Windows will seamlessly download it for you. At that point the file is actually on your disk. This way regularly accessed files are on your hard drive and seldom accessed ones are not, saving local hard drive space while providing an experience almost as if all the files were actually on your drive.
On Windows, Nextcloud seems to tap into some Windows function to provide files on demand. Is there any Linux cloud file service that can do it?
It’s probably triggered when power consumption drops below some threshold X and stays there for a set amount of time.
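That guess can be sketched in a few lines of Python. This is only my assumption about how such a detector might work, not how any particular device or integration does it; `finished_at`, the sample format and the parameter names are all made up for the example.

```python
from typing import Optional


def finished_at(samples: list[tuple[float, float]],
                threshold_watts: float,
                hold_seconds: float) -> Optional[float]:
    """Return the timestamp at which power has stayed below the threshold
    for at least `hold_seconds`, or None if that never happens.

    `samples` is a time-ordered list of (timestamp_seconds, watts) readings.
    """
    below_since = None  # timestamp of the first reading in the current low run
    for timestamp, watts in samples:
        if watts < threshold_watts:
            if below_since is None:
                below_since = timestamp
            if timestamp - below_since >= hold_seconds:
                return timestamp
        else:
            # Power went back up, so the low run is broken and the clock resets.
            below_since = None
    return None
```

The hold time is what stops a brief dip mid-cycle from being mistaken for the end of the cycle.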
Well, the containers are grouped into services. I would easily have 15 services running. Some run a separate Postgres or Redis while others use an internal SQLite, so it’s hard to say (I’m not where I can look rn).
If we’re counting containers then between Nextcloud and Home Assistant I’m probably over 20 already lol.
Dave@lemmy.nz to Technology@lemmy.world • OpenAI is considering using biometric verification like World's eyeball scanning Orb for its planned social network to ensure its users are real people • English
18 · 15 days ago
ChatGPT, generate me an image of an eyeball.
Dave@lemmy.nz to Technology@lemmy.world • Amazon is forcibly upgrading Prime members to Alexa Plus, and users are not happy • English
3 · 1 month ago
Well if you do, contribute it to Home Assistant and I’ll install it 😆. It’s actually a little surprising conversions aren’t supported natively, but I guess there is a lot to cover and they will get there eventually.
Dave@lemmy.nz to Technology@lemmy.world • Amazon is forcibly upgrading Prime members to Alexa Plus, and users are not happy • English
3 · 1 month ago
Home Assistant has an automation event that lets you set the conversation result, but you’ve already passed my ability haha so I can’t tell you how to pull the result in from an external service.
It may well be worth building it as a Home Assistant integration rather than just custom sentence-triggered automations.
Dave@lemmy.nz to linuxmemes@lemmy.world • Guys, what's the best Linux distro to install on my PC?
21 · 1 month ago
Linux’s problem is that it’s not an OS, and so suggesting people use Linux doesn’t give them much advice.
The next problem is that Linux-based OSs are generally open source, which means they can be forked any number of times at any point in time.
There’s this super awesome and super confusing thing in open software where you don’t have to use the thing you are given. Want to use Facebook? Must use their app. Want to use Reddit? Pretty much must use their app, etc.
But if you want to use Lemmy or Piefed, there are a dozen good choices, and none are the wrong answer. Want to use Jellyfin? Well I connect with Kodi on my TV, Swiftfin on my mother’s, the Android Jellyfin app on my in-laws’ TV, Findroid (movies/TV) or Finamp (music) on my phone, etc. If you don’t like an app, you can still use the service: just try another app or make your own. This is awesome, but super confusing to non-technical people.
Linux distros are the same. There are dozens of popular ones, many of which are based on others. The variety of choices is awesome, but non-technical people have no idea where to start.
Dave@lemmy.nz to Technology@lemmy.world • Amazon is forcibly upgrading Prime members to Alexa Plus, and users are not happy • English
6 · 1 month ago
Worth noting you need both the speaker part and the server part. Home Assistant sells both as ready to go out of the box, but you do need both parts.
It’s also worth noting it’s a Preview Edition, as in not yet consumer ready.
It works but you will find quirks, and will find things it can’t do that you’d expect it to, and things it can do that others can’t.
It’s also very customisable, if you’re a bit technical (honestly you don’t need to be that technical these days, it has come a long way).
Dave@lemmy.nz to Technology@lemmy.world • Amazon is forcibly upgrading Prime members to Alexa Plus, and users are not happy • English
3 · 1 month ago
Do you have a plan? I have a Home Assistant Voice Preview Edition and it’s great but I don’t think it can do unit conversions without connecting it to an LLM. Timers work locally.
I guess if it’s an equation you could add an automation to pick up on the phrase and reply with the conversion, but that would need each unit to be set up manually and wouldn’t work for things like currency conversion, which needs live data.
Also arbitrary things would be challenging, like converting tablespoons of butter into grams or grams of rice into cups.
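To make the “each unit done manually” point concrete, here is a small Python sketch of a phrase matcher backed by a hand-written conversion table. It is an illustration only, not a Home Assistant automation; `CONVERSIONS`, `PHRASE` and `reply_to` are hypothetical names, and the phrase format is an assumption. Anything not in the table (butter, rice, currency) simply gets no answer.

```python
import re

# One entry per (from_unit, to_unit) pair; every pair has to be added by hand.
CONVERSIONS = {
    ("inches", "centimetres"): lambda x: x * 2.54,
    ("miles", "kilometres"): lambda x: x * 1.609344,
    ("fahrenheit", "celsius"): lambda x: (x - 32) * 5 / 9,
}

# Assumed phrase shape: "convert <number> <unit> to <unit>".
PHRASE = re.compile(r"^convert (-?\d+(?:\.\d+)?) (\w+) to (\w+)$", re.IGNORECASE)


def reply_to(sentence: str):
    """Return a spoken reply for a supported conversion, or None otherwise."""
    match = PHRASE.match(sentence.strip())
    if not match:
        return None
    value = float(match.group(1))
    source, target = match.group(2).lower(), match.group(3).lower()
    convert = CONVERSIONS.get((source, target))
    if convert is None:
        return None  # unit pair not in the hand-written table
    return f"{value:g} {source} is {convert(value):.2f} {target}"
```

Density-dependent conversions like tablespoons of butter to grams would need a separate table per ingredient, which is where this approach stops scaling.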
Dave@lemmy.nz to Technology@lemmy.world • What steps can be taken to prevent AI training and scraping of my public facing website? • English
3 · 2 months ago
As someone with a public facing website, I can say there are still significant volumes of scraping happening. It largely appears to come out of South East Asia and South America, and the scrapers take steps to hide who they are, so it’s not clear who is doing it or why. But like you say, it doesn’t appear to be OpenAI, Google, etc.
It doesn’t appear to be web search indexing either. The scraping is aggressive, and the volume will bring down a Lemmy server no matter how powerful the hardware.
I don’t know the answer but does tab to autocomplete work in other contexts? E.g. you type ‘cd ca’ and it fills it to ‘cd catpics’?
I’m not at a PC right now but from memory you’d have to be in bash or similar, it won’t work in sh.
What’s your solution to this problem for the rest of your digital life?
Dave@lemmy.nz to Selfhosted@lemmy.world • Plex’s crackdown on free remote streaming access starts this week - Ars Technica • English
1 · 3 months ago
I think I tried this when troubleshooting and didn’t notice a difference. Never mind, I pretty easily taught her how to bring up the menu and switch audio streams so she can solve it herself now.
Dave@lemmy.nz to Selfhosted@lemmy.world • Plex’s crackdown on free remote streaming access starts this week - Ars Technica • English
2 · 3 months ago
Thanks, I didn’t manage to find many options in Swiftfin. Do you know if I can enforce it for a user from the server side?

I think it’s a pretty cool toy to play with. It mostly gets used for setting timers and playing music, but you can add Home Assistant automations that trigger when you say certain things. Lots to play with if that’s your idea of fun!