Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

LLMs already make amazing rubber ducks! I would bet you're right -- when we have really good voice UIs for LLMs so it's even more like having a real conversation there are a lot of people who are going to start unlocking considerably more value from them.

I've had some success using Gemini 1.5 to take a recorded Teams meeting of a debugging session (with screen share), extract an audio transcript with Whisper, upload _both_ the video and the transcript, and get a summary of what was done. I'm still working on how to get the right amount of detail and organization without losing the high level flow, but even in a basic state it's better than anything I've had previously.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: