Hacker Newsnew | past | comments | ask | show | jobs | submit | sails's commentslogin

How do you anticipate teams deploying this? I’m wary of GitHub for sensitive business documents, and wonder what an easy secure agent friendly deployment looks like. Cloudflare or GCP are maybe good candidates

Hey, contributor to Wuphf here,

Right now this is setup to be run on your machine. Git is used to do versioning but we don't push that to GitHub, nor do we keep any insight into what you have or what you're doing.

If there is long term value people are getting out of Wuphf we'll be happy to build out a hosted business/enterprise compliant version.


Thanks. I mean self hosting a shared version of this on internal infra but designed to be slightly collaborative.

> Accent, dialect, and low-resource language adaptation — adapt a base Gemma model to underrepresented voices and languages with your own labeled audio.

Is this for TTS? Have been looking for something to do a local fine tune to get a specific accent


I like it. I feel like this is a possible evolution of the browser.

Going further, AI internet browser could be an entirely new app to break from the legacy.

I feel this with coding agents, so often where it fetches web data and interprets it, html in that loop is only occasionally additive. Feels quite futuristic


This is good! Switching from the various terrible online tools I cobble together. (Descript, Riverside, etc etc)

Request for transcription and transcription editing


I’m doing something similar to simulate llms in b2b lending, it’s slightly slower paced but the core mechanisms are using just-bash to analyse business financials and make profitable loans.

I quite like the idea of llms writing more code up front to execute strategies.

I’m currently developing the game mechanics and ELO. Please share anything relevant if it comes to mind


See also speculative cascades which is a nice read and furthered my understanding of how it all works

https://research.google/blog/speculative-cascades-a-hybrid-a...


Always wondered how auth validation works on these. Could I use your serverless ocr?


I’ve had some success building “text to dashboard” with this using vercel.

I use bash-tool and Vercel sandbox to generate charts (Echarts) or tables (Tanstack table) from json data, and then json-render to render the charts, tables and markdown into a dashboard.


Please share as I would like to see what you have built.

What I like about this is that ides of a catalog which is what most business systems have in the form of their records and objects. Giving an AI accessible structure to this gets AI into the realm of the various 4GLs back in late 90s which made user created forms so much easier. Anybody remember that Informix 4GL for building simple apps from the db schema?


Is it reliable/robust?


It is more robust than when I tried the exact thing with structured outputs API and gpt4 era models, it’s not perfect but surprisingly good


Looking for an iOS app to test this as I’m generally curious about the capabilities of on devices TTS (yet to find an app, but there are loads for text gen)

It can’t be too far off considering Siri and TTS has been on devices for ages


Any recommendations for an iOS app to test models like this? There are a few good ones for text gen, and it’s a great way to try models


Besides UTM, no.


Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: