GBs of storage, potentially 10+ for very large datasets.
Minutes, not days. Very big datasets might take 30+ minutes (or even a couple of hours), but usefulness starts in the first few minutes (because of the priority algorithm).
Actually a few hundred documents is really no biggie; my current benchmark is in the range of <250 ms (feels instant) for hundreds of thousands of paragraphs.
Certainly not, presuming it doesn't escalate to the level of trademark/service mark infringement (to be fair, IANAL). Just a risk consideration...your product, your call.
But I think there's value in at least recognizing that the namespace is quite crowded given the collisions that two interweb randoms were able to identify in short order.
I'm using a fine-tuned t5-small model. I fine-tuned it for two tasks: question answering from a paragraph, and highlighting relevant text in search results.
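For anyone curious how one small T5 can serve two jobs: the usual trick is task prefixes, where each training example is serialized with a prefix that tells the model which task to perform. A minimal sketch of that serialization (the exact prefix strings here are my assumptions for illustration, not necessarily the author's training format):

```python
# Sketch of multiplexing two tasks through one T5 model via task
# prefixes (the standard T5 convention). Prefix strings are hypothetical.

def build_prompt(task: str, query: str, paragraph: str) -> str:
    """Serialize a (task, query, paragraph) triple into one T5 input string."""
    if task == "qa":
        # answer a question from the given paragraph
        return f"question: {query} context: {paragraph}"
    if task == "highlight":
        # mark the span of the paragraph most relevant to the query
        return f"highlight: {query} context: {paragraph}"
    raise ValueError(f"unknown task: {task}")

prompt = build_prompt("qa", "Who wrote it?", "The essay was written by Ada.")
print(prompt)  # question: Who wrote it? context: The essay was written by Ada.
```

At inference you'd feed the built string to the fine-tuned checkpoint, e.g. with Hugging Face `transformers`' `text2text-generation` pipeline, and the prefix alone routes the model to the right behavior.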
I'll keep it haha