More

tshiran · on April 25, 2018

This is great. Starflake reflections are so much nicer than old fashion cubes.

tshiran · on March 27, 2018

Arrow has really taken off over the last year. I'm seeing it in tons of different projects.

tshiran · on Oct 31, 2017

I would argue that the fact that Arrow has been integrated into so many projects over the last year is proof that a separate project made sense. Dremio, PySpark, Pandas, MapD, NVIDIA GPU Data Frame, Graphistry, Turbodbc, ...

rch · on Oct 31, 2017

Here's the conclusion of the article:

It makes sense to keep Arrow and Parquet/ORC as separate projects, while also continuing to maintain tight integration.

You might enjoy reading it to see why.

ineedasername · on Oct 31, 2017

Well... use in projects is an indicator, not a proof, of utility. Not all choices are good ones, inferior & redundant products are crop up on a regular basis, and to the dismay of many in this community, sometimes win out. Happily, from the article, Arrow, doesn't appear one of them, and has its own unique and justified niche.

tshiran · on July 21, 2017

We considered starting a company in SF a couple years ago. I'm glad that we decided to instead start it in Mountain View (http://www.dremio.com). We're able to attract very talented people from San Jose, Milpitas, Redwood City, etc. where rents are a bit more reasonable.

tshiran · on July 15, 2017

This is cool

tshiran · on Feb 13, 2017

Great article, this is the exact process I've observed multiple times.

tshiran · on Dec 2, 2015

Unless there's a really good reason to not split it equal, I think it's better if both founders have the same amount of equity. Otherwise you could create unnecessary tension which is the last thing you want at that early stage.

tshiran · on May 19, 2015

tshiran · on Dec 3, 2014

Another analogy that might help is strongly typed vs. loosely typed languages. There are pros and cons to both Java and Python, and both are widely used.

tshiran · on Dec 3, 2014

I think there's a tradeoff here. It's like MongoDB - hundreds of thousands downloads per month because it enables people to build and deploy applications faster. For many cases a relational database is still better though.

The nice attribute of Drill is that it works on schema-less data as well as data with strong schemas. The user can make the tradeoff between agility and 'safety'.