danhilltech's comments

danhilltech · on Aug 23, 2024

Not quite. It's first a search engine, so I index latest HN posts and a few other sources. There's then an algorithm that classifies new feeds as either "good" (personal blog, little ads, first person writing etc) or not. Then yes, it's those RSS feeds ranked chronologically. But the point is, it's not a curated or set list of blogs.

danhilltech · on Aug 22, 2024

Great ideas, will do!

breck · on Aug 22, 2024

Thanks!

Also any relation to author of https://en.wikipedia.org/wiki/The_Pattern_on_the_Stone ? (one of my all time favorite books)

danhilltech · on Aug 22, 2024

Thanks! I actually read your note on `use Cache-Control max-age HTTP headers for a "do not poll again before" time: savings of 10x or much more are likely if the feed server is set up well: an unnecessary feed poll avoided entirely is the cheapest kind!` and realized I don't currently, so adding that tonight.

DamonHD · on Aug 22, 2024

Hurrah! Thanks!

danhilltech · on Feb 21, 2022

Server-side tracking has been around for a while (indeed this article is dated Nov 15, 2020; and of course, you could argue simply parsing your Apache/nginx logs to get visitor stats has existed forever). The article I think conflates several different pieces.

There's probably a few actual use cases marketers may care about for tagging/tracking/analytics:

1. Simplest: I want to know how many people use my site/app, how many come back, how many are real (not bots), which pages are popular, etc. I'd like to see all this in a nice UI where I can cut and filter the data.

2. Same as #1, but I'd like to do it across devices. Still all within my own site/app, but simply connecting a non-logged in session across desktop and mobile web. Google and FB probably have the largest available dataset on this.

3. I'd like to enrich all this information with data from other sources, for example to target ads, serve ads, etc.

Site owners/marketers then try and tackle these in a few ways, the first 3 equally bad:

1. Just dump a bunch of scripts into your site (GA, FB, Segment, whatever). Pros: easy. Cons: very easily blocked, so your data is super biased.

2. Self host some of these scripts, or CNAME them. Pros: maybe a bit better for performance? Cons: still rather easily blocked with content signatures etc. A nightmare to ensure consistency if self-hosting.

3. Run your own JS that sends events to your server, and then your server fans out to whomever. Pros: much harder to block, and likely quite performant. Cons: its unlikely your self built lib is going to give all the same 'features' as GA (features meaning device fingerprinting and so on).

4. Just get everything from HTTP logs. Pros: very performant, can't be blocked. Cons: much more limited data to work with.

Personally, I think #4 is the future (and also where we started 20 years ago). What I don't think anyone is doing yet is relaying that data out to all the other parts of the stack: GA, FB, Mixpanel, whatever. If you could solve both - giving users privacy and performance and giving marketers the same tools they're used to - sounds like a win. You might argue "well we'd be missing a bunch of user data", but you're already missing it with adblockers and iOS privacy features.

Raed667 · on Feb 21, 2022

> 3. Run your own JS that sends events to your server

If your platform is popular enough, those telemetry endpoints will end-up on ad-blockers lists.

Then it is up to you, if you want to do an arms race of obfuscation or just accept it.

danhilltech · on Feb 21, 2022

totally

olliej · on Feb 21, 2022

1) can be done trivially with first party cookies.

2) you can already tell what device someone is using. If you mean “I want to know if the same person is on different devices” get them to login, don’t try in effectively spy while also providing google etc with the ability to actually spy

3)you cannot know how to target ads on a per user basis unless you are spying on your users. You have no justification that supports a claim to such information.

danhilltech · on Feb 21, 2022

Yea, I think we're saying the same thing. Ultimately both the best choice (for privacy, performance etc.) and the one that's most likely (given adblockers and and ever increasing push for privacy from browsers and OSs) is to stop trying to find a way around adblockers, and simply invest in the technologies that work - http, cookies, sessions, logins, and os on.

pixeldetracking · on Feb 21, 2022

5. use edge computing https://blog.cloudflare.com/zaraz-use-workers-to-make-third-...

tootie · on Feb 21, 2022

I think some of the whiplash in the market isn't just the tit for tat battle with ad blockers and regulators but the realization that there's so much useless data being collected. The best data we get is first party (ie things people click or type into forms on our sites) or qualitative feedback from surveys. GA and GTM are valuable tools for us but Google's network isn't really.

danhilltech · on Feb 21, 2022

Yea. Though, GA does (at least) two things: analyzes your own data, and, uses the data they collect from all their other sites to improve your experience via better bot detection, recommendations, insights. Google's network is useful, like it or not, for a) their cross device graph - they know which mobile devices and which desktop browsers are the same user (ish) and b) from that, building better MTA models than you can with pure first-party data - especially if most of your traffic isn't logged in.

But I agree, the future is pointing toward a world where privacy and empowerment is more in the hands of the user, and that's a good thing.

danhilltech · on Aug 20, 2019

Cool idea

kofman · on Aug 20, 2019

Thanks! Let us know if you have any feedback as you play with it!

danhilltech · on June 9, 2019

We’re building a graph of this same underlying data at https://alma.app (with a lot of enrichments) to help people discover and donate to nonprofits.

E.g. here’s Stanford: https://alma.app/charities/941156365-the-board-of-trustees-o...

Would love to know what types of analysis people would like to see about organizations and their relationships?

walterbell · on June 9, 2019

> Would love to know what types of analysis people would like to see about organizations and their relationships?

Related public records, e.g. court filings, municipal or other co-investments alongside the nonprofits, adjacent (time or geo) legislation/policy changes, rotating doors of nonprofit, gov, commercial.

danhilltech · on June 9, 2019

Interesting thanks!

schwanksta · on June 9, 2019

This is neat! I thought about extracting the grants (still might), but full-text seemed like good bang for the buck. Your tools sound like they might be very useful for reporters. Have you given any thought to that? We love mapping these sorts of connections.

danhilltech · on June 9, 2019

Hey, first up really amazing work you’re doing, hugely inspiring for us! Thank you.

Whilst our focus has been delivering a consumer layer on top of all this data, yea, very open to exposing our underlying graph to others. Want to drop me a note at dan at alma.app?

As you mention elsewhere, half the battle is cleaning the data and getting quality.

johnymontana · on June 9, 2019

Could you clarify what you mean by “building a graph”? Are you combining the 990 data plus other enrichments in a graph database?

danhilltech · on June 9, 2019

Yep basically - looking at the network of grants (who funds who), people (who works where over their careers), social media and news (who’s referenced or cited alongside whom) and so on. You start to see interesting clusters of nonprofits that are funded by the same groups and work together extensively.

johnymontana · on June 9, 2019

Neat. What technologies are you using to store and analyze the data? Are you using any graph algorithms, like PageRank, community detection, etc?

danhilltech · on June 9, 2019

DGraph and elasticsearch with some redis caching.

Yea, a version of pagerank is coming soon! Love any other ideas...

danhilltech · on March 24, 2019

Yea can totally work - it gets at a lot in the zeitgeist in tech recently. I left Airbnb and started https://alma.app for similar reasons. We help companies with strong social missions (and their employees and their customers) connect with charities that share the same broad goals.

Many people I worked with would I think not only want to work with an agency that supports charities, but you could go a step further and also help those employees use their skills at the charities to volunteer, the companies support them too.

Always happy to talk, just ping me a note!

danhilltech · on Sept 3, 2018

ALMA | front end (js) | Washington DC | fulltime | onsite

We're hiring our second engineer to focus on our front end site and apps. We're a seed funded startup helping make philanthropy easy for everyone.

I'm the CEO and wrote all the code so far. My co-founder and I lead product/growth at Airbnb the last six years. Check out https://alma.app for more. Email me on Dan at alma.app or on Twitter @danhilltech. Can also help relocate from Bay area.

Stack is react, react native, golang, on aws.

danhilltech · on Aug 30, 2018

Recently left Airbnb to start a new company in Washington DC. I'm reminded of my previous experience starting a company in London. As soon as we had success (exit) we moved to SF. All that's changed for me is that the $ threshold has gone up, but the principle is probably still true.