Very cool project! Quick question: is the underlying Pushshift dataset updated with new Reddit data on any regular cadence (daily/weekly/monthly), or is this essentially a fixed historical snapshot up to a certain date? Just want to understand if self-hosters would need to periodically re-download for fresh content or if it's archival-only.
The data from 2025-12 has already been released; it usually comes out every month, it just needs to be split and reprocessed for 2025 by watchful1. I will probably eventually add support for importing data from the monthly Arctic Shift dumps so that archives can be updated monthly.
We have been seeing quite a lot of conversations in customer support teams around tagging tickets (used as part of triggers, macros, analytics, etc.), and we know what a hard and time-consuming process it is.
This is why we built a MonkeyLearn extension for Zendesk (which we are releasing today) to help with this tagging process using machine learning.
With this integration, MonkeyLearn will automatically tag and categorize incoming tickets in Zendesk. It predicts the value of a given field based on the subject and content of a ticket, using your historical data to train the machine learning model.
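To make the idea concrete, here is a minimal stdlib-only sketch of that kind of field prediction: train on historical tickets (subject, content, field value) and score new tickets by word overlap. This is a toy stand-in, not MonkeyLearn's actual model, and all names and sample data below are hypothetical.

```python
from collections import Counter, defaultdict

def tokenize(text):
    # Lowercase bag-of-words tokenization.
    return text.lower().split()

def train(historical_tickets):
    """Build per-label word counts from (subject, content, label) tuples."""
    word_counts = defaultdict(Counter)
    for subject, content, label in historical_tickets:
        word_counts[label].update(tokenize(subject + " " + content))
    return word_counts

def predict(word_counts, subject, content):
    """Score each candidate field value by overlap with its training words."""
    words = tokenize(subject + " " + content)
    scores = {
        label: sum(counts[w] for w in words)
        for label, counts in word_counts.items()
    }
    return max(scores, key=scores.get)

# Hypothetical historical tickets: (subject, content, field value).
history = [
    ("Refund request", "I want my money back", "billing"),
    ("Invoice problem", "The invoice amount is wrong", "billing"),
    ("App crashes", "The app crashes when I log in", "bug"),
    ("Login error", "I get an error when I log in", "bug"),
]
model = train(history)
print(predict(model, "Wrong charge", "My invoice shows the wrong amount"))  # billing
```

A real classifier would use TF-IDF weighting and a proper learning algorithm rather than raw counts, but the input/output shape is the same: historical tickets in, a predicted field value out.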
This is an initial version and it’s free to use (at least for most CS teams).
We are trying to understand the value it provides and whether it helps support teams with this process, so any kind of feedback is greatly appreciated. Also, if you need any help fine-tuning the model, we're more than happy to assist.
For creating Tarsier, we used Tweepy to extract tweets via the Twitter Public API, MonkeyLearn to analyze the tweets, and Plotly to create the visualizations.
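The pipeline shape (extract → analyze → visualize) can be sketched with stdlib stand-ins; the functions below are hypothetical placeholders, not the real Tweepy, MonkeyLearn, or Plotly calls, which all require API credentials.

```python
from collections import Counter

def extract_tweets(query):
    # Stand-in for a Tweepy search against the Twitter Public API.
    return [
        "I love this product",
        "Terrible customer service",
        "Great experience, would recommend",
    ]

def analyze_sentiment(tweets):
    # Stand-in for a MonkeyLearn sentiment classifier: label each tweet.
    negative_words = {"terrible", "bad", "awful"}
    labels = []
    for tweet in tweets:
        words = set(tweet.lower().split())
        labels.append("Negative" if words & negative_words else "Positive")
    return labels

def summarize_for_plot(labels):
    # Aggregate label counts; with Plotly these would feed a bar chart's x/y.
    return Counter(labels)

tweets = extract_tweets("example query")
labels = analyze_sentiment(tweets)
print(summarize_for_plot(labels))  # Counter({'Positive': 2, 'Negative': 1})
```

Swapping each stand-in for its real counterpart (Tweepy client, MonkeyLearn classifier, Plotly figure) preserves the same three-stage flow.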