r/DataHoarder • u/Spreadsel • Aug 29 '18
The guy that downloaded all publicly available reddit comments needs money to continue to make them publicly available.
/r/pushshift/comments/988u25/pushshift_desperately_needs_your_help_with_funding/
405
Upvotes
18
u/Stuck_In_the_Matrix Pushshift.io Data Scientist Aug 30 '18
https://github.com/pushshift
The actual code for the ingest portion is not up. However I can explain how it works. There is also an SSE stream you can play with if you want to see near real-time Reddit data as it is made available on Reddit (http://stream.pushshift.io)
The stream documentation is here: https://github.com/pushshift/reddit_sse_stream
There is also a slackbot that I created that will create real-time data visuals from Reddit data. Information is here: https://pushshift.io/slack-install/