13 votes

Before hearing about Tildes, I wrote some Python scripts to find interesting Reddit content

https://john-millikin.com/reddit-front-page-2018

Not sure if this is of interest to anyone else, but it might be a good starting point if you want to find Reddit posts worth cross-posting here. The write-up starts with the Python script that fetches the raw data, then covers some basic SQL commands that delete low-signal parts of the dataset.
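To give a rough idea of the "delete low-signal rows" step without clicking through, here is a minimal sketch using Python's built-in `sqlite3` module. The `posts` table, its columns, and the score threshold are all hypothetical and chosen for illustration; the linked write-up may use a different schema and different criteria for what counts as low-signal.

```python
import sqlite3

# Hypothetical schema for illustration only; not taken from the linked write-up.
SCHEMA = """
CREATE TABLE IF NOT EXISTS posts (
    id        TEXT PRIMARY KEY,
    subreddit TEXT NOT NULL,
    title     TEXT NOT NULL,
    score     INTEGER NOT NULL
)
"""


def load_posts(conn, rows):
    """Create the table if needed and insert (id, subreddit, title, score) rows."""
    conn.execute(SCHEMA)
    conn.executemany(
        "INSERT OR REPLACE INTO posts (id, subreddit, title, score) "
        "VALUES (?, ?, ?, ?)",
        rows,
    )
    conn.commit()


def prune_low_signal(conn, min_score=100):
    """Delete posts scoring below min_score (an assumed definition of
    'low-signal') and return how many rows were removed."""
    cur = conn.execute("DELETE FROM posts WHERE score < ?", (min_score,))
    conn.commit()
    return cur.rowcount


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    load_posts(conn, [
        ("a1", "python", "Low-effort post", 3),
        ("b2", "python", "Popular post", 2500),
    ])
    removed = prune_low_signal(conn, min_score=100)
    print(f"removed {removed} low-signal rows")
```

Keeping the threshold as a parameter makes it easy to try a few cutoffs against a copy of the data before deleting anything for real.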

Also relevant is http://files.pushshift.io/reddit/submissions/, an archive of Reddit posts going back to 2011.

1 comment

  1. rorso
    That's really cool.

    For a while now I've been meaning to use the reddit API to pull just the links from various subreddits relating to my hobbies, without the comments or self.posts or anything else. That would take reddit back to what it was supposed to be: a content aggregator. In the meantime I just pull some subreddits into Inoreader via RSS, which works OK, but being able to actively filter stuff out before it hits my feed would be great.

    2 votes
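
The links-only filtering described in the comment above could be sketched roughly like this. It assumes you already have the parsed JSON of a subreddit listing (a dict with posts under `data.children`, each carrying `is_self`, `title`, `url`, and `domain`); fetching and authentication are left out. The function name and the `blocked_domains` parameter are made up for illustration.

```python
def extract_links(listing, blocked_domains=()):
    """Return (title, url) pairs for link posts in a parsed listing.

    Self posts are skipped, as is anything whose domain appears in
    blocked_domains (a hypothetical filter hook).
    """
    links = []
    for child in listing.get("data", {}).get("children", []):
        post = child.get("data", {})
        if post.get("is_self"):
            continue  # text-only self post, not an outbound link
        if post.get("domain") in blocked_domains:
            continue  # filtered out before it reaches the feed
        url = post.get("url")
        if url:
            links.append((post.get("title", ""), url))
    return links


if __name__ == "__main__":
    # Hand-written sample data, not real Reddit output.
    sample = {
        "data": {
            "children": [
                {"data": {"is_self": True, "title": "Weekly thread",
                          "url": "https://example.org/self", "domain": "self.hobby"}},
                {"data": {"is_self": False, "title": "Nice article",
                          "url": "https://example.com/a", "domain": "example.com"}},
            ]
        }
    }
    for title, url in extract_links(sample):
        print(title, url)
```

The resulting pairs could then be written out as an RSS or Atom feed for a reader such as Inoreader, though that part is not shown here.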