-
34 votes
-
AT&T says criminals stole phone records of ‘nearly all’ US customers in new data breach
26 votes -
Space data centres: ‘A figment of the imagination’ but one that could make Europe a space leader
15 votes -
Meet Mercy and Anita – the African workers driving the AI revolution, for just over a dollar an hour
18 votes -
The mystery of the £39 orange
11 votes -
Microsoft CEO of AI claims online content is 'freeware' [and can be used to train LLMs in the absence of a specific directives from the author against this]
43 votes -
Chicago’s NASCAR Race is a marvel of physics
6 votes -
GridStatus.io - see electricity use in each US region
8 votes -
I will fucking piledrive you if you mention AI again
119 votes -
US House GOP leaders vow to block online privacy bill over intraparty pushback
19 votes -
Meta hit with Norwegian complaint over its plans to use images and posts of users on Facebook and Instagram to train artificial intelligence models
27 votes -
Processing data from the James Webb Space Telescope • John Davies
8 votes -
Star botanist likely made up data about nutritional supplements, new probe finds
11 votes -
See the most detailed map of human brain matter ever created
14 votes -
Police are not primarily crime fighters
43 votes -
Many widely used reproductive health apps fail to protect highly sensitive data, study finds
33 votes -
Google Cloud accidentally deletes UniSuper’s online account due to ‘unprecedented misconfiguration’
41 votes -
Obsolete, but not gone: The people who won't give up floppy disks
23 votes -
ProtonMail discloses user data leading to arrest in Spain
41 votes -
New data shows deadly cost of US officials' failures with COVID in prisons
14 votes -
Big data reveals true climate impact of worldwide air travel
24 votes -
Extraverted introverts, cautious risk-takers, and selfless narcissists: A demonstration of why you can’t trust data collected on MTurk
27 votes -
Data show that the amount of sexual content in top films has sharply declined since 2000
33 votes -
‘The science isn’t there’: do dating apps really help us find our soulmate?
31 votes -
New products collect data from your brain. Where does it go?
4 votes -
How GM tricked millions of US drivers into being spied on (including me)
56 votes -
GM ends OnStar driver safety program after privacy complaints
38 votes -
HHS strengthens privacy of US reproductive health care data
10 votes -
The startup offering free toilets and coffee for delivery workers — in exchange for their data
26 votes -
Looking for help scraping and deleting a Reddit account
I have a couple of old Reddit accounts I’d like to delete as fully as possible. However one of them dates back to my teenage years and it’s some of the only writings I have from that time. Any...
I have a couple of old Reddit accounts I’d like to delete as fully as possible. However one of them dates back to my teenage years and it’s some of the only writings I have from that time. Any recommendations on good simple ways to scrape all the comments off of it and save them? Then what’s the best way to completely erase a Reddit footprint these days?
Looking for as simple a solution as possible, I’m not tech illiterate by any means but it’s also not a real strong suit for me.
18 votes -
Chrome/Firefox Plugin to locally scrape data from multiple URLs
As the title suggests, I am looking for a free chrome or firefox plugin that can locally scrape data from multiple URLs. To be a bit more precise, what I mean by it: A free chrome or firefox...
As the title suggests, I am looking for a free chrome or firefox plugin that can locally scrape data from multiple URLs. To be a bit more precise, what I mean by it:
- A free chrome or firefox plugin
- Local scraping: it runs in the browser itself. No cloud computing or "credits" required to run
- Scrape data: Collects predefined data from certain data fields within a website such as https://www.dastelefonbuch.de/Suche/Test
- Infinite scroll: to load data that only loads once the browser scrolls down (kind of like in the page I linked above)
I am not looking into programming my own scraper using python or anything similar. I have found plugins that "kind of" do what I am describing above, and about two weeks ago I found one that pretty much perfectly does what is described ("DataGrab"), but it starts asking to buy credits after running it a few times.
My own list:
- DataGrab: Excellent, apart from asking to buy credits after a while
- SimpleScraper: Excellent, but asks to buy credits pretty much immediately
- Easy Scraper: Works well for single pages, but no possibility to feed in multiple URLs to crawl
- Instant Data Scraper: Works well for single pages and infinite scroll pages, but no possibility to feed in multiple URLs to crawl
- "Data Scraper - Easy Web Scraping" / dataminer.io: Doesn't work well
- Scrapy.org: Too much programming, but looks quite neat and well documented
Any suggestions are highly welcome!
Edit: A locally run executable or cmd-line based program would be fine too, as long as it just needs to be configured (e.g., creating a list of URLs stored in a .txt or .csv file) instead of coded (e.g., coding an infinite scroll function from scratch).
8 votes -
FYI: This site claims to have harvested 4B+ Discord chats, today all yours for a price
41 votes -
ProtonMail on all the data that Outlook collects about your email
61 votes -
5.25-inch floppy disks expected to help run San Francisco trains until 2030
22 votes -
Sweden's public sector has ditched Big Tech in the name of privacy as a major telecom provider unveiled a new secure collaboration hub
14 votes -
GM sued for sale of OnStar driving data
54 votes -
Wait, does America suddenly have a record number of bees?
27 votes -
There is no evidence that CBD products reduce chronic pain, and taking them is a waste of money and potentially harmful to health, new research finds
58 votes -
CEO of data privacy company Onerep.com (used by the Mozilla Monitor service), founded dozens of people-search firms
44 votes -
GM cuts ties with two data firms amid heated lawsuit over driver data
32 votes -
Seems like all socials are being scraped for AI and personal/aggregate data. Is Tildes?
I was just reminded of that again when going back and looking at some of my old posts on reddit which is openly selling online data. Prompted me to use Redact which erases and overwrites comments...
I was just reminded of that again when going back and looking at some of my old posts on reddit which is openly selling online data. Prompted me to use Redact which erases and overwrites comments before deleting them. But that got me wondering if the same is true of Tildes? And how would we know?
34 votes -
Time to delete your Glassdoor account and data
102 votes -
Tell US Congress: Stop the TikTok ban
32 votes -
Automakers are sharing consumers’ driving behavior with insurance companies
58 votes -
Colorado Bureau of Investigation finds DNA scientist manipulated data in hundreds of cases over decades
31 votes -
What is the most reliable and affordable form of storage medium to use as a backup drive for your computer?
I just had my backup hard drive die and while it did last a few good years, I just want to know what everyone else is using and what gets the best bang for buck.
30 votes -
On International Women's Day, Northern European countries stand out for women who are looking to develop their careers – Iceland secured the top spot
3 votes -
Generative AI - We aren’t ready
27 votes -
Sen. Ron Wyden exposes data brokers selling US location data to anti-abortion groups that target abortion seekers
45 votes -
The beautiful maths which makes 5G faster than 4G, faster than 3G, faster than…
12 votes