-
10 votes
-
Looking for help scraping and deleting a Reddit account
I have a couple of old Reddit accounts I’d like to delete as fully as possible. However one of them dates back to my teenage years and it’s some of the only writings I have from that time. Any...
I have a couple of old Reddit accounts I’d like to delete as fully as possible. However one of them dates back to my teenage years and it’s some of the only writings I have from that time. Any recommendations on good simple ways to scrape all the comments off of it and save them? Then what’s the best way to completely erase a Reddit footprint these days?
Looking for as simple a solution as possible, I’m not tech illiterate by any means but it’s also not a real strong suit for me.
17 votes -
How GM tricked millions of drivers into being spied on (including me) (gifted link)
50 votes -
The startup offering free toilets and coffee for delivery workers — in exchange for their data
26 votes -
Chrome/Firefox Plugin to locally scrape data from multiple URLs
As the title suggests, I am looking for a free chrome or firefox plugin that can locally scrape data from multiple URLs. To be a bit more precise, what I mean by it: A free chrome or firefox...
As the title suggests, I am looking for a free chrome or firefox plugin that can locally scrape data from multiple URLs. To be a bit more precise, what I mean by it:
- A free chrome or firefox plugin
- Local scraping: it runs in the browser itself. No cloud computing or "credits" required to run
- Scrape data: Collects predefined data from certain data fields within a website such as https://www.dastelefonbuch.de/Suche/Test
- Infinite scroll: to load data that only loads once the browser scrolls down (kind of like in the page I linked above)
I am not looking into programming my own scraper using python or anything similar. I have found plugins that "kind of" do what I am describing above, and about two weeks ago I found one that pretty much perfectly does what is described ("DataGrab"), but it starts asking to buy credits after running it a few times.
My own list:
- DataGrab: Excellent, apart from asking to buy credits after a while
- SimpleScraper: Excellent, but asks to buy credits pretty much immediately
- Easy Scraper: Works well for single pages, but no possibility to feed in multiple URLs to crawl
- Instant Data Scraper: Works well for single pages and infinite scroll pages, but no possibility to feed in multiple URLs to crawl
- "Data Scraper - Easy Web Scraping" / dataminer.io: Doesn't work well
- Scrapy.org: Too much programming, but looks quite neat and well documented
Any suggestions are highly welcome!
Edit: A locally run executable or cmd-line based program would be fine too, as long as it just needs to be configured (e.g., creating a list of URLs stored in a .txt or .csv file) instead of coded (e.g., coding an infinite scroll function from scratch).
8 votes -
FYI: This site claims to have harvested 4B+ Discord chats, today all yours for a price
41 votes -
Sweden's public sector has ditched Big Tech in the name of privacy as a major telecom provider unveiled a new secure collaboration hub
13 votes -
ProtonMail on all the data that Outlook collects about your email
61 votes -
Wait, does America suddenly have a record number of bees?
27 votes -
There is no evidence that CBD products reduce chronic pain, and taking them is a waste of money and potentially harmful to health, new research finds
58 votes -
GM cuts ties with two data firms amid heated lawsuit over driver data
32 votes -
GM sued for sale of OnStar driving data
54 votes -
Seems like all socials are being scraped for AI and personal/aggregate data. Is Tildes?
I was just reminded of that again when going back and looking at some of my old posts on reddit which is openly selling online data. Prompted me to use Redact which erases and overwrites comments...
I was just reminded of that again when going back and looking at some of my old posts on reddit which is openly selling online data. Prompted me to use Redact which erases and overwrites comments before deleting them. But that got me wondering if the same is true of Tildes? And how would we know?
34 votes -
Tell US Congress: Stop the TikTok ban
32 votes -
Time to delete your Glassdoor account and data
102 votes -
CEO of data privacy company Onerep.com (used by the Mozilla Monitor service), founded dozens of people-search firms
44 votes -
Automakers are sharing consumers’ driving behavior with insurance companies (gifted link)
58 votes -
Colorado Bureau of Investigation finds DNA scientist manipulated data in hundreds of cases over decades
31 votes -
On International Women's Day, Northern European countries stand out for women who are looking to develop their careers – Iceland secured the top spot
3 votes -
Generative AI - We aren’t ready
27 votes -
Sen. Ron Wyden exposes data brokers selling US location data to anti-abortion groups that target abortion seekers
45 votes -
What is the most reliable and affordable form of storage medium to use as a backup drive for your computer?
I just had my backup hard drive die and while it did last a few good years, I just want to know what everyone else is using and what gets the best bang for buck.
30 votes -
The beautiful maths which makes 5G faster than 4G, faster than 3G, faster than…
12 votes -
Tumblr to begin selling user content to AI generative service companies, opt-out will be per blog
75 votes -
UK's NHS faces legal action over contract with data firm Palantir
12 votes -
Texas is right. The tech giants need to be regulated.
10 votes -
Vending machine error reveals secret face image database of Canadian college students
72 votes -
Google cut a deal with Reddit for AI training data
23 votes -
Walmart buying TV-brand Vizio for its ad-fueling customer data
48 votes -
Robots.txt governed the behavior of web crawlers for over thirty years; AI vendors are ignoring it or proliferating too fast to block
41 votes -
Reddit has a new AI training deal to sell user content
67 votes -
Introducing Mozilla Monitor Plus, a new tool to automatically remove your personal information from data broker sites
35 votes -
The relative share of Americans living in the West of the US has declined
21 votes -
How US anti-abortion ruling spurred federal action against the location data industry
24 votes -
The US hasn't seen syphilis numbers this high since 1950
19 votes -
Pew Research data on how many countries people have travelled to
25 votes -
Bruce Schneier on the CFPB’s proposed data rules
7 votes -
Criminals are getting increasingly adept at crafting malicious AI prompts to get data out of ChatGPT
22 votes -
‘Impossible’ to create AI tools like ChatGPT without copyrighted material, OpenAI says
44 votes -
Question about GDPR
I am in the EU. I asked a company in which I had an account to delete my account. They told me they would do that as long as I sent them an ID and a postal address. This is to ensure that "I am...
I am in the EU.
I asked a company in which I had an account to delete my account. They told me they would do that as long as I sent them an ID and a postal address. This is to ensure that "I am the right person".
I never gave them an ID and a postal address in the first place so how would that verify anything, and I'm using the email that I used to sign-up with them to ask for the deletion.
Am I in the wrong to believe that this should be easier? Are they misinterpreting the GDPR or am I?
What are my options if I do not want to send my ID and postal address?
--
Their arguments are:
Article 5(1)(f) of the GDPR requires us to meet security obligations in data processing. Since data deletion is permanent, we need to ensure that the request is indeed from the person concerned.
Furthermore, Article 12(6) of the GDPR states: "…when the data controller has reasonable doubts concerning the identity of the natural person making the request referred to in Articles 15 to 21, he may request the provision of additional information necessary to confirm the identity of the data subject."
10 votes -
Impact: FTC stops data broker X-Mode selling sensitive location data
16 votes -
Adopting rightwing policies ‘does not help centre-left win votes’
36 votes -
The New York Times sues OpenAI, Microsoft over the use of its stories to train chatbots
62 votes -
A quiet merger trial between antitrust enforcers and a pharma data giant called IQVIA reveals how bro-style executives control US medical data
13 votes -
Reindeer combine sleeping and digesting, Norwegian researchers found after extracting reindeer brain data
9 votes -
Simple Mobile Tools bought by ZipoApps (company offering apps with ads and tracking)
53 votes -
OpenAI suspends ByteDance's account after it used GPT to train its own AI model
20 votes -
Please help me understand and manage external hdd sleep
I have an external drive (3.5" hdd, SATA) in an enclosure (usb 3) (purchased separately), connected to a thunderbolt dock (OWC) connected alternately to an iMac and a macbook pro. The HDD goes to...
I have an external drive (3.5" hdd, SATA) in an enclosure (usb 3) (purchased separately), connected to a thunderbolt dock (OWC) connected alternately to an iMac and a macbook pro. The HDD goes to sleep, and causes problems. Freezes, weird internet access problems, kernel panics.
I have done some research, and can't seem to figure out:
how to know whether it is the drive, enclosure, or computer causing the sleep, although, fiddling with various settings on the mac seemed to have no effect, although it may have increased my battery usage :(
how to adjust settings on the drive, or in the enclosure.
How to determine what the sleep behavior of prospective drives will be.
As a workaround, I tried to write a zsh script to touch the drive ever few seconds. This kinda worked, but was a struggle to figure out appropriate permissions issues and how to make it run automatically.
I welcome all guidance, pointers to resources, clarifications, incantations, well-wishes.
8 votes -
Moderna, Merck vaccine with Keytruda cuts risk of deadly skin cancer returning in half, data says
9 votes -
Dropbox spooks users with new AI features that send data to OpenAI when used
49 votes