• Activity
  • Votes
  • Comments
  • New
  • All activity
  • Showing only topics with the tag "data". Back to normal view
    1. Shopping around for a new-and-improved backup solution

      A few days ago, I posted this and quickly realized that the world of data backups is far richer than just sudo rsync -av --delete --exclude=Videos /home /home_bkup. So now I'm window shopping the...

      A few days ago, I posted this and quickly realized that the world of data backups is far richer than just sudo rsync -av --delete --exclude=Videos /home /home_bkup.

      So now I'm window shopping the top Linux-supported backup solutions: borg, duplicacy, kopia, restic and--oh look--a core borg dev just dropped his own new-and-improved solution, vykar.

      Restic was the first tool I started to research, and I thought I really liked it, got as far as installing, initializing a test repo, creating a couple of snapshots. But restic seems to be, hmm, fussy about the source and destination paths, absolute vs relative paths, etc.

      The fact that merely renaming a parent directory (or grandparent, or great-grandparent, etc) causes restic to treat every unchanged byte below that as brand new ... that's a recipe for giant, bloated repos, and it's unacceptable to me ... and hey, lookit that, borg does not do that. So now, restic is out and borg is in.

      But what other pros v cons are there, that I haven't even realized need to be considered? What advantages/disadvantages do other apps offer? Which ones can I easily automate with nightly/hourly cron jobs? Which ones have their own even-better automated solutions?

      Do I even want encryption? All of my drives/volumes are LUKS encrypted, and anything I would store remotely would also get encrypted before it ever left my LAN ... plus, I'm just a bit nervous about having the backups encrypted, requiring working, functional software to restore/recover data from them....

      That may not seem like such a big concern, perhaps, but I am currently working my way thru decrypting a bunch of 10-15 year old TrueCrypt-ed volumes, which requires using an old, outdated version of VeraCrypt and a somewhat "cross-my-fingers" effort to find KeePass repos old enough (also outdated, KeePass 1.0 repos) to still contain the various passwords I used to encrypt those ancient volumes ... but also still use new enough master passwords that I can still get the KeePass repos unlocked.

      With rsync, I can literally just go into any backup, find the specific version of the specific file(s) I want to recover, and manually copy it back to my workspace. Is anything like that option available in any of these deduplicated/encrypted solutions, even if they're not encrypted? If (eg) a borg repo is created w/o encryption, the data is still all just borg-specific blobs, right? Or can I navigate into the repo and just manually grab files?

      Oh yeah ... for reference, the past 10-ish years, my backup routine has been to create a new, dated, destination folder, starting with a full backup of my /home folder (excluding things like Videos, Music, VMs, other bulky stuff that gets backed up separately/differently), and then running nightly diff backups into the same folder, while also maintaining a "one-day-older" second backup of the whole thing on a 2nd HDD ... then, every 3-6 months, zipping up the current backup folder and starting a new one.

      At any rate, there you go; that's the kind of stuff I'm thinking about now, as I overhaul my 20-year-old, 20TB (but could be 2TB) backup system.

      Any and all feedback, recommendations, tips are welcome. Danke.

      15 votes
    2. What steps can the average user do to secure their data privacy?

      With all of the identity verification laws in the pipeline, data breaches, and government overreach (mandated monitoring in new cars in the US), what steps can the average person take to secure...

      With all of the identity verification laws in the pipeline, data breaches, and government overreach (mandated monitoring in new cars in the US), what steps can the average person take to secure their anonymity and data and device privacy?

      I’m a tech-savvy person but nowhere near the level of a great many. It seems like in the face of overwhelming odds, making small changes is only a drop in the bucket. I have all the data encryption settings enabled on my phone, but I use services like Dropbox and rely on it heavily. I’ve always thought that if the product is free, you’re the product…but I pay for Dropbox, so they shouldn’t use my data for training AI (but they likely are). Setting up a personal cloud seems like a daunting task, as is getting involved in any of the small projects that people have going (decentralized networks, mesh…things, P2P, etc). I’ve focused more on securing my home networks recently so my Ubiquiti devices are restricted in what they can access, but I haven’t actually pen-tested my network yet. I have PopOS! installed on my home desktop because I got tired of Windows’ invasive…everything, but ultimately I don’t know what I’m doing.

      There’s probably a great many people out there that feel like it’s hopeless to try to do anything because it won’t matter as there’s such a heavy push to invade, restrict, and monetize our digital lives. What can the average person do to take control of our devices and data?

      34 votes
    3. Medium term cold storage options?

      Increasingly I'm looking at my backup solution and I'm not totally happy. My "threat model" I guess is if the house burns down and we only make it out with the shirts on our backs. Alternatively...

      Increasingly I'm looking at my backup solution and I'm not totally happy. My "threat model" I guess is if the house burns down and we only make it out with the shirts on our backs. Alternatively if I get hit by a bus I'd like a backup of passwords and maybe some instructions for my wife.

      Mostly irrelevant discussion on my current backup or lack of situation

      Up until recently I had a VPS running syncthing as a central backup for all my devices but it kind of looks like that got randomly wiped or something... my plan up until that happened was that I have a computer in a locker at work that I occasionally fired up to sync my syncthing stuff. This has some issues, the big one being that it doesn't deal with bus factor.

      My next plan (and the point of this topic) is to have some data stored offline in a safe deposit box at the bank or some other secure location and swap the data out at some interval like 6 months or 1 year. The stuff I REALLY care about is easily under 1gb and stuff I kind of care about (photos and that kind of thing) is < 1tb.

      Also currently I'm paying for iCloud each month even though I've mostly left the mac-osphere. This is where my < 1tb of photos are. I intend to download all of that and stop paying for iCloud in the coming months.

      TL;DR What are decent medium term cold storage options for < 1gb that I can be really sure will be good for several years (maybe 10 or 20 years at the extreme end) and is fairly cheap. I was thinking optical media but I'm kind of lost as to what specifically to get and how to not get conned by buying fake media (m discs). I (somewhat randomly) have an m disc drive in my computer but I don't know if thats overkill or not? My important stuff may even fit on a CD actually...

      24 votes
    4. Looking for an online spreadsheet to share with others (not Google or Microsoft)

      I figure the title is good enough, but, I just want to upload/make a spreadsheet in an .ods format so others can view it. Not edit it, not have to sign in to view, but still has sorting options or...

      I figure the title is good enough, but, I just want to upload/make a spreadsheet in an .ods format so others can view it. Not edit it, not have to sign in to view, but still has sorting options or whatnot. And in the .ods format.

      I'm seeing a few options online, but it seems more that they offer viewing but not sorting (which is a huge aspect of spreadsheets), or no importing, or doesn't support .ods.

      So I can keep searching and I'm sure something is out there, but does anyone already use a site for these requests?

      15 votes
    5. PSA: Flash storage warranties are long and legitimate (flash drives, SSDs, SD cards, etc.)

      If you have a flash drive, SSD drive (including NVMe drives), (micro)SD card, or some other popular flash memory media die on you, you might be able to get a free replacement, depending on the...

      If you have a flash drive, SSD drive (including NVMe drives), (micro)SD card, or some other popular flash memory media die on you, you might be able to get a free replacement, depending on the manufacturer and the product.

      I recently RMA'd a SanDisk microsd card that died unexpectedly. When I looked up their warranty, SanDisk has a lifetime warranty on most of their flash memory products. They even provided a return shipping label. Since they no longer make the card that died, they're sending an upgraded, currently available model.

      I've also RMA'd two Kingston NVMe drives. Both of them were getting a bit old, but the RMA was accepted, and in these instances I also received the newer version of the product. I did have to pay for return shipping myself, but it was well worth it.

      So if you're about to toss that broken flash media in the trash, double check to see if a warranty applies. It's worth the time and potential shipping cost/hassles in many cases.

      31 votes
    6. I tried ranking my albums out of five stars - I think I've gotten it wrong. Thoughts?

      TLDR/Warning: this might be a tedious read. But I'm curious if I could have gone about rating my albums better. I tend to simply either favourite an album or not. The idea of giving albums and...

      TLDR/Warning: this might be a tedious read. But I'm curious if I could have gone about rating my albums better.

      I tend to simply either favourite an album or not. The idea of giving albums and tracks marks out of five stars seems tedious, difficult to match to how I feel and just doesn't match how my head works. But my collection has grown over the decades and I've been bed bound a lot lately, so I'm trying to organise/categorise based on my feelings towards the albums rather than genre. I'm also hoping to rejuvenate my old interest in music (playing in a band and recording for a living took the shine off of casual listening for me). I thought it would be an interesting experiment to try out, so I rated songs from over 50 albums.

      I came up with a rigid and hopefully balanced definition for each rating:

      • 1 star - Dislike. I hope I never hear this song again (but I'll keep it purely because it's part of the album)
      • 2 stars - Neutral. It doesn't annoy me, but it's too generic to be interesting
      • 3 stars - Sometimes this song hits the spot.
      • 4 stars - This song usually hits the spot.
      • 5 stars - This song always hits the spot.

      Then I rate the album out of five stars based on the average of the song ratings. The result is that no albums got 5 stars, a seven got 4 stars, the vast majority of albums are rated at 3 or 2 stars. Even among the 3-star albums, some I like much more than others depending on whether they contain mostly consistent 3-star songs or half 4-star songs and half 2-star songs.

      I wonder if the lack of 5-star albums is because of the definitions I gave each of the 1-5-star possibilities. For example, I don't know if any song "always hits the spot". Or maybe it's just that I'm not as into my music as I used to be.

      Anyway, I thought maybe people interested in music and data might have thoughts on going about this differently. It's worth asking before I do the next 1,000 albums :) Maybe you'd define each of the 5 stars differently. Any takers?

      Edit: thanks to everyone for reading all this and commenting their thoughts. I have a system I'm happy with now, but always happy to continue to chat with fellow (and reluctant) pedants about this.

      13 votes
    7. Request for help: Backing up NASA public databases

      TL;DR: NASA's public Planetary Data System is at risk of being shut down. Anyone have any ideas for backing it up? Hi everyone, Bit of a long-shot here, but I wanted to try on high-quality tildes...

      TL;DR: NASA's public Planetary Data System is at risk of being shut down. Anyone have any ideas for backing it up?

      Hi everyone,

      Bit of a long-shot here, but I wanted to try on high-quality tildes before jumping back into the cesspool of reddit. I'm posting it in ~science rather than ~space as I figure interest in backing up public data is broader than just the space community.

      I work regularly with NASA's Planetary Data System, or PDS. It's a massive (~3.5petabytes!!) archive of off-world scientific data (largely but not all imaging data). PDS is integral for scientific research - public and private - around the world, and is maintained, for free, by NASA (with support of a number of Academic institutions).

      The current state of affairs for NASA is grim:

      And as a result, I (and many of my industry friends) have become increasingly concerned that PDS will be taken down as NASA is increasingly torn down for spare parts and irreparably damaged. This administration seems bent on destroying all forms of recording-keeping and public science, so who knows how long PDS will be kept up. Once it's down, it'll be a nightmare to try and collect it all again from various sources. I suspect we'll permanently lose decades worth of data - PDS includes information going all the way back to the Apollo missions!

      As such, we've been pushing to back-up as much of PDS as we can, but have absolutely no hope of downloading it all within the next year or two, nevermind in a few months if the current cuts impact us soon.

      If you or someone you know would be interested in helping figure out how we can back-up PDS before it's too late, please let me know here or in a DM. I've already tried reaching out to the Internet Archive, but did not hear anything back from them.

      Edit: to clarify, the larger problem is download speeds - we've topped out at 20mb/s with 8 connections.

      61 votes
    8. What resource should I use for how to investigate data at rest with Django?

      Finally embarking on a side-project that I will be doing with Django. One thing that I am having to consider is how to do encryption. Looking at the explanations of different levels of encryption...

      Finally embarking on a side-project that I will be doing with Django.

      One thing that I am having to consider is how to do encryption.

      Looking at the explanations of different levels of encryption here, I think data at rest is really all I need to do (although, I will probably use cloudflare tunnels which will also ensure data in transit but I just won't be implementing it myself is all).

      Now, doing data at rest, doing some research, django-cryptography comes up a lot but that hasn't been updated in forever, to point where an open issue on its repo points to a new library (django-cryptograph-5) that was made specifically cause the devs of django-cryptography seem to have abandoned it, but that same thing could happen to the new off-shoot.

      I can't tell if this means that I am looking on the wrong webpages for knowledge of how to do about this or when working in the python open-source ecosystem, there's no list of trustworthy reliable publishers of a library for data at rest encryption? like how Django REST Framework is so established, they even have sponsors now.

      6 votes
    9. Anyone know of any good way to transfer Apple Music playlists onto a hard drive?

      EDIT: As one user pointed out, this is not about Apple Music the streaming platform, this is about basically itunes but itunes no longer technically exists as an application. So a little...

      EDIT: As one user pointed out, this is not about Apple Music the streaming platform, this is about basically itunes but itunes no longer technically exists as an application.

      So a little background: my father just died and a big part of his life was listening to music, for most of his life he's been building themed compilations of songs he liked using whatever medium was available, magnetic reel tapes in the '60s and '70s, then cassette tapes, then CDs, and of course playlists for the last 20 or so years. Now my mother and I would like to back up and save a lot of that work as those compilations have a lot of sentimental value and are pretty unique. There's lots of old obscure rhythm and blues and soul songs that you aren't really going to come across anywhere else. However, it's pretty much all locked into Apple Music, which isn't really a problem in the here and now, because we all have tended to use macs since my mother adopted them in the '80s or '90s. However, we don't really want that data just locked into a private ecosystem that has been getting more and more restricted and where we have less and less control.

      So I'm looking for a way to keep those playlists intact and export them out of Apple Music in a playable format and into a less locked in system to then back them up. Most of the music should be DRM free as a lot of it would have been taken off of CDs probably as MP3 files, though a lot of that would've happened 15+ years ago.

      Does anyone have any ideas about the best way to do that? I seem to be able to manually export each one into a .txt file but of course it's not really playable sound files. My tech skills are pretty limited, I have about an average amount of knowledge or even slightly more for someone my age (30s) who grew up around computers and the internet but I grew up after it necessary to have basic coding skills to use computers so my experience doing even basic coding or running scripts is pretty much nil. Any ideas would be appreciated.

      Edit: it’s version 1.0.6.10

      18 votes