- cross-posted to:
- datahoarder@lemmy.ml
- technology@lemmy.world
- cross-posted to:
- datahoarder@lemmy.ml
- technology@lemmy.world
We backed up Spotify (metadata and music files). It’s distributed in bulk torrents (~300TB), grouped by popularity.
This release includes the largest publicly available music metadata database with 256 million tracks and 186 million unique ISRCs.
It’s the world’s first “preservation archive” for music which is fully open (meaning it can easily be mirrored by anyone with enough disk space), with 86 million music files, representing around 99.6% of listens.



A RAID6 of 24 * 20TB drives could contain that with both parity and hotswap, with room to spare. Let’s say $400 per refurb drive, $2500 rackmount SAS enclosure, $2000 SAS RAID card, $14,100 total. Assuming you already have the server and power and SAS cables.
You could budget this way down. I run 10+2 12TB with Unraid. No reason for a raid card if it’s for archive and personal use.
Oh totally, could do SATA instead of SAS too. I used to build out servers and render farms for motion graphics studios that needed the ability for multiple people to be doing high-bandwidth operations on the same network drive, and the above was just kind of the default offering.
100% this. People who store easily replaceable media on RAID are just throwing away money (unless you have a need for faster read/write). If it’s your family photos, copy of your in progress thesis, or other irreplaceable piece of info/content go for it.
I have like 40tb Unraid NAS and I get asked pretty much every time I talk to someone about it how I do backups. Easy, I backup my *arr stack databases and in case of a failure I restore them and let it pull down everything over time. Which I have done in the past when I wanted to upgrade quality, easier for me to scrub it all and start over than make upgrade profiles and such.
Or that’s what I would have done, now I mostly use DebridService du jour and Stremio :-)
Nice sounds similar to what I do. I bought Unraid because everyone always suggested it. No regrets. It really makes things simple and have only had minor problems (usually related to docker). I had a HDD fail and recovery was easy (besides the 18 hours of rebuild). But the drive is emulated during repair so no real downtime.
Family photos I use cloud storage for. I could probably get away with just doing local because I do hate that I’m sure they are being used to train AI. But I’d have to do off-site storage somewhere anyway. Parity drives don’t really do well in the “fire” scenario. Don’t want to have any risk of losing photos of my daughter growing up.