Related:

Major cyber attack could cost the world $3.5 trillion - Power Grid, Internet Outage

The one database/file/zip to save humanity, what is it?

Show Lemmy the downloadable URL of a Database or AI you know of so we can have a local backup copy that will improve the resilience and availability of Human Knowledge.

Given the state of AI being Corporatized I think we could definitely use links for whatever comes closest to a fully usable Open Source, fully self-contained downloadable AI.

Starter Pack:

★ Lemmy List

Databases

AI

  • sir_reginald@lemmy.world
    link
    fedilink
    English
    arrow-up
    30
    ·
    edit-2
    9 months ago

    This is too much catastrophism for my taste, but If I wanted to start archiving, I’ll start by downloading Wikipedia, The Library Genesis and the Gutenberg Project.

    Videos are too heavy to archive with ease, and they are probably of much less value of actual knowledge.

    • 𝒍𝒆𝒎𝒂𝒏𝒏@lemmy.one
      link
      fedilink
      English
      arrow-up
      9
      ·
      9 months ago

      Haven’t heard about the Gutenberg project before, seems pretty neat!

      I’d probably add repair.wiki to a list of things I’d archive, although some of that content is picture heavy so not as easily compressible as Wikipedia

      There was a project that allows you to download wikipedia and some other online resources into an easy to search & navigate UI, think it was called Kiwi something but can’t remember. It was targeted at regions with poor internet coverage

      • SeaJ@lemm.ee
        link
        fedilink
        English
        arrow-up
        4
        ·
        8 months ago

        Project Gutenberg has been a thing for a couple decades. I think they are starting to also create free audiobooks from books they have in their collection. There is an TTS AI service that I checked out a week ago (play.ht)and that does voicing very realistically from the text that I gave it and I might spring spend $40 for a month of that service and build some audiobooks. The paid version gives access to more voices and will do 1 million characters of text a year.

        Or if anyone knows a good open source online alternative, I’m all ears. I’d prefer to go that route but did not give anything that was a very good solution.

    • fubo@lemmy.world
      link
      fedilink
      English
      arrow-up
      8
      ·
      9 months ago

      Humanity has been using writing for millennia. It’s a proven technology. Photographs and video don’t tend to last longer than the one institution or family that cares about them.

      • fiat_lux@kbin.social
        link
        fedilink
        arrow-up
        2
        ·
        9 months ago

        Mostly due to previous physical constraints, I would argue. Thankfully there are fewer chances your hard drive is going to decompose into vinegar while sitting in your cupboard, and even if it does, it’s likely not the only copy.

        They’re also more limited for current data because they’re harder to parse and convert into other usable formats, but thankfully that will get better over time too.

        I still preference text-first data for various reasons, but let’s not dismiss the leagues of potential video has for communication and archival value, both intentional and unintentional.

    • fiat_lux@kbin.social
      link
      fedilink
      arrow-up
      7
      ·
      9 months ago

      Perhaps think of it more as knowledge decentralization as a form of resiliency for unplanned network outages. Sometimes the library of Alexandria just happens to catch fire, and it might be nobody’s fault at all.

      Besides, plenty of people grew up in families with a basic encyclopaedia or dictionary or a repair manual. This is essentially the same thing, just with less paper.

  • Elias Griffin@lemmy.worldOP
    link
    fedilink
    English
    arrow-up
    8
    ·
    9 months ago

    I’m particulary looking for anyone that already has a collection of Arxiv and Sci-Hub papers. Please curate your collection and make it available here!

    We also need a hashtag/topic/keyword for this project that is brief and catchy we can also use for a GitHub search, etc. Anyone?