Google has removed dozens of new Sci-Hub domain names from its search results in the United States. Unlike typical DMCA takedowns, the removals were triggered by a dated court order that was not enforced for several years. This appears to be one of the first times Google has deindexed an entire pirate site in the U.S. based on a ‘site blocking’ style injunction.

  • chocrates@piefed.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    19 hours ago

    It’s possible. Search engines are just big reference databases.
    They have crawlers that search the web based on links to each other and then save metadata about the pages.

    There are some projects already that you can use.

    The problem is the data, if everyone of us have to build it ourselves it’s going to be tedious, and more importantly biased to however you are scraping.