• daq@lemmy.sdf.org
    link
    fedilink
    English
    arrow-up
    1
    ·
    8 hours ago

    Huh? I can reach my site via curl that has neither. How did you come up with this random set of requirements?

    • grysbok@lemmy.sdf.org
      link
      fedilink
      English
      arrow-up
      0
      ·
      6 hours ago

      Odd. I just tried

      curl https://www.scrapingcourse.com/cloudflare-challenge

      and got

      Enable JavaScript and cookies to continue

      I’m clearly not on the same setup as you are, but my off-the-cuff guess is that your curl command was issued from a system that cloudflare already recognized (IP whitelist, cookies, I dunno).

      Anyways, I’m reading through this blog post on using cURL with cloudflare-protected sites and I’m finding it interesting.

      • daq@lemmy.sdf.org
        link
        fedilink
        English
        arrow-up
        1
        ·
        4 hours ago

        Of course their challenge requires those things. How else could they implement it? Most users will never be presented with a challenge though and it is trivial to disable if you don’t want to ever challenge anyone. I was just saying CF blocks ML crawlers.