Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Sanskari Proxy

Nemo
October 13, 2021

Sanskari Proxy

A short presentation on the Sanskari Proxy project.

Nemo

October 13, 2021
Tweet

More Decks by Nemo

Other Decks in Research

Transcript

  1. The Problem Many Indian government websites are geo-blocked to be

    inaccessible outside India. DEEP WEB Impacts: Researchers, Critics, Citizens, Expats, Archivists, Travellers, Search Engines, Crawlers, and more.
  2. The Problem Many Indian government websites are geo-blocked to be

    inaccessible outside India. DEEP WEB Impacts: Researchers, Critics, Citizens, Expats, Archivists, Travellers, Search Engines, Crawlers, and more. Q: Is this censorship?
  3. The Idea Run a custom proxy for government websites that

    gets indexed and crawled by search engines, archivists, and is accessible to researchers and users outside India. 1. Make selfregistration.cowin.gov.in accessible at selfregistration.cowin.gov.in.sanskariproxy.in 2. (Optionally) Overwrite the robots.txt file to ensure everything gets archived/cached.
  4. The (Legal) Challenge Running an open proxy makes me legally

    liable for all requests under the IT Act (Intermediary Rules). Ref: - https://www.medianama.com/2021/02/223-summary-internet-intermediary-liability-2021/
  5. The (Technical) Challenge I made a list of all Government

    of India websites (~12k Domains), from multiple sources: - Censys API - Certificate Transparency Logs (crt.sh) - GOIDirectory.nic.in List: git.io/JrjcV
  6. The Compromise Run a simple authenticated proxy only accessible to

    trusted researchers and users. Pro: - Limited legal liability. - Trusted users only - Better than shady VPNs Cons: - Still not accessible to search engines
  7. Future Research 1. Find the extent of GeoBlocking. 2. Get

    (some) geoblocked websites indexed/archived legally.