Yes, scraping public data in India is generally permissible, provided it complies with privacy, copyright, and sector-specific regulations. Courts have recognized public data as non-proprietary, but unauthorized commercial exploitation or misuse of personal information violates the Information Technology Act, 2000, and the Personal Data Protection Act (PDPA), 2023 (effective 2026). The Reserve Bank of India (RBI) and Securities and Exchange Board of India (SEBI) also impose sectoral restrictions on financial data scraping.
Key Regulations for Scraping Public Data in India
- Data Privacy Laws: The PDPA (2023) prohibits scraping personal data without explicit consent or a lawful basis, imposing penalties up to ₹250 crore for violations. Publicly available data does not automatically grant exemption if it contains identifiable personal information.
- Copyright and Database Rights: The Copyright Act, 1957, protects structured databases (e.g., court judgments, government compilations) from unauthorized extraction. Scraping such data for commercial purposes may constitute infringement under Section 2(o).
- Sectoral Restrictions: RBI’s Master Direction on IT Governance (2023) and SEBI’s Data Access Policy (2024) restrict automated scraping of financial market data to licensed entities. Unauthorized scraping of such data may trigger enforcement actions under the Securities Contracts (Regulation) Act, 1956.
Critical Considerations:
- Automated Tools: Use of bots or scrapers must align with the Intermediary Guidelines and Digital Media Ethics Code (2021), which require transparency in data collection practices.
- Government Data: The Right to Information Act, 2005 grants access to government-held data, but scraping bulk datasets without approval may violate the Government Open Data License (2023).
- Jurisdictional Risks: Foreign entities must comply with the Foreign Exchange Management Act, 1999 if data is transferred outside India, as per RBI’s Data Localization Guidelines (2018).