We are expanding our data services to offer a growing library of downloadable datasets in CSV and plain text formats. These files are built from our own data collection pipelines and are designed for direct use in analysis, automation, and development workflows.
What to Expect
The upcoming data files will cover areas where we already have deep collection infrastructure:
- Proxy lists — validated and categorized proxy servers with speed, anonymity level, and geolocation data
- Domain lists — curated collections of domains by industry, technology stack, or geographic region
- Keyword datasets — harvested keyword lists from search autocomplete and related search sources, organized by topic and language
- Price data — historical and snapshot cryptocurrency pricing from multiple exchanges in standardized OHLCV format
- URL collections — categorized URL lists for testing, research, and benchmarking web tools
File Formats
All datasets will be published in two standard formats that most data tools can read without conversion:
- CSV — comma-separated files with headers, ready for import into spreadsheets, pandas, databases, or any data processing pipeline
- Plain text — one entry per line, ideal for scripting, grep, and command-line workflows
Files will include documentation headers describing the schema, collection date, and source methodology.
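As a rough illustration of how such files might be consumed, the sketch below separates the documentation header from the data and then parses each format. The exact header layout has not been published, so this assumes header lines are prefixed with `#`. The function names, column names, and sample values are hypothetical and exist only for the example.

```python
import csv
import io


def split_header(text, marker="#"):
    """Split leading documentation lines from the data body.

    Assumption: documentation lines start with `marker` and appear
    only at the top of the file. The real layout may differ.
    """
    lines = text.splitlines()
    doc = []
    i = 0
    while i < len(lines) and lines[i].startswith(marker):
        doc.append(lines[i][len(marker):].strip())
        i += 1
    return doc, "\n".join(lines[i:])


def read_csv_dataset(text):
    """Return (documentation lines, list of row dicts) for a CSV file."""
    doc, body = split_header(text)
    rows = list(csv.DictReader(io.StringIO(body)))
    return doc, rows


def read_text_dataset(text):
    """Return (documentation lines, entries) for a one-entry-per-line file."""
    doc, body = split_header(text)
    entries = [line.strip() for line in body.splitlines() if line.strip()]
    return doc, entries


# Made-up sample content; addresses come from the documentation range 192.0.2.0/24.
sample_csv = """# collected: 2024-01-01
# schema: host,port,anonymity
host,port,anonymity
192.0.2.10,8080,elite
192.0.2.11,3128,transparent
"""

sample_txt = """# collected: 2024-01-01
example.com
example.org
"""

csv_doc, csv_rows = read_csv_dataset(sample_csv)
txt_doc, txt_entries = read_text_dataset(sample_txt)
print(csv_doc, csv_rows[0]["host"], txt_entries)
```

If the published files use a different header convention, only `split_header` would need to change. With pandas, passing `comment="#"` to `read_csv` skips such lines in a similar way.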
Who This Is For
These datasets are useful for developers building web tools, SEO professionals running analysis at scale, researchers studying web trends, and anyone who needs clean, structured data without building their own scraping infrastructure.
New files will be published regularly as our collection pipelines produce fresh data. Subscribe to our updates or check back for new releases.