RAW RANKED SITES ABOUT
#DATASETS

The most comprehensive list of datasets websites last updated on Sep 1 2023.
Stats collected from various trackers included with free apps.
1
Growjo - The Fastest Growing Companies in 2020 Indexing and predicting the world’s fastest growing companies in 2020 in terms of number of employees, estimated revenue, etc
2
Open Government Data (OGD) Platform India Open Government Data Platform (OGD) India is a single-point of access to Datasets/Apps in open format published by Ministries/Departments. Details of Events, Visualizations, Blogs, infographs.
3
Places Data & Foot-Traffic Insights | SafeGraph.com SafeGraph's Points-of-Interest (POI) data, geofences, business listings, & foot-traffic data empowers firms to do better geolocation, marketing attribution, retail analytics, & location intelligence. Our mission is to power innovation through open access to geospatial datasets.
4
Track your SERP keywords and website statistics with 3rd party authoritative datasets and social signals or run a site audit to inspect your site''s SEO readiness and load performance to dominate your competitors.
6
iLovePhD | Love the Process of Getting PhD ilovephd provides free Scopus Indexed journals list, Medical image datasets, IoT security Thesis,and learn research methodology. Get jobs and scholarships details for PhD and PDF positions.
7
Welcome - Home - DataHub - Frictionless Data Find, Share and Publish Quality Data Online. The fastest way for individuals, teams and organizations to publish, deploy and share their data.
10
Places Data & Foot-Traffic Insights | SafeGraph.com SafeGraph's Points-of-Interest (POI) data, geofences, business listings, & foot-traffic data empowers firms to do better geolocation, marketing attribution, retail analytics, & location intelligence. Our mission is to power innovation through open access to geospatial datasets.
11
Academic Torrents We''ve designed a distributed system for sharing enormous datasets - for researchers, by researchers. The result is a scalable, secure, and fault-tolerant repository for data, with blazing fast download speeds.
12
The One Generator Generate everything from random numbers, dates, documents, names and addresses to full JSON, HTML, XML and SQL datasets.
13
John Snow Labs | NLP & AI in Healthcare John Snow Labs is an award-winning AI company that helps healthcare and life science organizations put AI to work faster. We provide a high-compliance AI platform, state-of-the-art NLP libraries, and a data market with over 2,000 expert-curated datasets.
14
DPhi - Democratizing Data Science Learning Learn Data Science for free through courses, practice on real-world datasets and discuss with leading domain experts and data science enthusiasts.
17
Machine Learning Training Data for AI | Object Detection Datasets - Cogito Cogito provides best quality training data sets for machine learning and AI. Get deep learning training data for object detection with data enrichment services.
18
Home - Global Gene Corp Our vision is to bring the benefits of precision healthcare to everyone, no matter where they live or where they come from.We founded Global Gene Corp to address what we believe is the most important challenge to delivering effective precision healthcare to a global market: the lack of diversity in the genomic datasets used to inform the development of novel diagnostics and therapies.
19
Tracking Progress in Natural Language Processing | NLP-progress Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
20
Best-in-Class Data Labeling Platform | Playment Build high-quality ground truth datasets with ML-assisted tools, sophisticated project management software, expert human workforce, and much more.
21
BigML.com Machine Learning made beautifully simple for everyone. Take your business to the next level with the leading Machine Learning platform.
22
Dataset Storage and Dataset Search Platform | IEEE DataPort Easily store and access hundreds of datasets, including big data datasets, through IEEE's dataset storage and dataset search platform, DataPort.
23
Home - DataMed | bioCADDIE Data Discovery Index DataMed is a prototype biomedical data search engine. Its goal is to discover data sets across data repositories or data aggregators. In the future it will allow searching outside these boundaries. DataMed supports the NIH-endorsed FAIR principles of Findability, Accessibility, Interoperability and Reusability of datasets with current functionality assisting in finding datasets and providing access information about them.
24
Mockaroo - Random Data Generator and API Mocking Tool | JSON / CSV / SQL / Excel A free test data generator and API mocking tool - Mockaroo lets you create custom CSV, JSON, SQL, and Excel datasets to test and demo your software.
25
OmicsDI: Home Omics Discovery Index is an integrated and open source platform facilitating the access and dissemination of omics datasets. It provides a unique infrastructure to integrate datasets coming from multiple omics studies, including at present proteomics, genomics, transcriptomics and metabolomics.OmicsDI stores metadata coming from the public datasets from every resource using an efficient indexing system, which is able to integrate different biological entities including genes, proteins and metabolites with the relevant life science literature. OmicsDI is updated daily, as new datasets get publicly available in the contributing repositories.
26
Open Data on AWS Sharing data in the cloud lets data users spend more time on data analysis rather than data acquisition. Browse available data and learn how to register your own datasets.
27
Kaiko - Digital Assets Data Provider Kaiko provides real-time and historical cryptocurrency trade data, order books, and aggregated prices through a cryptocurrency API, downloadable CSV files, and a livestream WebSocket. We cover 100+ crypto exchanges and 35,000+ trading pairs for Bitcoin, Ethereum, and altcoins.
28
Dimensions | The Next Evolution in Linked Scholarly Information From idea to impact: Navigate the links between grants, publications, datasets, clinical trials, patents, and policy documents.
29
Machine Learning Datasets - Wavefront Technologies Wavefront Technologies a software company in Coimbatore deliver high-quality Machine Learning(ML) Datasets for projects ( Artificial intelligence) in the USA, UK, and India.