10 Free Data Sources for Data Science in 2024

Kaggle Datasets: Kaggle is a platform for data scientists and machine learning enthusiasts to share data and compete in machine learning competitions. The Kaggle Datasets section contains a vast collection of datasets on a variety of topics, including finance, healthcare, and social science.

UCI Machine Learning Repository: The UCI Machine Learning Repository is a well-known repository of datasets for machine learning tasks. The datasets in the UCI repository are often used in benchmark studies and research papers.

Google Dataset Search: Google Dataset Search is a tool that helps you find datasets from a variety of sources, including government websites, academic repositories, and private companies.

World Bank Open Data: The World Bank Open Data website provides access to a wide range of economic and social development data from around the world.

Government Open Data Portals: Many governments around the world have open data portals that provide access to data on a variety of topics, such as transportation, education, and environment.

Quandl: Quandl is a platform for financial data. It provides access to a variety of financial datasets, including stock prices, economic indicators, and exchange rates.

OpenFDA: OpenFDA is a platform for the U.S. Food and Drug Administration's (FDA) drug and device data. It provides access to a variety of datasets, including adverse event reports, product recalls, and medical device listings.

Data.gov: Data.gov is a website that provides access to a variety of U.S. government datasets. It includes datasets from a variety of agencies, including the Department of Labor, the Department of Health and Human Services, and the National Aeronautics and Space Administration (NASA).

Amazon Web Services (AWS) Public Datasets: AWS provides access to a variety of public datasets on its cloud platform. The datasets include data on a variety of topics, such as weather, climate, and demographics.

Microsoft Azure Open Datasets: Microsoft Azure also provides access to a variety of public datasets on its cloud platform. The datasets include data on a variety of topics, such as customer behavior, social media, and healthcare.