Datasets for Data Analytics and Visualization
This page collects reliable data sources for visualization and analysis projects. Feel free to use this list as a jumping-off point for homework and course projects! It is by no means exhaustive; many other great data sources are available on the internet. If you’d like to suggest an addition to this page, you can do so with this form.
Repositories
Google Dataset Search
Responsible Datasets in Context
UCI Machine Learning Repository
Data is Plural
DataHub
Awesome Public Datasets
Data for existing articles
FiveThirtyEight
New York Times
Washington Post
ProPublica
CityLab
Society
Stanford open policing dataset
Zillow house prices
Gun Violence Archive
Gapminder data
Our World in Data
World Inequality Database
World Bank Open Data
Emerson College Polling
U.S. Economy
Federal Reserve data
Bureau of Economic Analysis
Congressional Budget Office
Campaign Finance Data
Bureau of Labor Statistics
Other economic data resources
City/State data (housing, health, transportation, etc.)
California Open Data
NYC Open Data
NYS Open Data
LA Open Data
US Census Bureau
Public health
CDC Data Portal
Sports and games
Data sources for video games
Lahman Baseball database
Kaggle NBA Database
Sports Reference (paid query access)
FanGraphs (paid)
Climate
NOAA
Google Earth Engine
Transportation
NHTSA Crash Data
Bureau of Transportation Statistics
Entertainment
IMDb data
Media APIs
https://developer.spotify.com/documentation/web-api
https://developers.google.com/youtube/v3/docs
https://developer.apple.com/documentation/applemusicapi