Resources
Datasets
Data Repository
- Kaggle
- VAST Challenges
- UCI Machine Learning Repository
- Data is Plural (curated by Jeremy Singer-Vine; github).
- Data Commons
- Awesome Public Datasets
- Google Cloud Public Datasets
- Tableau Sample Data
- FiveThirtyEight data
From Authorities
- Data.gov
- Census.gov
- NYC OpenData
- CDC Data (Disease Control and Prevention)
- World Bank Catalog
- IPEDS data
- Bureau of Labor Statistics data
- data.gov.hk
- Office for National Statistics (UK)
- List of Historical Ballot Measures in SF
Specialized
- Public APIs
- Big Graph Data Sets
- Stanford Large Network Dataset Collection
- MalNet graphs (300GB) and MalNet images (80GB)
- DiffusionDB: A large-scale text-to-image prompt gallery dataset based on Stable Diffusion
- nlp-datasets
- Registry of Open Data on AWS
- Recommendation and Ratings Public Data
- Healthcare Datasets from NIH grant
- Outlier Detection Datasets (ODDS)
- Time Series (anomaly detection + classification)
- Energy and Climate Data
- Spatio-temporal (e.g., trajectory)
- NYC Taxi Trip Records
- SafeGraph (POI, Movements)
- Drone, Porto Taxi, MS T-Drive, MS GPS Traj,
- Bird Movement (Osprey, White-fronted Goose, IPT, e-birds, avian radar)
- Movebank
- Euring
- Seamap (e.g., BOEM Aerial Survey y1, y2)
- Global Biodiversity Information Facility
Others: Uber, Yelp, Zillow, IMDB, DBLP, Wikipedia, Social Network Apps
Visualization Tools
Sketching the Prototypes
- Adobe Illustrator - A powerful vector graphics software for creating complex illustrations, logos, and graphic designs.
- Figma - A free and collaborative interface design tool for making vector graphics.
Visualization Programming Toolkits
- D3.js - A popular JavaScript library for building interactive web visualizations.
- Plotly - A visualization library to make mostly common charts; it supports both Python and Javascript.
- Streamlit - A Python library to build interactive web visualizations without too much front-end skills.
- Matplotlib and Seaborn - They are both traditional Python libraries to plot visualizations
Visualization Authoring Interfaces and Software
- Tableau Public - Free version of Tableau for publishing visualizations on the web
- Tableau for Students - Free license for students using the desktop version of Tableau
- PowerBI - Microsoft's service for creating dashboards and data analysis without coding
Color Tools
- Color Brewer - Generate color palettes for maps.
- Paletton - Generate color palettes based on the color wheel
- 0 to 255 - Generate color palettes based on a given color
- Flat UI Colors - With a list of presets of different themes
- Adobe Color Palette Generator - A comprehensive color tool with several good functions
- D3-scale-chromatic
Web Development
Web Basics
Git & GitHub
About Curiosity:
"The important thing is not to stop questioning. Curiosity has its own reason for existing. One cannot help but be in awe when he contemplates the mysteries of eternity, of life, of the marvelous structure of reality. It is enough if one tries merely to comprehend a little of this mystery every day."
— Albert Einstein