Data Science Resources : Différence entre versions

De Transport
m
(Machine Learning)
(19 révisions intermédiaires par le même utilisateur non affichées)
Ligne 1 : Ligne 1 :
=Books=
+
=General Books=
 
* Free books on data science http://www.learndatasci.com/free-data-science-books
 
* Free books on data science http://www.learndatasci.com/free-data-science-books
 +
* Many online books on various data science topics on https://bookdown.org/
  
 
=Code=
 
=Code=
 
* Data science using Python https://github.com/jakevdp/PythonDataScienceHandbook (see [[Programming_Resources|Programming resources]] for Python and other languages)
 
* Data science using Python https://github.com/jakevdp/PythonDataScienceHandbook (see [[Programming_Resources|Programming resources]] for Python and other languages)
* [https://github.com/jupyter/jupyter/wiki/A-gallery-of-interesting-Jupyter-Notebooks Gallery of Jupyter Notebooks on programming, data science, etc.]
+
* Python data science handbook https://jakevdp.github.io/PythonDataScienceHandbook/
 +
** [https://github.com/jupyter/jupyter/wiki Gallery of Jupyter Notebooks on programming, data science, etc.]
 
* Examples and tutorials (Jupyter notebooks) for the transportation data management course CIV8760 (in French) https://github.com/nsaunier/CIV8760/
 
* Examples and tutorials (Jupyter notebooks) for the transportation data management course CIV8760 (in French) https://github.com/nsaunier/CIV8760/
 +
 +
=Data Management=
 +
* [http://www.datacarpentry.org/lessons/ Data carpentry]
 +
* A Quick Guide to Organizing Computational Biology Projects https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1000424
 +
* Données de recherche https://guides.biblio.polymtl.ca/donneesrecherche
 +
 +
=Statistics=
 +
* Statistical Thinking for the 21st Century
 +
* Learning Statistics with R  https://learningstatisticswithr.com/
 +
* Answering questions with data https://crumplab.com/statistics/
 +
* Carnegie Mellon University free online courses: [https://oli.cmu.edu/courses/probability-statistics-open-free/ Probability & Statistics] [https://oli.cmu.edu/courses/statistical-reasoning-copy/  Statistical Reasoning]
 +
* Scientific Approaches to Transportation Research http://onlinepubs.trb.org/Onlinepubs/nchrp/cd-22/start.htm
 +
* Understanding and Communicating Multimodal Transportation Data http://web.cecs.pdx.edu/~monserec/t.data/
 +
* (French) resources
 +
** Cours MTH2302C: Probabilités et statistique, Denis Marcotte http://cours.polymtl.ca/geo/marcotte/mth2302c.html
 +
** Notes et ebooks de Ricco Rakotomalala http://eric.univ-lyon2.fr/~ricco/
 +
** Cours du master économétrie et statistique appliquée de l'Université d’Orléans https://www.univ-orleans.fr/deg/masters/ESA/CH/churlin_E.htm#_Universit%C3%A9_d%27Orl%C3%A9ans,_Master_Econom
  
 
=Machine Learning=
 
=Machine Learning=
 
* list of machine learning books http://matpalm.com/blog/cool_machine_learning_books/
 
* list of machine learning books http://matpalm.com/blog/cool_machine_learning_books/
 
** Pattern Recognition and Machine Learning by Christopher Bishop free at https://www.microsoft.com/en-us/research/uploads/prod/2006/01/Bishop-Pattern-Recognition-and-Machine-Learning-2006.pdf
 
** Pattern Recognition and Machine Learning by Christopher Bishop free at https://www.microsoft.com/en-us/research/uploads/prod/2006/01/Bishop-Pattern-Recognition-and-Machine-Learning-2006.pdf
 +
* Neural Networks: Zero to Hero https://karpathy.ai/zero-to-hero.html https://github.com/karpathy/nn-zero-to-hero (NanoGPT https://github.com/karpathy/nanoGPT)
 
* [http://profs.polymtl.ca/jagoulet/Site/Goulet_web_page_BOOK.html Probabilistic Machine Learning for Civil Engineers, James Goulet]
 
* [http://profs.polymtl.ca/jagoulet/Site/Goulet_web_page_BOOK.html Probabilistic Machine Learning for Civil Engineers, James Goulet]
 
* [https://www.microsoft.com/en-us/research/publication/fourth-paradigm-data-intensive-scientific-discovery/ Hey, T. Tansley, S. & Tolle, K. (Eds.) The Fourth Paradigm: Data-Intensive Scientific Discovery Microsoft Research, 2009]
 
* [https://www.microsoft.com/en-us/research/publication/fourth-paradigm-data-intensive-scientific-discovery/ Hey, T. Tansley, S. & Tolle, K. (Eds.) The Fourth Paradigm: Data-Intensive Scientific Discovery Microsoft Research, 2009]
Ligne 15 : Ligne 35 :
 
* (French) reference book: Cornuéjols, A.; Miclet, L. & Kodratoff, Y. Apprentissage Artificiel Eyrolles, 2002
 
* (French) reference book: Cornuéjols, A.; Miclet, L. & Kodratoff, Y. Apprentissage Artificiel Eyrolles, 2002
 
* (French) [http://eric.univ-lyon2.fr/~ricco/cours/supports_data_mining.html Notes pour les cours de data mining de Ricco Rakotomalala]
 
* (French) [http://eric.univ-lyon2.fr/~ricco/cours/supports_data_mining.html Notes pour les cours de data mining de Ricco Rakotomalala]
 +
** [http://eric.univ-lyon2.fr/~ricco/tanagra/ Tanagra by the same author]
 
* MOOC by IVADO on deep learning https://cours.edulib.org/courses/course-v1:IVADO+IA-101+P2018/
 
* MOOC by IVADO on deep learning https://cours.edulib.org/courses/course-v1:IVADO+IA-101+P2018/
 +
* Finnish MOOC https://buildingai.elementsofai.com/
 +
* Software
 +
** [http://www.cs.waikato.ac.nz/ml/weka/ Weka]
 +
** [http://gretl.sourceforge.net/ Gretl (econometrics)]
  
 
=Data Visualization=
 
=Data Visualization=
Ligne 24 : Ligne 49 :
 
** [http://hcil.umd.edu/ Human-Computer Interaction Lab] and [https://www.cattlab.umd.edu/ Center for Advanced Transportation Technology Laboratory], University of Maryland
 
** [http://hcil.umd.edu/ Human-Computer Interaction Lab] and [https://www.cattlab.umd.edu/ Center for Advanced Transportation Technology Laboratory], University of Maryland
 
** TRB [https://www.teachamerica.com/viz/viz2006.html 5th], [http://teachamerica.com/VIZ11/index.html 6th], [https://teachamerica.com/VIZ13/index.html 7th], [http://viz17.businesscatalyst.com/videos.html 8th] and [http://www.cvent.com/events/9th-international-visualization-in-transportation-symposium-a-better-view/event-summary-aa788f12e69f4c5c83936e5c800b1152.aspx 9th] International Visualization in Transportation Symposium and Workshop
 
** TRB [https://www.teachamerica.com/viz/viz2006.html 5th], [http://teachamerica.com/VIZ11/index.html 6th], [https://teachamerica.com/VIZ13/index.html 7th], [http://viz17.businesscatalyst.com/videos.html 8th] and [http://www.cvent.com/events/9th-international-visualization-in-transportation-symposium-a-better-view/event-summary-aa788f12e69f4c5c83936e5c800b1152.aspx 9th] International Visualization in Transportation Symposium and Workshop
** Professorts [http://www.professeurs.polymtl.ca/thomas.hurtut/\#dataviz Thomas Hurtut (Polytechnique)], [http://perso.telecom-paristech.fr/~elc/ Éric Lecolinet (Télécom ParisTech)]
+
** Professors [http://www.professeurs.polymtl.ca/thomas.hurtut/\#dataviz Thomas Hurtut (Polytechnique)], [http://perso.telecom-paristech.fr/~elc/ Éric Lecolinet (Télécom ParisTech)]
 
* Courses  
 
* Courses  
 
** [https://www.graphics.stanford.edu/courses/cs448b-04-winter/ Data Visualization (CS448b)], Stanford
 
** [https://www.graphics.stanford.edu/courses/cs448b-04-winter/ Data Visualization (CS448b)], Stanford
 
** [https://www.cs171.org/ CS117], Hanspeter Pfister, Harvard
 
** [https://www.cs171.org/ CS117], Hanspeter Pfister, Harvard
 
* Videos: [https://www.youtube.com/watch?v=AdSZJzb-aX8 The Art of Data Visualization | PBS Digital Studios], [https://www.youtube.com/watch?v=-xS7QJhVbcM Harvard i-lab | Data Visualization for Non-Programmers], [https://www.youtube.com/watch?v=aT4JvF7sglg Mike Bostock (D3js) - Keynote], [https://www.youtube.com/watch?v=R-oiKt7bUU8 Designing Data Visualizations with Noah Iliinsky]
 
* Videos: [https://www.youtube.com/watch?v=AdSZJzb-aX8 The Art of Data Visualization | PBS Digital Studios], [https://www.youtube.com/watch?v=-xS7QJhVbcM Harvard i-lab | Data Visualization for Non-Programmers], [https://www.youtube.com/watch?v=aT4JvF7sglg Mike Bostock (D3js) - Keynote], [https://www.youtube.com/watch?v=R-oiKt7bUU8 Designing Data Visualizations with Noah Iliinsky]
* Libraries
+
* Libraries / tools
 
** Python: [https://matplotlib.org/ matplotlib], [https://seaborn.pydata.org/ seaborn]
 
** Python: [https://matplotlib.org/ matplotlib], [https://seaborn.pydata.org/ seaborn]
 
** R: [https://ggplot2.tidyverse.org/ ggplot2]
 
** R: [https://ggplot2.tidyverse.org/ ggplot2]
 
** Javascript: [https://d3js.org/ D3.js]
 
** Javascript: [https://d3js.org/ D3.js]
 +
** Old: [http://www.gnuplot.info gnuplot]
 +
 +
=Time Series=
 +
* Forecasting: Principles and Practice (2nd ed) https://otexts.com/fpp2/
  
 
=Spatial Data=
 
=Spatial Data=
 
* Introduction to Geospatial Concepts https://datacarpentry.org/organization-geospatial/
 
* Introduction to Geospatial Concepts https://datacarpentry.org/organization-geospatial/
* A Gentle Introduction to GIS https://docs.qgis.org/3.10/en/docs/gentle_gis_introduction/
+
* QGIS documentation: [https://docs.qgis.org/3.22/fr/docs/index.html français], [https://docs.qgis.org/3.22/en/docs/index.html english]
* QGIS how-to: [https://www.qgistutorials.com/en/docs/3/creating_heatmaps.html heatmaps], [https://www.giscourse.com/how-to-add-openstreetmap-basemaps-in-qgis-3-0/|add OSM layer]
+
** A Gentle Introduction to GIS https://docs.qgis.org/latest/en/docs/gentle_gis_introduction/index.html
 +
** QGIS how-to: [https://www.qgistutorials.com/en/docs/3/creating_heatmaps.html heatmaps], [https://www.giscourse.com/how-to-add-openstreetmap-basemaps-in-qgis-3-0/|add OSM layer]
 
* SpatiaLite cookbook http://www.gaia-gis.it/gaia-sins/spatialite-cookbook-5/index.html
 
* SpatiaLite cookbook http://www.gaia-gis.it/gaia-sins/spatialite-cookbook-5/index.html
 
* (French) books from EPFL: Systèmes d'Information Géographique [https://www.researchgate.net/publication/320979981_Systemes_d%27Information_Geographique_1 Partie 1] et [https://www.researchgate.net/publication/320980079_Systemes_d%27Information_Geographique_2 Partie 2]
 
* (French) books from EPFL: Systèmes d'Information Géographique [https://www.researchgate.net/publication/320979981_Systemes_d%27Information_Geographique_1 Partie 1] et [https://www.researchgate.net/publication/320980079_Systemes_d%27Information_Geographique_2 Partie 2]
Ligne 43 : Ligne 73 :
  
 
=Spatial Analysis=
 
=Spatial Analysis=
 +
* [https://ipeagit.github.io/intro_access_book/en/index.en.html Introduction to urban accessibility, a practical guide with R]
 
* [https://paezha.github.io/applied_spatial_statistics/ Applied Spatial Statistics with R], Prof. Antonio Paez
 
* [https://paezha.github.io/applied_spatial_statistics/ Applied Spatial Statistics with R], Prof. Antonio Paez
 +
* [https://mgimond.github.io/Spatial/index.html Intro to GIS and Spatial Analysis]
 
* [https://pysal.org Python Spatial Analysis Library (PySAL)]
 
* [https://pysal.org Python Spatial Analysis Library (PySAL)]
 
* [https://geodacenter.github.io/documentation.html Documentation de GeoDa]
 
* [https://geodacenter.github.io/documentation.html Documentation de GeoDa]
 
+
* [https://www.spatialanalysisonline.com/ Geospatial Analysis - A comprehensive guide]
=Statistics=
+
* Carnegie Mellon University free online courses: [https://oli.cmu.edu/courses/probability-statistics-open-free/ Probability & Statistics] [https://oli.cmu.edu/courses/statistical-reasoning-copy/  Statistical Reasoning]
+
* Scientific Approaches to Transportation Research http://onlinepubs.trb.org/Onlinepubs/nchrp/cd-22/start.htm
+
* Understanding and Communicating Multimodal Transportation Data http://web.cecs.pdx.edu/~monserec/t.data/
+
* (French) resources
+
** Cours MTH2302C: Probabilités et statistique, Denis Marcotte http://cours.polymtl.ca/geo/marcotte/mth2302c.html
+
** Notes et ebooks de Ricco Rakotomalala http://eric.univ-lyon2.fr/~ricco/
+
** Cours du master économétrie et statistique appliquée de l'Université d’Orléans https://www.univ-orleans.fr/deg/masters/ESA/CH/churlin_E.htm#_Universit%C3%A9_d%27Orl%C3%A9ans,_Master_Econom
+

Version du 31 mai 2023 à 16:12

General Books

Code

Data Management

Statistics

Machine Learning

Data Visualization

Time Series

Spatial Data

Spatial Analysis