Irizarry, Rafael A.,

Introduction to data science : data analysis and prediction algorithms with R / Rafael A. Irizarry. - xxx, 713 pages : illustrations ; 26 cm

Installing R and RStudio -- Getting started with R and RStudio -- R Basics -- Programming basics -- The tidyverse -- Importing data -- Introduction to data visualization -- ggplot2 -- Visualizing data distributions -- Data visualization in practice -- Data visualization principles -- Robust summaries -- Introduction to statistics with R -- Probability -- Random variables -- Statistical inference -- Statistical models -- Regression -- Linear models -- Association is not causation -- Introduction to data wrangling -- Reshaping data -- Joining tables -- Web scraping -- String processing -- Parsing dates and times -- Text mining -- Introduction to machine learning -- Smoothing -- Cross validation -- The caret package -- Examples of algorithms -- Machine learning in practice -- Large datasets -- Clustering -- Introduction to productivty tools -- Accessing the terminal and installing Git -- Organizing with Unix -- Git and GitHub -- Reproducible projects with RStudio and R markdown.

'The book begins by going over the basics of R and the tidyverse. You learn R throughout the book, but in the first part we go over the building blocks needed to keep learning during the rest of the book'--

9780367357986 RM419.19 (PTSL)


R (Computer program language)
Information visualization.
Data mining.
Statistics--Data processing.
Probabilities--Data processing.
Computer algorithms.
Quantitative research.