
- Contenu:
- Data integrity is the new focal point of the data science revolution. Now that everybody is onboard with the role of data in people''s lives and business, it''s not an unfair question to ask, "Can you prove that your data is accurate?" In this course, you can learn how to identify and address many of the data integrity issues facing modern data scientists, using R and the tidyverse. Discover how to handle missing values and duplicated data. Find out how to convert data between different units and tackle poorly formatted text. Plus, learn how to detect outliers, address structural issues, and identify red flags that indicate potential data quality issues. Where possible, instructor Mike Chapple shows how to correct the issues using R, but the same principles can be applied to any statistical programing language.
Informations sur le titre
Titre: Cleaning Bad Data in R
Auteur: Chapple, Mike
Éditeur: LinkedIn
Catégorie: eLearning, Software & Programmieren, IT
3473
Exemplaires
3473
Disponible
0
Réservation
Durée du prêt: 180 jours