This subject takes students on a journey from basic data analysis to advanced machine learning concepts using R and XGBoost. Each week, by studying representative business examples, we discover how data shapes effective management and decision-making. The subject builds progressively on knowledge of R programming and machine learning, giving students hands-on experience through R assignments related to each week's topic. A basic understanding of statistics and prior basic programming skills in any language are required.

--------------------------------------------------------------------------------
Part I: Fundamentals of data analytics
- Importance of data
- Big data
- The process of data collection
- The process of data cleaning
--------------------------------------------------------------------------------
Part II: Human behavior
- Non-linear relationships
- Missing responses
- Biases
- Choices and value estimates
--------------------------------------------------------------------------------
Part III: Machine learning hiccups
- Overfitting and underfitting
- Correlation vs causality
- Statistical hypothesis testing
- Text analysis
Main keywords