Machine Learning – I

Paper Code: 
MBB 227
Credits: 
4
Contact Hours: 
90.00
Max. Marks: 
100.00
Objective: 

Course Outcomes (COs):

Course outcomes

Learning and teaching strategies

Assessment strategies

On completion of this course, the students will be able to:

 

CO 83. Formulate a problem for business analytics.

CO 84. Install Python and the Orange tool for implementing machine learning on business problems.

CO 85. Prepare the dataset for computation after collecting it from a business-domain data source.

CO 86. Select a suitable machine learning technique for designing a model.

CO 87. Develop a machine learning model for business problems.

CO 88. Evaluate and compare the performance of machine learning models.

Approach in teaching: Interactive Lectures, Group Discussion, Tutorials, Case Study

Learning activities for the students:

Self-learning assignments, Presentation

Class test, Semester end examinations, Quiz, Assignments, Presentation

 

18.00

Introduction to Data Mining and Machine Learning: Basic Data Mining Tasks, Data Mining versus Knowledge Discovery in Databases, Applications of Machine Learning, Machine Learning vs AI, Types of Machine Learning. Metrics and Accuracy Measures: precision, recall, F-measure, confusion matrix, cross-validation, bootstrap. Probability and likelihood, probability distributions. The Orange data mining tool.
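As an illustration of the accuracy measures listed in this unit, the sketch below derives precision, recall and F-measure from a binary confusion matrix. The labels and predictions are made-up values, and the function names are hypothetical; this is one simple way to compute the measures, not a prescribed implementation.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for a binary problem."""
    tp = fp = fn = tn = 0
    for actual, predicted in zip(y_true, y_pred):
        if predicted == positive and actual == positive:
            tp += 1
        elif predicted == positive:
            fp += 1
        elif actual == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def precision_recall_f1(tp, fp, fn):
    """Precision, recall and their harmonic mean (F-measure)."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Illustrative (made-up) labels: 1 = positive class, 0 = negative class
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

tp, fp, fn, tn = confusion_counts(y_true, y_pred)
precision, recall, f1 = precision_recall_f1(tp, fp, fn)
accuracy = (tp + tn) / len(y_true)
print(f"TP={tp} FP={fp} FN={fn} TN={tn}")
print(f"precision={precision:.2f} recall={recall:.2f} F1={f1:.2f} accuracy={accuracy:.2f}")
```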

 

18.00

Understanding the problem by understanding the data; unbalanced data. Unsupervised Learning: association rules, Apriori algorithm, FP-tree algorithm, and their implementation in Python and the Orange tool; Market Basket Analysis and Association Analysis.
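To make the Apriori idea concrete, here is a minimal pure-Python sketch of level-wise frequent itemset mining on a small market-basket example. The transactions, the 0.6 support threshold and the function name `apriori` are assumptions chosen for illustration; a course implementation may well use a library or the Orange tool instead.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: support} for every itemset with support >= min_support."""
    transactions = [frozenset(t) for t in transactions]
    n = len(transactions)

    def support(itemset):
        # Fraction of transactions that contain the whole itemset
        return sum(1 for t in transactions if itemset <= t) / n

    items = {item for t in transactions for item in t}
    current = {frozenset([i]) for i in items}  # candidate 1-itemsets
    frequent = {}
    k = 1
    while current:
        # Keep only candidates that meet the support threshold
        level = {c: support(c) for c in current}
        level = {c: s for c, s in level.items() if s >= min_support}
        frequent.update(level)
        # Join step: combine frequent k-itemsets into (k+1)-item candidates
        k += 1
        candidates = {a | b for a in level for b in level if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent
        current = {
            c for c in candidates
            if all(frozenset(s) in level for s in combinations(c, k - 1))
        }
    return frequent


# Illustrative (made-up) market baskets
baskets = [
    {"bread", "milk"},
    {"bread", "diapers", "beer", "eggs"},
    {"milk", "diapers", "beer", "cola"},
    {"bread", "milk", "diapers", "beer"},
    {"bread", "milk", "diapers", "cola"},
]

frequent = apriori(baskets, min_support=0.6)

# Confidence of the rule {diapers} -> {beer}
confidence = frequent[frozenset({"diapers", "beer"})] / frequent[frozenset({"diapers"})]
print(sorted(tuple(sorted(s)) for s in frequent))
print(f"confidence(diapers -> beer) = {confidence:.2f}")
```

The prune step relies on the Apriori property: an itemset can only be frequent if all of its subsets are frequent, which keeps the candidate set small.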

 

18.00

Clustering: k-means and implementation of k-means using Python and the Orange tool. Concept of other clustering algorithms: Expectation Maximization (EM) algorithm, Hierarchical clustering, and DBSCAN.
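The k-means procedure named in this unit alternates an assignment step and a centroid update step. The following is a minimal pure-Python sketch under simple assumptions (squared Euclidean distance, random initial centroids drawn from the data, made-up 2-D points); the function name and parameters are hypothetical.

```python
import random


def kmeans(points, k, iterations=100, seed=0):
    """Basic k-means on a list of equal-length numeric tuples."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # initial centroids picked from the data
    labels = []
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid
        labels = [
            min(range(k),
                key=lambda j: sum((a - b) ** 2 for a, b in zip(p, centroids[j])))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members
        new_centroids = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                new_centroids.append(
                    tuple(sum(coord) / len(members) for coord in zip(*members))
                )
            else:
                new_centroids.append(centroids[j])  # leave an empty cluster in place
        if new_centroids == centroids:
            break  # converged: centroids no longer change
        centroids = new_centroids
    return centroids, labels


# Illustrative (made-up) 2-D points forming two visible groups
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 0.5), (8.0, 8.0), (9.0, 9.5), (8.5, 9.0)]
centroids, labels = kmeans(points, k=2)
print(labels)
print(centroids)
```

Results of k-means generally depend on the initial centroids, so in practice the algorithm is often run several times with different seeds.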

 

18.00

Classification & Prediction: model construction, performance, attribute selection. Issues: under- and over-fitting, cross-validation, tree pruning methods, missing values, Information Gain, Gain Ratio, Gini Index, continuous classes. Classification and Regression Trees (CART) and C5.0. Implementation of decision trees in Python and the Orange tool.
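The attribute selection measures in this unit can be computed directly from class counts. The sketch below, using toy weather-style data chosen purely for illustration, shows entropy, Information Gain, Gain Ratio and the Gini Index for a single categorical attribute; the helper names are hypothetical.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of labels."""
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())


def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def split_by(values, labels):
    """Group class labels by the attribute value of each row."""
    groups = {}
    for v, y in zip(values, labels):
        groups.setdefault(v, []).append(y)
    return list(groups.values())


def information_gain(values, labels):
    """Entropy of the classes minus the weighted entropy after the split."""
    n = len(labels)
    remainder = sum(len(g) / n * entropy(g) for g in split_by(values, labels))
    return entropy(labels) - remainder


def gain_ratio(values, labels):
    """Information gain normalised by the entropy of the attribute itself."""
    split_info = entropy(values)
    return information_gain(values, labels) / split_info if split_info else 0.0


# Toy data: one categorical attribute and a yes/no class, 14 rows
outlook = ["sunny"] * 5 + ["overcast"] * 4 + ["rain"] * 5
play = (["yes"] * 2 + ["no"] * 3      # sunny rows
        + ["yes"] * 4                  # overcast rows
        + ["yes"] * 3 + ["no"] * 2)    # rain rows

ig = information_gain(outlook, play)
gr = gain_ratio(outlook, play)
g = gini(play)
print(f"information gain={ig:.3f} gain ratio={gr:.3f} gini={g:.3f}")
```

A decision tree learner would evaluate such a measure for every candidate attribute and split on the best-scoring one.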

 

18.00

Classification & Prediction: Linear Regression, Multiple Linear Regression, Logistic Regression, Naïve Bayes and Support Vector Machines (SVM). Implementation of Linear Regression, Logistic Regression, Naïve Bayes and SVM in Python and the Orange tool.
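As a small example of the first technique in this unit, the sketch below fits a simple (one-predictor) linear regression by ordinary least squares and reports R-squared. The data values and function names are assumptions for illustration only; multiple regression and the other listed models would normally be built with a library or the Orange tool.

```python
def fit_simple_linear_regression(xs, ys):
    """Ordinary least squares fit of y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def r_squared(xs, ys, slope, intercept):
    """Share of the variance in y explained by the fitted line."""
    mean_y = sum(ys) / len(ys)
    ss_res = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1.0 - ss_res / ss_tot


# Illustrative (made-up) data, e.g. advertising spend vs. sales
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.9, 5.1, 7.0, 9.2, 10.8]

slope, intercept = fit_simple_linear_regression(xs, ys)
r2 = r_squared(xs, ys, slope, intercept)
print(f"slope={slope:.2f} intercept={intercept:.2f} R^2={r2:.4f}")
```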

*Case studies related to all topics are to be taught.

 

 

Essential Readings: 
  • Jiawei Han & Micheline Kamber, “Data Mining: Concepts & Techniques”, Morgan Kaufmann Publishers, Third Edition.
  • Sebastian Raschka & Vahid Mirjalili, “Python Machine Learning”, Second Edition, Packt.
  • McKinney, “Python for Data Analysis”, O’Reilly Publication, 2017.
  • Curtis Miller, “Hands-On Data Analysis with NumPy and Pandas”.
  • (Latest editions of the above books are to be referred to)

 

References: 

Suggested readings

  • Curtis Miller, “Hands-On Data Analysis with NumPy and Pandas”.
  • (Latest editions of the above books are to be referred to)

E resources

Journals

 

Academic Year: