Artificial Intelligence
Artificial Intelligence
Artificial Intelligence
AI
and
Certificate Program in
Artificial Intelligence and Machine Learning
A Practitioner’s Approach (CAIML)
ML
Jointly Organized by
National Institute
of
Technology, Warangal
E&ICT Academy
About the Program
CAIML is an intensive application oriented, real-world scenario based program in AI & ML. CAIML is a 6 Months
(Weekends), intensive skill oriented, practical training program required for building business models for
analytics. It is designed to give the participant enough exposure to the variety of applications that can be built
using techniques covered under this program. This course is designed for the experienced professionals from
variety of IT backgrounds. No prior knowledge of statistics or modeling is assumed.
OBJECTIVES
Ÿ Acquire advanced Data Analysis skills.
Ÿ Stay Industry relevant and grow in your career.
Ÿ Create AI/ML solutions for various business problems.
Ÿ Build and deploy production grade AI/ML applications.
Ÿ Apply AI/ML methods, techniques and tools immediatel
Data Visualization
10% 7%
5% Data Management
8%
20%
Artificial Intelligence 10% Statistical Thinking
20% 20%
AI a multidisciplinary field that requires a range of skills in statistics, mathematics, predictive modeling and
business analysis. An AI professional should feel at ease to build the algorithms necessary, work with various
data sources (often in disparate forms) and an innate ability to ask the right questions and find the right answer.
This module helps layout the canvas on which the rest of the modules are built.
Foundations of AI and ML
Descriptive Statistics
Ÿ Data exploration (histograms, bar chart, box plot, line graph, scatter plot)
Ÿ Qualitative and Quantitative Data
Ÿ Measure of Central Tendency (Mean, Median and Mode),
Ÿ Measure of Positions (Quartiles, Deciles, Percentiles and Quantiles),
Ÿ Measure of Dispersion (Range, Median, Absolute deviation about median, Variance and
Standard deviation), Anscombe's quartet
Ÿ Other Measures: Quartile and Percentile, Interquartile Range
Probability
Ÿ Probability (Joint, marginal and conditional probabilities)
Ÿ Probability distributions (Continuous and Discrete)
Ÿ Density Functions and Cumulative functions
Module 2: Data Management
This is foundational to Data Scientists. This requires a nontrivial understanding of the real-world problems. It
involves judgments such as those about the relevance and representativeness of the data. This module helps
participants to have a good understanding of the methods, methodologies and techniques from the basics of
statistics and probability obtain supporting evidence through data, isolate or identify factors to construct
models that can uncover relationships and variation in processes.
Data Management
Ÿ Bag-of-words
Ÿ Regular Expressions
Ÿ Sentence Splitting and Tokenization
Ÿ Punctuations and Stop words, Incorrect spellings
Ÿ Properties of words and Word cloud
Ÿ Lemmatization and Term-Document TxD computation
Ÿ Sentiment Analysis (Case Study)
This is foundational to Data Scientists. This requires a nontrivial understanding of the real-world problems. It
involves judgments such as those about the relevance and representativeness of the data. This module helps
participants to have a good understanding of the methods, methodologies and techniques from the basics of
statistics and probability obtain supporting evidence through data, isolate or identify factors to construct
models that can uncover relationships and variation in processes.
Predictive analytics is an area of statistics that deals with extracting information from data and using it to
predict trends and behavior patterns. Predicting an outcome, predicting counts, predicting a value - all these
have immumerable use cases in CRM, Fraud detection, Portfolio Management, Sales and Marketing. Predictic
Analytics is approached from Regression (glm) and Time Series models in this module.
Predictive Analytics
Ÿ
Predictive analytics is an area of statistics that deals with extracting information from data and using it to
predict trends and behavior patterns. Predicting an outcome, predicting counts, predicting a value - all these
have immumerable use cases in CRM, Fraud detection, Portfolio Management, Sales and Marketing. Predictic
Analytics is approached from Regression (glm) and Time Series models in this module.
K-Nearest Neighbors
Ÿ Computational geometry; Voronoi Diagrams; Delaunay Triangulations
Ÿ K-Nearest Neighbor algorithm; Wilson editing and triangulations
Ÿ Aspects to consider while designing K-Nearest Neighbor
Machine Learning
Ÿ Linear learning machines and Kernel space, Making Kernels and working in feature space
Ÿ SVM for classification and regression problems.
Decision Trees
Ÿ ID4, C4.5, CART
Ensembles methods
Ÿ Bagging & boosting and its impact on bias and variance
Ÿ C5.0 boosting
Ÿ Random forest
Ÿ Gradient Boosting Machines and XGBoost
Artificial Intelligence is utilized heavily in computizing cognitive functions such as speech and Vision. Often
these functions are achieved through the use of Neural networks. In this module, we will study very popular NN
architectures for achieving various cognitive functions such as Object recognition, natural language processing
besides explore reinforcement learning. We will study and practice various use cases ranging from text
generation, object tagging to fraud detection and learning games such as 2048.
Artificial Intelligence
Ÿ Time Series Forecasting
Case study 3: Sentiment Analysis or Topic Mining from New York Times
Ÿ Similarity measures (Cosine Similarity, Chi-Square, N Grams)
Ÿ Part-of-Speech Tagging
Ÿ Stemming and Chunking