Microsoft Word - B.Tech. - 3rd - Yr - CSE (DS) - 2022 - 23
Microsoft Word - B.Tech. - 3rd - Yr - CSE (DS) - 2022 - 23
Microsoft Word - B.Tech. - 3rd - Yr - CSE (DS) - 2022 - 23
Based On
NEP2020
(Effective from the Session: 2024-25)
B.TECH.
COMPUTER SCIENCE & ENGINEERING - (DATA SCIENCE)
CURRICULUM STRUCTURE
SEMESTER- V
Codes L T P CT TA Total PS TE PE
Constitution of India/
10 BNC501/ Essence of Indian Traditional
BNC502 2 0 0 20 10 30 70
Knowledge
Total 900 23
*The Mini Project or Internship (4 weeks) conducted during summer break after IV semester and will be assessed during V
semester.
*It is desirable that the students should do their Summer Internship or Mini Project in their specialization area in line with
the B.Tech. program
SEMESTER- VI
Codes L T P CT TA Total PS TE PE
1 3 1 0 20 10 30 70 100 4
BCS601 Software Engineering
2 Big Data and Analytics 3 1 0 20 10 30 70 100 4
BCDS601
3 Computer Networks 3 1 0 20 10 30 70 100 4
BCS603
4 BCAI061/ Departmental Elective-III 3 0 0 20 10 30 70 100 3
BCDS061/
BCAM061/
BCDS062
5 Open Elective-I 3 0 0 20 10 30 70 100 3
6 0 0 2 50 50 100 1
BCS652 Software Engineering Lab
7 0 0 2 50 50 100 1
BCDS651 Big Data and Analytics Lab
8 0 0 2 50 50 100 1
BCS653 Computer Networks Lab
Constitution of India/
9 BNC601/ Essence of Indian
BNC602 2 0 0 20 10 30 70
Traditional Knowledge
Total 800 21
The Mini Project or Internship (4 weeks) will be done during summer break after VI Semester and will be assessed
during VII semester.
It is desirable that the students should do their Summer Internship or Mini Project in their specialization area in
line with the B.Tech. program.
Departmental Elective-I
Departmental Elective-II
1.BCAM051 - Cloud Computing
2.BCAI052 - Natural Language Processing
3.BCS056-Application of Soft Computing
4.BCS057 Image Processing
Departmental Elective-III
1. BCAI061 - Cyber Forensic Analytics
2.BCDS061 - Image Analytics
3.BCAM061 – Social Media Analytics and Data Analysis
4.BCDS062 – Machine Learning Techniques
B.TECH.
Computer Science and Engineering (Data Science)
BCS501 DATABASE MANAGEMENT SYSTEM
Course Outcome (CO) Bloom’s Knowledge Level (KL)
At the end of course, the student will be able to understand
CO 1 Apply knowledge of database for real life applications. K3
CO 2 Apply query processing techniques to automate the real time problems of databases. K3, K4
CO 3 Identify and solve the redundancy problem in database tables using normalization. K2, K3
Understand the concepts of transactions, their processing so they will familiar with broad range K2, K4
CO 4 of database management issues including data integrity, security and recovery.
CO 5 Design, develop and implement a small database project using database tools. K3, K6
DETAILED SYLLABUS 3-1-0
Unit Topic Proposed
Lecture
Introduction: Overview, Database System vs File System, Database System Concept and
Architecture, Data Model Schema and Instances, Data Independence and Database Language and
Interfaces, Data Definitions Language, DML, Overall Database Structure. Data Modeling Using the
I Entity Relationship Model: ER Model Concepts, Notation for ER Diagram, Mapping Constraints, 08
Keys, Concepts of Super Key, Candidate Key, Primary Key, Generalization, Aggregation, Reduction
of an ER Diagrams to Tables, Extended ER Model, Relationship of Higher Degree.
Relational data Model and Language: Relational Data Model Concepts, Integrity Constraints,
Entity Integrity, Referential Integrity, Keys Constraints, Domain Constraints, Relational Algebra,
Relational Calculus, Tuple and Domain Calculus. Introduction on SQL: Characteristics of SQL,
Advantage of SQL. SQl Data Type and Literals. Types of SQL Commands. SQL Operators and Their
II 08
Procedure. Tables, Views and Indexes. Queries and Sub Queries. Aggregate Functions. Insert, Update
and Delete Operations, Joins, Unions, Intersection, Minus, Cursors, Triggers, Procedures in SQL/PL
SQL
Data Base Design & Normalization: Functional dependencies, normal forms, first, second, 8 third
III normal forms, BCNF, inclusion dependence, loss less join decompositions, normalization using FD, 08
MVD, and JDs, alternative approaches to database design
Transaction Processing Concept: Transaction System, Testing of Serializability, Serializability of
Schedules, Conflict & View Serializable Schedule, Recoverability, Recovery from Transaction
IV Failures, Log Based Recovery, Checkpoints, Deadlock Handling. Distributed Database: Distributed 08
Data Storage, Concurrency Control, Directory System.
Concurrency Control Techniques: Concurrency Control, Locking Techniques for Concurrency
V Control, Time Stamping Protocols for Concurrency Control, Validation Based Protocol, Multiple 08
Granularity, Multi Version Schemes, Recovery with Concurrent Transaction, Case Study of Oracle.
Text books:
1. Korth, Silbertz, Sudarshan,” Database Concepts”, McGraw Hill
2. Date C J, “An Introduction to Database Systems”, Addision Wesley
3. Elmasri, Navathe, “Fundamentals of Database Systems”, Addision Wesley
4. O’Neil, Databases, Elsevier Pub.
5. RAMAKRISHNAN”Database Management Systems”,McGraw Hill
6. Leon & Leon,”Database Management Systems”, Vikas Publishing House
7. Bipin C. Desai, “An Introduction to Database Systems”, Gagotia Publications
8. Majumdar & Bhattacharya, “Database Management System”, TMH
BCDS501 INTRODUCTION TO DATA ANALYTICS AND VISUALIZATION
Course Outcome (CO) Bloom’s Knowledge Level (KL)
At the end of course, the student will be able to understand
Describe the life cycle phases of Data Analytics through discovery, planning and building. K3
CO 1
CO 2 Understand and apply Data Analysis Techniques. K3, K4
CO 3 Implement various Data streams. K2, K3
CO 4 Understand item sets, Clustering, frame works & Visualizations. K2, K4
Design new algorithms, prove them correct, and analyze their asymptotic and absolute runtime K4, K6
CO 1
and memory demands.
Find an algorithm to solve the problem (create) and prove that the algorithm solves the problem K5, K6
CO 2
correctly (validate).
Understand the mathematical criterion for deciding whether an algorithm is efficient, and know K2, K5
CO 3
many practically important problems that do not admit any efficient algorithms.
CO 4 Apply classical sorting, searching, optimization and graph algorithms. K2, K4
Understand basic techniques for designing algorithms, including the techniques of recursion, K2, K3
CO 5
divide-and-conquer, and greedy.
DETAILED SYLLABUS 3-1-0
Divide and Conquer with Examples Such as Sorting, Matrix Multiplication, Convex Hull and
Searching.
III Greedy Methods with Examples Such as Optimal Reliability Allocation, Knapsack, Minimum 08
Spanning Trees – Prim’s and Kruskal’s Algorithms, Single Source Shortest Paths - Dijkstra’s and
Bellman Ford Algorithms.
Dynamic Programming with Examples Such as Knapsack. All Pair Shortest Paths – Warshal’s and
Floyd’s Algorithms, Resource Allocation Problem. Backtracking, Branch and Bound with Examples
IV Such as Travelling Salesman Problem, Graph Coloring, n-Queen Problem, Hamiltonian Cycles and 08
Sum of Subsets.
Selected Topics: Algebraic Computation, Fast Fourier Transform, String Matching, Theory of NP
V Completeness, Approximation Algorithms and Randomized Algorithms 08
Text books:
1. Thomas H. Coreman, Charles E. Leiserson and Ronald L. Rivest, “Introduction to Algorithms”, Printice Hall of India.
2. E. Horowitz & S Sahni, "Fundamentals of Computer Algorithms",
3. Aho, Hopcraft, Ullman, “The Design and Analysis of Computer Algorithms” Pearson Education, 2008.
4. LEE "Design & Analysis of Algorithms (POD)",McGraw Hill
5. Richard E.Neapolitan "Foundations of Algorithms" Jones & Bartlett Learning
6. Jon Kleinberg and Éva Tardos, Algorithm Design, Pearson, 2005.
7. Michael T Goodrich and Roberto Tamassia, Algorithm Design: Foundations, Analysis, and Internet Examples, Second
Edition, Wiley, 2006.
8. Harry R. Lewis and Larry Denenberg, Data Structures and Their Algorithms, Harper Collins, 1997
9. Robert Sedgewick and Kevin Wayne, Algorithms, fourth edition, Addison Wesley, 2011.
10. Harsh Bhasin,”Algorithm Design and Analysis”,First Edition,Oxford University Press.
11. Gilles Brassard and Paul Bratley,Algorithmics:Theory and Practice,Prentice Hall,1995.
BCAI051 MATHEMATICAL FOUNDATION AI, ML AND DATA SCIENCE
Course Outcome (CO) Bloom’s Knowledge Level (KL)
At the end of course, the student will be able to:
Understand and apply the probability distributions, random number generation and density K2, K4, K6
CO 1 estimations to perform analysis of various kinds of data
Understand and manipulate data, design and perform simple Monte Carlo experiments, and be K5, K6
CO 2 able to use resampling methods
CO 4 Perform appropriate statistical tests using R and visualize the outcome K2, K4
Discuss the results obtained from their analyses after creating customized graphical and K2, K3
CO 5 numerical summaries
Inferential Statistics: Sampling & Confidence Interval, Inference & Significance. Estimation and
Hypothesis Testing, Goodness of fit, Test of Independence, Permutations and Randomization Test,
ttest/z-test (one sample, independent, paired), ANOVA, chi-square.
II Linear Methods for Regression Analysis: multiple regression analysis, orthogonalization by 08
Householder transformations (QR); singular value decomposition (SVD); linear dimension reduction
using principal component analysis (PCA).
Vector Spaces- Vector Space, Subspace, Linear Combination, Linear Independence, Basis, Dimension,
Finding a Basis of a Vector Space, Coordinates, Change of Basis
IV 08
Inner Product Spaces- Inner Product, Length, Orthogonal Vectors, Triangle Inequality, Cauchy-
Schwarz Inequality, Orthonormal (Orthogonal) Basis, Gram-Schmidt Process
Linear Transformations- Linear Transformations and Matrices for Linear Transformation, Kernel
and Range of a Linear Transformations, Change of Basis
V 08
Eigenvalues and Eigenvectors- Definition of Eigenvalue and Eigenvector, Diagonalization,
Symmetric Matrices and Orthogonal Diagonalization
References:
1. S.C. Gupta & V.K. Kapoor, “Fundamentals of Mathematical Statistics”, Sultan Chand & Sons
2. Sheldon M. Ross, “Introduction to Probability and Statistics for Engineers and Scientists”, Academic Press.
3. Dudewicz, E.J., Mishra, S.N., “Modern Mathematical Statistics”, Willy
4. Purohit S. G., Gore S. D., Deshmukh S. K., “Statistics using R, Narosa
5. Rizzo, M. L., “Statistical Computing with R”, Boca Raton, FL: Chapman & Hall/CRC Press
6. Normal Maltoff, The Art of R programming, William
7. Dalgaard, Peter, “Introductory statistics with R”, Springer Science & Business Media
8. M. D. Ugarte, A. F. Militino, A. T. Arnholt, “Probability and Statistics with R”, CRC Press
9. Kundu, D. and Basu, A., “Statistical computing – existing methods and recent developments”, Narosa
10. Gentle, James E., Härdle, Wolfgang Karl, Mori, Yuich, “Handbook of Computational Statistics”, Springer
11. Givens and Hoeting, “Computational Statistics”, Wiley Series in Prob. and Statistics
12. Elementary Linear Algebra by Ron Larson, 8th edition, Cengage Learning, 2017
BCS058 DATA WAREHOUSING & DATA MINING
CO 2 Understand and implement classical models and algorithms in data warehouses and data mining K3
CO 3 Characterize the kinds of patterns that can be discovered by association rule mining, K1, K2
classification and clustering
CO 4 Master data mining techniques in various applications like social, scientific and environmental K3
context
CO 5 Develop skill in selecting the appropriate data mining algorithm for solving practical problems. K1, K2
Data Warehouse Process and Technology: Warehousing Strategy, Warehouse /management and
Support Processes, Warehouse Planning and Implementation, Hardware and Operating Systems for
II Data Warehousing, Client/Server Computing Model & Data Warehousing. Parallel Processors & 08
Cluster Systems, Distributed DBMS implementations, Warehousing
Software, Warehouse Schema Design
Data Mining: Overview, Motivation, Definition & Functionalities, Data Processing, Form of Data
Pre-processing, Data Cleaning: Missing Values, Noisy Data, (Binning, Clustering, Regression,
Computer and Human inspection), Inconsistent Data, Data Integration and Transformation. Data
III Reduction:-Data Cube Aggregation, Dimensionality reduction, Data Compression, Numerosity 08
Reduction, Discretization and Concept
hierarchy generation, Decision Tree
Understand the essentials of BI & data analytics and the corresponding terminologies K2
CO 1
CO 4 Understand & Implement the K-Means Clustering with Iris Dataset K2, K3
Demonstrate the real time scenario (Case study) by using BI & Analytics K5, K6
CO 5
techniques
DETAILED SYLLABUS 3-0-0
Object Oriented Analysis: Object oriented design, Object design, Combining three models, Designing
algorithms, design optimization, Implementation of control, Adjustment of inheritance, Object
representation, Physical packaging, Documenting design considerations.
Structured analysis and structured design (SA/SD), Jackson Structured Development (JSD).
III Mapping object-oriented concepts using non-object-oriented language, Translating classes into data 08
structures, Passing arguments to methods, Implementing inheritance, associations encapsulation.
Object oriented programming style: reusability, extensibility, robustness, programming in the large.
Procedural v/s OOP, Object oriented language features. Abstraction and Encapsulation.
C++ Basics: Overview, Program structure, namespace, identifiers, variables, constants, enum, operators,
typecasting, control structures
IV C++ Functions: Simple functions, Call and Return by reference, Inline functions, Macro Vs. Inline 08
functions, Overloading of functions, default arguments, friend functions, virtual functions
Objects and Classes: Basics of object and class in C++, Private and public members, static data and
function members, constructors and their types, destructors, operator overloading, type conversion.
Inheritance: Concept of Inheritance, types of inheritance: single, multiple, multilevel, hierarchical,
V 08
hybrid, protected members, overriding, virtual base class
Polymorphism : Pointers in C++, Pointes and Objects, this pointer, virtual and pure virtual functions,
Implementing polymorphism
Text Books
1. James Rumbaugh et. al, “Object Oriented Modeling and Design”, 2nd Edition Pearson Education
2. Grady Booch, James Rumbaugh, Ivar Jacobson, “The Unified Modeling Language User Guide”, Pearson Education
3. Object Oriented Programming With C++, E Balagurusamy, McGraw-Hill Education
4. C++ Programming, Black Book, Steven Holzner, dreamtech
5. Object Oriented Programming in Turbo C++, Robert Lafore, Galgotia
6. Object Oriented Programming with ANSI and Turbo C++, Ashok Kamthane, Pearson
7. The Compete Reference C++, Herbert Schlitz, McGraw-Hill Education
BCAI052 NATURAL LANGUAGE PROCESSING
Course Outcome (CO) Bloom’s Knowledge Level (KL)
Proposed
Unit Topic
Lecture
INTRODUCTION: Origins and challenges of NLP – Language Modeling: Grammar-based LM,
Statistical LM – Regular Expressions, Finite-State Automata – English Morphology, Transducers for
lexicon and rules, Tokenization, Detecting and Correcting Spelling Errors, Minimum Edit Distance
WORD LEVEL ANALYSIS: Unsmoothed N-grams, Evaluating N-grams, Smoothing,
I 08
Interpolation and Backoff – Word Classes, Word Tokenization, Math with words TF-IDF Vectors,
Finding meaning in word count (Semantic Analysis), Linguistic Background: Outline of English
Syntax, Introduction to Semantics and Knowledge Representation, Zipf’s Law
SYNTACTIC ANALYSIS: Context Free Grammars, Grammar rules for English, Grammars and
Parsing: Grammars and sentence Structure, Top-Down and Bottom-Up Parsers, Transition
Network Grammars, Top- Down Chart Parsing. Feature Systems and Augmented Grammars: Basic
Feature system for English, Morphological Analysis and the Lexicon, Parsing with Features,
II 08
Augmented Transition Networks.
Ambiguity Resolution: Statistical Methods, Probabilistic Language Processing, Estimating
Probabilities, Part-of-Speech tagging, Obtaining Lexical Probabilities, Probabilistic Context-Free
Grammars, Best First Parsing Feature structures, Unification of feature structures.
SEMANTICS AND PRAGMATICS:
Requirements for representation, First-Order Logic, Description Logics – Syntax-Driven Semantic
III analysis, Semantic attachments – Word Senses, Relations between Senses, Thematic Roles, 08
selectional restrictions – Word Sense Disambiguation, WSD using Supervised, Dictionary &
Thesaurus, Bootstrapping methods – Word Similarity using Thesaurus and Distributional methods.
BASIC CONCEPTS of Speech Processing: Speech Fundamentals: Articulatory Phonetics –
Production And Classification Of Speech Sounds; Acoustic Phonetics – Acoustics Of Speech
IV 08
Production; Review Of Digital Signal Processing Concepts; Short-Time Fourier Transform,
FilterBank And LPC Methods.
SPEECH-ANALYSIS: Features, Feature Extraction And Pattern Comparison Techniques: Speech
Distortion Measures– Mathematical And Perceptual 08
V
Real World NLP Challenges-Information Extraction and Question Answering, Dialog Engines,
Optimization, Parallelization and batch processing.
Text books:
1. Daniel Jurafsky, James H. Martin―Speech and Language Processing: An Introduction to Natural Language
Processing, Computational Linguistics and Speech, Pearson Publication, 2014.
2. Steven Bird, Ewan Klein and Edward Loper, ―Natural Language Processing with Python, First Edition, OReilly
Media, 2009.
3. Lawrence Rabiner And Biing-Hwang Juang, “Fundamentals Of Speech Recognition”, Pearson Education, 2003.
4. Daniel Jurafsky And James H Martin, “Speech And Language Processing – An Introduction To Natural Language
Processing, Computational Linguistics, And Speech Recognition”, Pearson Education, 2002.
5. Frederick Jelinek, “Statistical Methods Of Speech Recognition”, MIT Press, 1997.
6. 1. Breck Baldwin, ―Language Processing with Java and LingPipe Cookbook, Atlantic Publisher, 2015.
7. Richard M Reese, ―Natural Language Processing with Java, OReilly Media, 2015.
8. Nitin Indurkhya and Fred J. Damerau, ―Handbook of Natural Language Processing, Second Edition, Chapman and
Hall/CRC Press, 2010.
9. Tanveer Siddiqui, U.S. Tiwary, ―Natural Language Processing and Information Retrieval, Oxford University Press,
2008.
BCAM051 CLOUD COMPUTING
II Cloud Enabling Technologies Service Oriented Architecture: REST and Systems of Systems – 08
Web Services – Publish, Subscribe Model – Basics of Virtualization – Types of Virtualization –
Implementation Levels of Virtualization – Virtualization Structures – Tools and Mechanisms –
Virtualization of CPU – Memory – I/O Devices –Virtualization Support and Disaster Recovery.
III Cloud Architecture, Services And Storage: Layered Cloud Architecture Design – NIST Cloud 08
Computing Reference Architecture – Public, Private and Hybrid Clouds – laaS – PaaS – SaaS –
Architectural Design Challenges – Cloud Storage – Storage‐as‐a‐Service – Advantages of Cloud
Storage – Cloud Storage Providers – S3.
IV Resource Management And Security In Cloud: Inter Cloud Resource Management – Resource 08
Provisioning and Resource Provisioning Methods – Global Exchange of Cloud Resources – Security
Overview – Cloud Security Challenges – Software‐as‐a‐Service Security – Security Governance –
Virtual Machine Security – IAM – Security Standards.
V Cloud Technologies And Advancements Hadoop: MapReduce – Virtual Box — Google App 08
Engine – Programming Environment for Google App Engine –– Open Stack – Federation in the
Cloud – Four Levels of Federation – Federated Services and Applications – Future of Federation.
Text books:
1. Kai Hwang, Geoffrey C. Fox, Jack G. Dongarra, “Distributed and Cloud Computing, From Parallel Processing to the
Internet of Things”, Morgan Kaufmann Publishers, 2012.
2. Rittinghouse, John W., and James F. Ransome, ―Cloud Computing: Implementation, Management and Security, CRC
Press, 2017.
3. Rajkumar Buyya, Christian Vecchiola, S. ThamaraiSelvi, ―Mastering Cloud Computing, Tata Mcgraw Hill, 2013.
4. Toby Velte, Anthony Velte, Robert Elsenpeter, “Cloud Computing – A Practical Approach, Tata Mcgraw Hill, 2009.
5. George Reese, “Cloud Application Architectures: Building Applications and Infrastructure in the Cloud: Transactional
Systems for EC2 and Beyond (Theory in Practice), O’Reilly, 2009.
BCS056 APPLICATION OF SOFT COMPUTING
CO 1 Recognize the feasibility of applying a soft computing methodology for a particular problem K2, K4
Know the concepts and techniques of soft computing and foster their abilities in designing and K4, K6
CO 2 implementing soft computing- based solutions for real-world and engineering problems
Apply neural networks to pattern classification and regression problems and compare K3, K5
CO 3 solutions by various soft computing approaches for a given problem.
CO 4 Apply fuzzy logic and reasoning to handle uncertainty and solve engineering problems K3, K4
IV Fuzzy Logic –II (Fuzzy Membership, Rules) : Membership functions, interference in fuzzy logic, 08
fuzzy if-then rules, Fuzzy implications and Fuzzy algorithms, Fuzzyfications & Defuzzificataions,
Fuzzy Controller, Industrial applications
V Genetic Algorithm(GA): Basic concepts, working principle, procedures of GA, flow chart of GA, 08
Genetic representations, (encoding) Initialization and selection, Genetic operators, Mutation,
Generational Cycle, applications.
Text books:
1. S. Rajsekaran & G.A. Vijayalakshmi Pai, “Neural Networks,Fuzzy Logic and Genetic Algorithm:Synthesis and
Applications” Prentice Hall of India.
2. N.P.Padhy,”Artificial Intelligence and Intelligent Systems” Oxford University Press. Reference Books:
3. Siman Haykin,”Neural Networks”Prentice Hall of India
4. Saroj Kaushik, Sunita Tiwari, “Soft Computing: Fundamentals, Techniques and Applications”, McGraw Hill
Education
5. Timothy J. Ross, “Fuzzy Logic with Engineering Applications” Wiley India.
6. Kumar Satish, “Neural Networks” Tata Mc Graw Hill
BCS057 IMAGE PROCESSING
Understand and apply oracle 11 g products for creating tables, views, indexes, sequences and K2, K4
CO 1 other database objects.
Design and implement a database schema for company data base, banking data base, library K3, K5,
CO 2
information system, payroll processing system, student information system. K6
CO 3 Write and execute simple and complex queries using DDL, DML, DCL and TCL K4, K5
CO 4 Write and execute PL/SQL blocks, procedure functions, packages and triggers, cursors. K4, K5
Enforce entity integrity, referential integrity, key constraints, and domain constraints on K3, K4
CO 5 database.
DETAILED SYLLABUS
Data Definition Language (DDL) Statements: (Create table, Alter table, Drop table)
Data Manipulation Language (DML) Statements
Database Management Lab Data Query Language (DQL) Statements: (Select statement with operations like Where
(BCS-551) clause, Order by, Logical operators, Scalar functions and Aggregate functions)
CO 5 Implement and evaluate the performance of KNN algorithm on different datasets K3, K4
DETAILED SYLLABUS
1. To get the input from user and perform numerical operations (MAX, MIN, AVG, SUM, SQRT, ROUND) using in
R.
2. To perform data import/export (.CSV, .XLS, .TXT) operations using data frames in R.
3. To get the input matrix from user and perform Matrix addition, subtraction, multiplication, inverse transpose and
division operations using vector concept in R.
4. To perform statistical operations (Mean, Median, Mode and Standard deviation) using R.
5. To perform data pre-processing operations i) Handling Missing data ii) Min-Max normalization
6. To perform dimensionality reduction operation using PCA for Houses Data Set
7. To perform Simple Linear Regression with R.
8. To perform K-Means clustering operation and visualize for iris data set
9. Learn how to collect data via web-scraping, APIs and data connectors from suitable sources as specified by
the instructor.
10. Perform association analysis on a given dataset and evaluate its accuracy.
11. Build a recommendation system on a given dataset and evaluate its accuracy.
12. Build a time-series model on a given dataset and evaluate its accuracy.
13. Build cartographic visualization for multiple datasets involving various countries of the world; states and
districts in India etc.
14. Perform text mining on a set of documents and visualize the most important words in a visualization such as
word cloud.
Note: The Instructor may add/delete/modify/tune experiments, wherever he/she feels in a justified manner
It is also suggested that open source tools should be preferred to conduct the lab (R, Python etc. )
BCS553 DESIGN AND ANALYSIS OF ALGORITHM LAB
Course Outcome (CO) Bloom’s Knowledge Level (KL)
Implement algorithm to solve problems by Dynamic programming, backtracking, branch and K4, K5
CO 4 bound approach.
CO 5 Implement algorithm to solve problems by branch and bound approach. K3, K4
DETAILED SYLLABUS
CO 1 Demonstrate knowledge of Big Data Analytics concepts and its applications in business. K1, K2
CO 2 Demonstrate functions and components of Map Reduce Framework and HDFS. K1, K2
II Hadoop: History of Hadoop, Apache Hadoop, the Hadoop Distributed File System,
components of Hadoop, data format, analyzing data with Hadoop, scaling out, Hadoop
streaming, Hadoop pipes, Hadoop Echo System.
Map Reduce: Map Reduce framework and basics, how Map Reduce works, developing a 08
Map Reduce application, unit tests with MR unit, test data and local tests, anatomy of a Map
Reduce job run, failures, job scheduling, shuffle and sort, task execution, Map Reduce types,
input formats, output formats, Map Reduce features, Real-world Map Reduce
III HDFS (Hadoop Distributed File System): Design of HDFS, HDFS concepts, benefits and
challenges, file sizes, block sizes and block abstraction in HDFS, data replication, how does
HDFS store, read, and write files, Java interfaces to HDFS, command line interface. Hadoop
file system interfaces, data flow, data ingest with Flume and Scoop, Hadoop archives,
08
Hadoop I/O: compression, serialization, Avro and file-based data structures. Hadoop
Environment: Setting up a Hadoop cluster, cluster specification, cluster setup and
installation, Hadoop configuration, security in Hadoop, administering Hadoop, HDFS
monitoring & maintenance, Hadoop benchmarks, Hadoop in the cloud
IV Hadoop Eco System and YARN: Hadoop ecosystem components, schedulers, fair and
capacity, Hadoop 2.0 New Features - NameNode high availability, HDFS federation, MRv2,
YARN, Running MRv1 in YARN.
NoSQL Databases: Introduction to NoSQL
MongoDB: Introduction, data types, creating, updating and deleing documents, querying,
09
introduction to indexing, capped collections
Spark: Installing spark, spark applications, jobs, stages and tasks, Resilient Distributed
Databases, anatomy of a Spark job run, Spark on YARN
SCALA: Introduction, classes and objects, basic types and operators, built-in control
structures, functions and closures, inheritance.
V Hadoop Eco System Frameworks: Applications on Big Data using Pig, Hive and HBase
Pig - Introduction to PIG, Execution Modes of Pig, Comparison of Pig with Databases, Grunt, 09
Pig Latin, User Defined Functions, Data Processing operators,
Hive - Apache Hive architecture and installation, Hive shell, Hive services, Hive metastore,
comparison with traditional databases, HiveQL, tables, querying data and userdefined
functions, sorting and aggregating, Map Reduce scripts, joins & subqueries.
HBase – Hbase concepts, clients, example, Hbase vs RDBMS, advanced usage, schema
design, advance indexing, Zookeeper – how it helps in monitoring a cluster, how to build
applications with Zookeeper.
IBM Big Data strategy, introduction to Infosphere, BigInsights and Big Sheets, introduction
to Big SQL.
Text books and References:
1. Michael Minelli, Michelle Chambers, and Ambiga Dhiraj, "Big Data, Big Analytics: Emerging Business
Intelligence and Analytic Trends for Today's Businesses", Wiley
2. Big-Data Black Book, DT Editorial Services, Wiley
3. Dirk deRoos, Chris Eaton, George Lapis, Paul Zikopoulos, Tom Deutsch, “Understanding Big Data Analytics for
Enterprise Class Hadoop and Streaming Data”, McGrawHill.
4. Thomas Erl, Wajid Khattak, Paul Buhler, “Big Data Fundamentals: Concepts, Drivers and Techniques”, Prentice
Hall.
5. Raj Kamal, Preeti Saxena, “Big Data Analytics”, McGraw Hill Education
6. Bart Baesens “Analytics in a Big Data World: The Essential Guide to Data Science and its Applications (WILEY
Big Data Series)”, John Wiley & Sons
7. ArshdeepBahga, Vijay Madisetti, “Big Data Science & Analytics: A HandsOn Approach “, VPT
8. Anil Maheshwari, “Big Data”, Second Edition, McGraw Hill
9. Anand Rajaraman and Jeffrey David Ullman, “Mining of Massive Datasets”, CUP
10. Tom White, "Hadoop: The Definitive Guide", O'Reilly.
11. Eric Sammer, "Hadoop Operations", O'Reilly.
12. Chuck Lam, “Hadoop in Action”, MANNING Publishers
13. Deepak Vohra, “Practical Hadoop Ecosystem: A Definitive Guide to Hadoop-Related Frameworks and Tools”,
Apress
14. E. Capriolo, D. Wampler, and J. Rutherglen, "Programming Hive", O'Reilly
15. Lars George, "HBase: The Definitive Guide", O'Reilly.
16. Alan Gates, "Programming Pig", O'Reilly.
17. Michael Berthold, David J. Hand, “Intelligent Data Analysis”, Springer
18. Bill Franks, “Taming the Big Data Tidal Wave: Finding Opportunities in Huge Data Streams with Advanced
Analytics”, John Wiley & sons
19. Glenn J. Myatt, “Making Sense of Data”, John Wiley & Sons
20. Pete Warden, “Big Data Glossary”, O’Reilly
BCS601 SOFTWARE ENGINEERING
Course Outcome (CO) Bloom’s Knowledge Level (KL)
Manage software development process independently as well as in teams and make use of
CO 5 Various software management tools for development, maintenance and analysis. K5
Software Design: Basic Concept of Software Design, Architectural Design, Low Level Design:
Modularization, Design Structure Charts, Pseudo Codes, Flow Charts, Coupling and Cohesion
Measures, Design Strategies: Function Oriented Design, Object Oriented Design, Top-Down and
III Bottom-Up Design. Software Measurement and Metrics: Various Size Oriented Measures: 08
Halestead’s Software Science, Function Point (FP) Based Measures, Cyclomatic Complexity
Measures: Control Flow Graphs.
Software Testing: Testing Objectives, Unit Testing, Integration Testing, Acceptance Testing,
Regression Testing, Testing for Functionality and Testing for Performance, TopDown and BottomUp
Testing Strategies: Test Drivers and Test Stubs, Structural Testing (White Box Testing),
IV Functional Testing (Black Box Testing), Test Data Suit Preparation, Alpha and Beta Testing of 08
Products. Static Testing Strategies: Formal Technical Reviews (Peer Reviews), Walk Through, Code
Inspection, Compliance with Design and Coding Standards.
Software Maintenance and Software Project Management: Software as an Evolutionary Entity,
Need for Maintenance, Categories of Maintenance: Preventive, Corrective and Perfective
Maintenance, Cost of Maintenance, Software Re- Engineering, Reverse Engineering. Software
V Configuration Management Activities, Change Control Process, Software Version Control, An 08
Overview of CASE Tools. Estimation of Various Parameters such as Cost, Efforts,
Schedule/Duration, Constructive Cost Models (COCOMO), Resource Allocation Models, Software
Risk Analysis and Management.
Text books:
1. RS Pressman, Software Engineering: A Practitioners Approach, McGraw Hill.
2. Pankaj Jalote, Software Engineering, Wiley
3. Rajib Mall, Fundamentals of Software Engineering, PHI Publication.
4. KK Aggarwal and Yogesh Singh, Software Engineering, New Age International Publishers.
5. Ghezzi, M. Jarayeri, D. Manodrioli, Fundamentals of Software Engineering, PHI Publication.
6. Ian Sommerville, Software Engineering, Addison Wesley.
7. Kassem Saleh, “Software Engineering”, Cengage Learning.
8. P fleeger, Software Engineering, Macmillan Publication
BCS603 COMPUTER NETWORKS
Explain basic concepts, OSI reference model, services and role of each layer of OSI model and
CO1 K1, K2
TCP/IP, networks devices and transmission media, Analog and digital data transmission
CO2 Apply channel allocation, framing, error and flow control techniques. K3
Describe the functions of Network Layer i.e. Logical addressing, subnetting & Routing
CO3 K2, K3
Mechanism.
Explain the different Transport Layer function i.e. Port addressing, Connection Management,
CO4 K2, K3
Error control and Flow control mechanism.
CO5 Explain the functions offered by session and presentation layer and their Implementation. K2, K3
Explain the different protocols used at application layer i.e. HTTP, SNMP, SMTP, FTP,
CO6 K2
TELNET and VPN.
DETAILED SYLLABUS 3-0-0
II Link layer: Framing, Error Detection and Correction, Flow control (Elementary Data Link Protocols,
Sliding Window protocols). 08
Medium Access Control and Local Area Networks: Channel allocation, Multiple access protocols,
LAN standards, Link layer switches & bridges (learning bridge and spanning tree algorithms).
III Network Layer: Point-to-point networks, Logical addressing, Basic internetworking (IP, CIDR,
ARP, RARP, DHCP, ICMP), Routing, forwarding and delivery, Static and dynamic routing, Routing 08
algorithms and protocols, Congestion control algorithms, IPv6.
IV Transport Layer: Process-to-process delivery, Transport layer protocols (UDP and TCP),
Multiplexing, Connection management, Flow control and retransmission, Window management, 08
TCP Congestion control, Quality of service.
V Application Layer: Domain Name System, World Wide Web and Hyper Text Transfer Protocol,
Electronic mail, File Transfer Protocol, Remote login, Network management, Data compression, 08
Cryptography – basic concepts.
Text books and References:
1. Behrouz Forouzan, “Data Communication and Networking”, McGraw Hill
2. Andrew Tanenbaum “Computer Networks”, Prentice Hall.
3. William Stallings, “Data and Computer Communication”, Pearson.
4. Kurose and Ross, “Computer Networking- A Top-Down Approach”, Pearson.
5. Peterson and Davie, “Computer Networks: A Systems Approach”, Morgan Kaufmann
6. W. A. Shay, “Understanding Communications and Networks”, Cengage Learning.
7. D. Comer, “Computer Networks and Internets”, Pearson.
8. Behrouz Forouzan, “TCP/IP Protocol Suite”, McGraw Hill.
BCAI061 CYBER FORENSIC ANALYTICS
Text Books:
1. Marjie T. Britz, “Computer Forensics and Cyber Crime”, Pearson, 2013.
2. Garima Tiwari, “Understanding Laws– Cyber Laws And Cyber Crimes”, Lexis Nexis, 2014.
3. Chuck Easttom, Jeff Taylor, “Computer Crime, Investigation, and the Law”, Course Technology, 2018.
4. Eoghan Casey, “Digital Evidence and Computer Crime: Forensic Science, Computers, and the Internet”,
Eoghan Casey, 2018.
BCDS061 IMAGE ANALYTICS
Course Outcome (CO) Bloom’s Knowledge Level (KL)
Infer the basics and fundamentals of digital image processing and Apply the various techniques K1, K2
CO 1 for intensity transformations functions. Implement Color image Smoothing and
Sharpening.
CO 2 Illustrate Morphological operation and Apply Some Basic Morphological Algorithms. K2, K3
Apply image segmentation techniques such as Optimum Global Thresholding using Otsu’s K3, K4
CO 3 Method, Active Contours: Snakes and Level Sets for various real-time applications.
Analysis various Feature Extraction methods and Implement for various real-time applications. K3, K4
CO 4
Apply and Analysis various Image Pattern Classification methods such as MinimumDistance K3, K4
CO 5 Classification, Optimum (Bayes) Statistical Classification, and Deep Convolutional Neural
Network.
DETAILED SYLLABUS 3-0-0
DETAILED SYLLABUS
Network Fundamentals
Network Structures, equivalence, homophile, clustering, snowball sampling, contact
tracing and random walks,
III Ego-ceneterd networks, dominance hierarchies, Third Party Records, affiliation 08
Network
Citation Networks, Peer to Peer Networks, Recommender Networks, Biological
Networks
Social Network and Modelling
Social Contexts: Affiliation and Identity, social capital, structural holes, structural balance,
Predictive Modeling, Descriptive Modeling, community/ anomaly detection
IV Facebook Analytics: Introduction, parameters, demographics, Analyzing page audience.
Reach and Engagement analysis 08
Google Analytics: Brief Implementation Technology, Limitations, Performance
Concerns, Privacy Issues. Introduction and Working, Google Website Optimizer
Processing, Visualizing and Social Media Data Analytics
Processing and Visualizing Data, Influence Maximization,Link Prediction, Collective
Classification,
Applications in Advertising and Game Analytics, collecting and visualizing social media
V data, visualization and exploration.
Social Network and Web Data Analytics Methods, Clickstream analysis, A/B testing,
online surveys, Web crawling and Indexing. 08
Natural Language ProcessingTechniques for Micro-text Analysis, Trend, social
influencers on judgements, opinion spread, judgement
Text and Reference Books:
1. Matthew Ganis, Avinash Kohirkar, Social Media Analytics: Techniques and Insights for Extracting Business Value
Out of Social Media Pearson 2016
2. Jim Sterne, Social Media Metrics: How to Measure and Optimize Your Marketing Investment Wiley Latest edition
3. Brian Clifton, Advanced Web Metrics with Google Analytics, John Wiley & Sons; 3rd Edition edition (30 Mar
2012)
4. Ganis/Kohirka, SOCIAL MEDIA ANALYTICS Paperback – 29 September 2016 by Pearson.
BCDS062 MACHINE LEARNING TECHNIQUES
Course Outcome (CO) Bloom’s Knowledge Level (KL)
CO 1 To understand the need for machine learning for various problem solving. K1, K2
CO 2 To understand a wide variety of learning algorithms and how to evaluate models generated from K1, K3
data.
CO 3 To understand the latest trends in machine learning. K2, K3
CO 4 To design appropriate machine learning algorithms and apply the algorithms to a real-world K4, K6
problem.
CO 5 To optimize the models learned and report on the expected accuracy that can be achieved by K4, K5
applying the models,
DETAILED SYLLABUS 3-0-0
DECISION TREE LEARNING - Decision tree learning algorithm, Inductive bias, Inductive
inference with decision trees, Entropy and information theory, Information gain, ID-3 Algorithm,
Issues in Decision tree learning.
III 08
INSTANCE-BASED LEARNING – k-Nearest Neighbour Learning, Locally Weighted Regression,
Radial basis function networks, Case-based learning.
Text books:
1. Tom M. Mitchell, ―Machine Learning, McGraw-Hill Education (India) Private Limited, 2013.
2. Ethem Alpaydin, ―Introduction to Machine Learning (Adaptive Computation and Machine
Learning), The MIT Press 2004.
3. Stephen Marsland, ―Machine Learning: An Algorithmic Perspective, CRC Press, 2009.
4. Bishop, C., Pattern Recognition and Machine Learning. Berlin: Springer-Verlag.
BCDS651 BIG DATA AND ANALYTICS LAB
Course Outcome (CO) Bloom’s Knowledge Level (KL)
1. Downloading and installing Hadoop; Understanding different Hadoop modes. Startup scripts, Configuration files.
2. Implement the following file management tasks in Hadoop:
i. Adding files and directories
ii. Retrieving files
iii. Deleting files Hint: A typical Hadoop workflow creates data files (such as log files) elsewhere and copies them
into HDFS using one of the above command line utilities
3. Implement of Matrix Multiplication with Hadoop Map Reduce
4. Write a Map Reduce program that mines weather data. Hint: Weather sensors collecting data every hour at many
locations across the globe gather a large volume of log data, which is a good candidate for analysis with Map Reduce,
since it is semi structured and record-oriented
5. Run a basic Word Count Map Reduce program to understand Map Reduce Paradigm. 6. Implementation of K-means
clustering using Map Reduce
7. Installation of Hive along with practice examples.
8. Installation of HBase, Installing thrift along with Practice examples
9. Patrice importing and exporting data from various data bases .
10. Write PIG Commands: Write Pig Latin scripts sort, group, join, project, and filter your data.
11. Run the Pig Latin Scripts to find Word Count .
12. Run the Pig Latin Scripts to find a max temp for each and every year.
Note: The Instructor may add/delete/modify/tune experiments
BCS651 SOFTWARE ENGINEERING LAB
Course Outcome (CO) Bloom’s Knowledge Level (KL)
DETAILED SYLLABUS
Note: The Instructor may add/delete/modify/tune experiments, wherever he/she feels in a justified manner
It is also suggested that open source tools should be preferred to conduct the lab ( Java , JSP , Bootstrap
Firebug , WampServer , MongoDB, etc)
Software Engineering Lab (BCS651): Mapping with Virtual Lab
Name of the Lab Name of the Experiment
Software
Engineering Lab
(BCS-651)
BCS653 COMPUTER NETWORKS LAB
Course Outcome (CO) Bloom’s Knowledge Level (KL)
DETAILED SYLLABUS