May 2, 2025
| Module | Syllabus | Important |
|---|---|---|
| 1 | REINFORCEMENT OVERVIEW: | |
| Course logistics and overview. Origin and history of Reinforcement Learning | ||
| research. Its connections with other related fields and with different branches | ||
| of machine learning. Reinforcement Learning Problem - Elements of | ||
| Reinforcement Learning- Limitations and Scope-Examples- Extended | ||
| Example: Tic-Tac-Toe. Are ‘representation learning’ and ‘reinforcement | ||
| learning’ the same terms? - Future Trends- Deep reinforcement learning. | 1. What is Reinforcement Learning? Define its key components (agent, environment, state, action, reward, policy, value function). |
Are Representation Learning and Reinforcement Learning the same? Compare and explain their roles in ML. (Prev Year Q1)
What are the main challenges and limitations of Reinforcement Learning? In which domains is it most effective? (Prev Year Q2)
How is RL related to Supervised/Unsupervised Learning, Control Theory, and Operations Research? (Prev Year Q11)
Explain how Tic-Tac-Toe can be solved using Reinforcement Learning. (Prev Year Q5)
What are some real-world applications of RL? | | 2 | REINFORCEMENT DECISION PROCESS AND POLICIES: Markov Decision Process:Introduction to Markov decision process (MDP), state and action value functions. Bellman expectation equations, optimality of value functions and policies, Bellman optimality equations. Policy gradient methods- Reducing variance in policy gradientestimates. | 1. What is a Markov Decision Process (MDP)? Explain with an example (e.g., chess).
Define state-value and action-value functions.
What is the Bellman Expectation Equation and how does it lead to the Bellman Optimality Equation?
What does it mean for a value function and policy to be optimal? (Prev Year Q12)
What are the methods to derive optimal policy? (e.g., value iteration, policy iteration with examples)
What is the exploration vs exploitation trade-off in Q-learning? (Prev Year Q13)
Compare Policy Gradient methods and Value-based methods (like Q-learning) — advantages and disadvantages. (Prev Year Q3)
What are some real-world applications of Policy Gradient methods in robotics, game playing, continuous control? (Prev Year Q6) **** | | 3 | REINFORCEMENT ALGORITHMS AND APPLICATIONS: Algorithms for control learning, Q-learning, Discrete action space: SARSA – Lambda, DQN- Deep Q Network. Continuousaction space:Deep Deterministic Policy Gradient (DDPG), Asynchronous Advantage Actor-Critic Algorithm(A3C). | 1. Explain Q-learning with an example. What is the discount factor (γ) and how does it affect learning?
What is SARSA(λ) and how does it differ from standard SARSA? (Prev Year Q4)
Explain the Deep Q-Network (DQN) algorithm.
What is the DDPG (Deep Deterministic Policy Gradient) algorithm?
What is A3C (Asynchronous Advantage Actor-Critic)? | | 4 | REPRESENTATION LEARNING OVERVIEW: Machine learning on graphs, Background and Traditional Approaches- Graph Statistics and Kernel Methods, Neighborhood Overlap Detection, Graph Laplacians and Spectral Methods, Neighborhood Reconstruction Methods, Multi-relational Data and Knowledge Graphs | 1. What is Representation Learning on Graphs?
How do neighborhood reconstruction methods help in node classification? (Prev Year Q7)
What are the advantages of kernel methods in graph-based learning? (Prev Year Q8)
How does multi-relational data help in link prediction in knowledge graphs? (Prev Year Q14) | | 5 | GRAPH NEURAL NETWORK: The Graph Neural Network Model, Neural Message Passing, Generalized Neighborhood Aggregation, Generalized Update Methods, Graph Pooling, Graph Neural Networks in Practice, GNNs and Graph Convolutions | 1. What is a Graph Neural Network (GNN)?
Explain neural message passing and how it enables information propagation in graphs. (Prev Year Q9)
What is generalized neighborhood aggregation?
What is graph pooling and how is it used?
How do GNNs differ from CNNs and FCLs?
What are some real-world applications of GNNs (e.g., social networks, recommender systems)? (Prev Year Q10)
What are some emerging trends in GNNs such as attention mechanisms and scalability improvements? (Prev Year Q15) |