This repository focuses on Reinforcement Learning related concepts, use cases, point of views and learning approaches. These are purely based on my learnings, readings, experiences in dealing with practical / real-life context and scenarios.
This is where the difference between LEADERS and LAGGARDS in this space !!
Use Case Theme | Description | Industry Relevancy | Category |
---|---|---|---|
Pricing and Promotion Analytics | Ability to apply advanced pricing and promotion strategies to improve product margins | Agriculture | Next Best Actions for Customer |
Waste and Cost reduction | Optimize warehouse logistics and network for reduced waste and maintenance cost reduction | Agriculture | Optimize Complex Operations |
Production Operations Management | Solving Scheduling and Production allocation challenges to optimize and improvise yield | Agriculture | Optimize Complex Operations |
Optimization of Product Design Process | Ability to optimize product design processes to shorten development cycle for new vehicles, features and improvise quality | Automotive | Optimize Product Development Cycle / Design |
Load Balancing | Ability to balance the load of electricity grids in a situation of varying demand cycles | Energy and Utilities | Optimize Complex Operations |
Yield Optimization | Ability to enable real-time well monitoring and precision drilling for improved yield in Oil operations | Energy and Utilities | Optimize Complex Operations |
Trading Strategy Optimization | Ability to optimize the trading strategy for an options-trading portfolio | Financial Services | Optimize Complex Operations |
Customer HyperPersonalization | Delivering advanced personalization abilities that adapt promotions, next best offers and recommendations for increase customer satisfaction and increased sales | Financial Services | Next Best Actions for Customer |
Clinical Trials | The well being of patients during clinical trials is extremely important along with the actual results of the study. In this scenario, the exploration is equivalent to identifying the best treatment, and exploitation is treating patients as effectively as possible during the trial process. | Life Sciences | Optimize Complex Operations |
Effective Inventory Management with Robotics | Stock and pick inventory using Robots | Retail and CPG | Optimize Product Development Cycle / Design |
Network Routing | Routing is the process of selecting a path for traffic in a network, such as telephone networks or computer networks (internet) etc. Allocation of channels to the right users, such that the overall throughput is maximised, can be formulated as a MABP. | Generic / Common | Optimize Product Development Cycle / Design |
Online Advertising | The goal of an advertising campaign is to maximise revenue from displaying ads. The advertiser makes revenue every time an offer is clicked by a web user. Similar to MABP, there is a trade-off between exploration, where the goal is to collect information on an ad’s performance using click-through rates, and exploitation, where we stick with the ad that has performed the best so far. | Generic / Common | Next Best Actions for Customer |
Other References: