A type of machine learning in which an agent learns to make decisions by taking actions in an environment and receiving feedback, typically rewards or penalties, on the outcomes of those actions.
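
As a rough illustration of the act-then-receive-feedback loop described above, the sketch below runs an epsilon-greedy agent on a toy multi-armed bandit. Everything in it is an assumption made for illustration and is not part of the definition: the function name `run_agent`, the parameters `reward_probs`, `steps`, `epsilon`, and `seed`, and the choice of epsilon-greedy itself, which is only one of many possible strategies.

```python
import random


def run_agent(reward_probs, steps=2000, epsilon=0.1, seed=0):
    """Epsilon-greedy agent on a toy bandit (hypothetical, illustrative setup).

    reward_probs[a] is the hidden probability that action a pays a reward of 1.
    Returns the agent's learned value estimates and how often it chose each action.
    """
    rng = random.Random(seed)
    n_actions = len(reward_probs)
    values = [0.0] * n_actions  # running estimate of each action's average reward
    counts = [0] * n_actions    # number of times each action has been taken

    for _ in range(steps):
        # Decide: explore a random action with probability epsilon,
        # otherwise exploit the action with the highest current estimate.
        if rng.random() < epsilon:
            action = rng.randrange(n_actions)
        else:
            action = max(range(n_actions), key=lambda a: values[a])

        # Feedback: the environment returns reward 1 or 0 for the chosen action.
        reward = 1.0 if rng.random() < reward_probs[action] else 0.0

        # Learn: move the estimate toward the observed reward (incremental mean).
        counts[action] += 1
        values[action] += (reward - values[action]) / counts[action]

    return values, counts
```

Calling `run_agent([0.2, 0.8])` would be expected to leave the agent choosing the second action far more often than the first, since feedback gradually reveals that it pays off more. Exact counts depend on the random seed.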