对于一个强化智能学习系统来说,“美”只不过是一个奖励信号而已。

Reinforcement learning is all about gamifying the learning process.

This type of machine learning uses a reward-penalty method to teach an AI system. If it makes the right move, it gets rewarded. If it makes a mistake, it receives a penalty.

In other words, reinforcement learning forces a system to learn and adapt quickly, or it otherwise loses serious numerical rewards. It's a feedback-based machine learning method in which the AI agent learns to (rightly) behave in an environment by taking actions and seeing those actions' results.

In short, the agent learns from experience without any pre-programming and doesn't require any human supervision.

请您先登陆,再发跟帖!