搜索资源列表
RLApplet
- Applet for Reinforcement learning Game
HCGreedy
- 一种贪婪算法编码,可用于各种强化学习的实现中。-Encoding a greedy algorithm can be used to achieve a variety of reinforcement learning.
TankWar6
- 强化学习版坦克大战,利用Q学习来完成坦克大战,体现强化学习的基本原理。-Battle City version of reinforcement learning, the use of Q-learning to accomplish physics, reflecting strengthening basic principles of learning.
reinforementlearning11
- 关于强化学习必不可少的论文,讲的很详细,希望对大家学习有利。-Papers about reinforcement learning is indispensable, very detailed, hope is good for everybody to learn
wanju
- 混凝土简支梁的弯矩曲率曲线绘制,输入截面尺寸及配筋情况,即可得到整个阶段的弯矩曲率曲线-Concrete beam bending curvature of the curve plotting, input section size and reinforcement, the bending curvature of curve to get the whole stage
Q-Learning
- Q 学习方面的MATLAB程序,对于研究强化学习和自适应动态规划的朋友很有参考价值-Q learning MATLAB. For the study of reinforcement learning and adaptive dynamic programming
N-bandit
- this code simulates the n - armed bandit problem in control systems of reinforcement learning
cailiaoyonglaingbilifenxijisuanshu
- 预拌混凝土使用量/6层以上建筑主筋使用HRB400级钢筋比例计算/可循环材料使用比例计算-Use ready-mixed concrete volume, 6 floors above the building using HRB400 steel reinforcement ratio calculation, the ratio is calculated using recycled materials
maze
- 经典的迷宫问题 可用于对强化学习最优路径的研究-The classic maze can be used to study the optimal path reinforcement learning
Platform-design-program
- 在此小程序上,可方便进行桩基础的承台设计,2~16桩承台的冲切、剪切、配筋计算。输入简单参数即可按规范进行桩基础配筋设计,可能是设计院许多结构设计人员想要的。-On this small program that can facilitate the design of pile foundation pile, punching 2 to 16 caps, and shear reinforcement calculation. In
2009_Barris
- GFRP REINFORCEMENT OF FLEXURAL BEAMS
Shear_Behavior_of_Concrete_Beams_Reinforced
- GFRP REINFORCEMENT OF FLEXURAL BEAMS
h1
- OpenSees关于钢筋材料的定义所用的TCL语言,对初学者编程有一定的帮助-OpenSees definition of reinforcement material used in TCL language programming for beginners have some help
approxrl
- this is a function for reinforcement learning (RL) and dynamic programming (DP) algorithms
RL1
- Reinforcement Learning , Q-LEarning
conf_2011_04_01
- Distributed Reinforcement Learning Based MAC Protocols for Autonomous Cognitive Secondary Users
salamander
- webots 强化学习 避障算法 惩罚 随机 -webots punishment reinforcement learning obstacle avoidance algorithm randomly
KLSPI-Algorithm-theory
- 基于核函数的增强学习算法的理论分析与算法描述,及其在反馈控制上的应用-Based on theoretical analysis and descr iption of reinforcement learning algorithm kernel function, and its application in the feedback control
RL-in-Multi-Continuous-Action-Spaces
- 机器学习,大规模连续行为空间的强化学习算法研究及其应用分析-Machine learning, large-scale study of reinforcement learning algorithm continuously analyzes the behavior of space and its applications
REINFORCEMENT_LEARNING
- reinforcement Learning Q learning method