91精品国产91久久久久久_国产精品二区一区二区aⅴ污介绍_一本久久a久久精品vr综合_亚洲视频一区二区三区

合肥生活安徽新聞合肥交通合肥房產生活服務合肥教育合肥招聘合肥旅游文化藝術合肥美食合肥地圖合肥社保合肥醫院企業服務合肥法律

代做COMP532、代寫a video game from OpenAI Gym

時間:2024-04-19  來源:合肥網hfw.cc  作者:hfw.cc 我要糾錯



COMP5**-202**4 Assignment 2
You need to solve each of the following problems. The assignment aims to design and
implement a deep reinforcement learning agent for a video game from OpenAI Gym or
Gymnasium. You must also include a brief report describing and discussing your solutions to the
problems. Students can do the assignment in groups or individuals.
● This assignment is worth 15% of the total mark for COMP5**
● 80% of the assignment marks will be awarded for correctness of results
● 20% of the assignment marks will be awarded for the quality of the accompanying report
● Students will do the assignment in groups
● The assignment marks will be awarded for correctness of results
● We expect 5 students in one group (it would be fine to have groups of 1, 2, 3, and 4 as
well, but it is suggested to have groups of 5), please find your team members on your
own.
● Only one single submission is needed for each group
● The same marks will be granted to all the members in the same group
● Please list all your group members (names, emails, student ids) and individual
contributions in your submitted report
Submission Instructions
● Deadline: 22 Apr 2024 17:00 (UK Time)
● Send all solutions as a single PDF document containing your answers, results, and
discussion of the results. Attach the source code for the programming problems as
separate files.
● Submit your solution via Canvas.
● Penalties for late submission apply in accordance with departmental policy as set
out in the student handbook, which can be found at
https://intranet.csc.liv.ac.uk/student/msc-handbook.pdf and the University Code of
Practice on Assessment, found at
https://www.liverpool.ac.uk/media/livacuk/tqsd/code-of-practice-on-assessment/code_of_
practice_on_assessment.pdf
Problem 1 (80 marks)
Implement a deep reinforcement learning agent for a game or environment of OpenAI Gym or
Gymnasium.
Use the lunar_lander environment:
https://gymnasium.farama.org/environments/box2d/lunar_lander/.
Please plot the learning progress of your method from 0 to 1000 episodes. You can have a
figure to show rewards and another figure to show training loss.
Please use a video or gifs or figures to demonstrate how your agent works.
Prepare a report explaining your solution and containing your results, and discussion of the
results.
Attach the source code as separate files. For example, .ipnb - an ipython notebook file.
Problem 2 (20 marks)
Explain exploration and exploitation for deep reinforcement learning.

請加QQ:99515681  郵箱:99515681@qq.com   WX:codinghelp













 

掃一掃在手機打開當前頁
  • 上一篇:代做CSE340、代寫Parsing編程語言
  • 下一篇:泰國留學簽離境后要注銷嗎(泰國留學簽注銷的流程是什么)
  • 無相關信息
    合肥生活資訊

    合肥圖文信息
    急尋熱仿真分析?代做熱仿真服務+熱設計優化
    急尋熱仿真分析?代做熱仿真服務+熱設計優化
    出評 開團工具
    出評 開團工具
    挖掘機濾芯提升發動機性能
    挖掘機濾芯提升發動機性能
    海信羅馬假日洗衣機亮相AWE  復古美學與現代科技完美結合
    海信羅馬假日洗衣機亮相AWE 復古美學與現代
    合肥機場巴士4號線
    合肥機場巴士4號線
    合肥機場巴士3號線
    合肥機場巴士3號線
    合肥機場巴士2號線
    合肥機場巴士2號線
    合肥機場巴士1號線
    合肥機場巴士1號線
  • 短信驗證碼 酒店vi設計 deepseek 幣安下載 AI生圖

    關于我們 | 打賞支持 | 廣告服務 | 聯系我們 | 網站地圖 | 免責聲明 | 幫助中心 | 友情鏈接 |

    Copyright © 2025 hfw.cc Inc. All Rights Reserved. 合肥網 版權所有
    ICP備06013414號-3 公安備 42010502001045