
I need help with 2 questions, 3 and 4, not 5

I need help with two questions, 3 and 4, not 5. Subject: Robotics.

3. Reinforcement Learning

Reinforcement learning (RL) agents learn by taking state-dependent actions and experiencing reward arising from interaction with their environments. One method is to use a table-based Q-learning algorithm.

Figure 1: The inverted pendulum problem

Q-learning tables are discrete, but most real-world tasks involve systems that have continuous states and are controlled using continuous actions. With this in mind, consider how a table-based Q-learning algorithm could learn to balance an inverted pendulum (as shown in Fig. 1). To achieve this:

(a) Describe a suitable reward function. [3 marks]

(b) Describe a suitable choice of states and explain why they are appropriate. [3 marks]

(c) Describe a suitable choice of actions, explain why they are appropriate, and explain how they relate to the states discussed in part (b). [3 marks]

(d) Discuss how an inverted pendulum task could be either an MDP or a POMDP. [2 marks]

(e) Discuss how simulated experience generated from a model within an RL agent can increase the speed with which the RL algorithm converges. How can this assist in finding a solution to the inverted pendulum task? [4 marks]

(f) The Dyna-Q algorithm is one such model-based approach to RL. Using high-level pseudocode of no more than 12 lines, describe the operation of the Dyna-Q algorithm and describe all its key terms. [5 marks]

4. State estimation

(a) When building a full state feedback controller, why is it often necessary to use some form of state estimator? [3 marks]

(b) The Luenberger observer is a deterministic state estimator. Draw its signal flow graph to illustrate its operation and explain the design and function of the Luenberger gain L. [3 marks]

(c) The Kalman filter is a stochastic state estimator. Draw a signal flow graph of the Kalman estimator and compare it with that of the Luenberger observer, illustrating all the Kalman estimator's important components, including its noise sources. [4 marks]

(d) The Kalman filter iteratively computes 5 variables, as illustrated below. Write a short paragraph on each of the terms 1 – 5 to explain their meaning and function. [10 marks]

5. Gaussian processes

Describe the main difference between using Gaussian Processes and Support Vector Machines in approximating linear functions. [20 marks]
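Question 3 turns on the mismatch between a discrete Q-table and the continuous state of a pendulum. The sketch below shows one possible way to bridge that gap: bin the continuous angle and angular velocity into table indices, then apply the standard one-step Q-learning update. It is only an illustration under stated assumptions, not a model answer. The bin counts, value ranges, torque levels, learning rate and discount factor are all hypothetical choices that do not come from the question.

```python
import math

# All constants below are illustrative assumptions, not values from the question.
N_ANGLE_BINS = 12
N_VEL_BINS = 12
ANGLE_RANGE = (-math.pi, math.pi)   # pendulum angle in rad
VEL_RANGE = (-8.0, 8.0)             # angular velocity in rad/s
ACTIONS = (-1.0, 0.0, 1.0)          # hypothetical discrete torque levels


def to_bin(value, low, high, n_bins):
    """Map a continuous value to a bin index in [0, n_bins - 1], clipping outliers."""
    clipped = min(max(value, low), high)
    index = int((clipped - low) / (high - low) * n_bins)
    return min(index, n_bins - 1)  # the upper edge falls into the last bin


def discretize(angle, velocity):
    """Turn a continuous (angle, velocity) pair into a discrete Q-table key."""
    return (
        to_bin(angle, *ANGLE_RANGE, N_ANGLE_BINS),
        to_bin(velocity, *VEL_RANGE, N_VEL_BINS),
    )


def make_q_table():
    """One row of action values per discrete state, initialised to zero."""
    return {
        (a, v): [0.0] * len(ACTIONS)
        for a in range(N_ANGLE_BINS)
        for v in range(N_VEL_BINS)
    }


def q_update(q, state, action_idx, reward, next_state, alpha=0.1, gamma=0.95):
    """One-step Q-learning: Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))."""
    td_target = reward + gamma * max(q[next_state])
    q[state][action_idx] += alpha * (td_target - q[state][action_idx])
    return q[state][action_idx]
```

Finer bins give a more accurate picture of the pendulum but enlarge the table and slow learning, which is the trade-off parts (b) and (c) ask about.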
