Monte – Carlo algorithm generalisation to higher state and action spaces

I am reading a research paper on the formulation of MDP problems to ICU treatment decision making. the paper applies a Monte-Carlo approach to approximate the value function. Below is a screenshot of the excerpt that I came across. The last sentence of the excerpt reads “The approach is scalable for growing number of states…

Details