Clinical Tools
Ref. 01
MedAgentBench
Realistic virtual EHR environment to benchmark medical LLM agents. Stanford ML Group evaluation framework.
stanfordmlgroup | Python | 212 stars | 46 forks