Este é um exemplo fantástico do ecossistema de avaliações de IA proposto por @WhiteHouse plano de ação de IA de @DavidSacks @mkratsios47 @sriramk @deanwball sendo colocado em ação
lmarena.ai
lmarena.ai19 de ago., 20:03
🧬 BiomedArena is here! We’re honored to partner with @DataTecnica and @NIH CARD, who developed BiomedArena to evaluate LLMs for biomedical discovery, and to help expand this domain-specific track in community-driven evaluations. 🧪 Biomedical science is complex, high-stakes, and constantly evolving. 📊 CARDBiomedBench & tabular reasoning tests show that no current model can reliably meet the reasoning & domain-specific knowledge demands of biomedical researchers. Learn more about BiomedArena in thread 👇 🧵 #AI #LLMs #BiomedicalAI #AIEvaluation #OpenScience #LMArena #BiomedArena #NIH
1,67K