Tämä on loistava esimerkki tekoälyn arviointiekosysteemistä@WhiteHouse jota ehdotetaan tekoälyn toimintasuunnitelmassa @DavidSacks @mkratsios47 @sriramk @deanwball alkaen
lmarena.ai
lmarena.ai19.8. klo 20.03
🧬 BiomedArena is here! We’re honored to partner with @DataTecnica and @NIH CARD, who developed BiomedArena to evaluate LLMs for biomedical discovery, and to help expand this domain-specific track in community-driven evaluations. 🧪 Biomedical science is complex, high-stakes, and constantly evolving. 📊 CARDBiomedBench & tabular reasoning tests show that no current model can reliably meet the reasoning & domain-specific knowledge demands of biomedical researchers. Learn more about BiomedArena in thread 👇 🧵 #AI #LLMs #BiomedicalAI #AIEvaluation #OpenScience #LMArena #BiomedArena #NIH
1,67K