Introducing LifeSciBench
LifeSciBench is an expert-authored, expert-reviewed benchmark for evaluating how AI systems handle real-world life science research tasks and decisions.
This is a summary curated by AIFuture. Read the full story on OpenAI.