CTI-REALM is Microsoft’s open-source benchmark that evaluates AI agents on real-world detection engineering. It measures whether an agent can take cyber threat intelligence (CTI) and produce validated ...
In 2025, my team within the Soldier Evaluation Directorate won the U.S. Army Test and Evaluation Command (ATEC) AI Challenge with a tool that could ...
A computational method called scSurv, developed by researchers at Institute of Science Tokyo, links individual cells to patient outcomes using widely available bulk RNA sequencing data. The approach ...
Sandia National Laboratories conducted the first-ever blind comparison of seven commercial PV modeling software packages, revealing that differences in weather handling, system modeling, derates, and ...
Tests and exams often reveal too late what should have been known earlier. Stealth assessment and adaptive training may provide exciting opportunities.