Things & Thinks-Issue LXVII

Things & Thinks-Issue LXVII

📚Research Digest

Microsoft's MAI-DxO: Orchestrating Smarter, Cheaper Clinical Reasoning

What it is about

This study presents a new benchmark by Microsoft—Sequential Diagnosis Benchmark (SDBench)—designed to evaluate AI systems in realistic clinical diagnostic scenarios. Unlike traditional multiple-choice evaluations, SDBench simulates real-world medical encounters by requiring stepwise reasoning: starting from a brief case summary, the AI (or physician) must iteratively ask questions and request tests before making a diagnosis. It uses 304 challenging NEJM clinicopathological conference cases and includes cost estimation for each test or physician interaction. The authors also introduce MAI Diagnostic Orchestrator (MAI-DxO), an AI framework inspired by team-based clinical reasoning. MAI-DxO simulates a virtual panel of physician roles, enabling strategic, cost-conscious diagnostic decisions. When paired with OpenAI’s o3 model, MAI-DxO achieved up to 85.5% diagnostic accuracy—far exceeding the 20% average of experienced physicians—while also reducing costs by up to 70% compared to non-orchestrated AI models.

Article content

What it means

By demonstrating that structured orchestration boosts both diagnostic accuracy and cost-efficiency, the authors suggest that AI systems can eventually serve as powerful diagnostic collaborators, especially in under-resourced settings. The model-agnostic nature of MAI-DxO provides resilience to rapidly evolving AI tools, avoiding constant retraining. Moreover, tools like MAI-DxO could assist not only in improving care delivery but also in medical education, patient triage, and global health access—particularly where experienced specialists are scarce. Future efforts will need to test such systems in routine, everyday clinical environments and integrate modalities like imaging for broader diagnostic coverage. Ultimately, this signals a shift toward AI-augmented, team-based medical diagnostics that blend efficiency, accuracy, and accessibility.

SensorLM

What it is about

This next one, by Google researchers, introduces SensorLM, a family of foundational models that align wearable sensor data with natural language. Recognizing the challenge of interpreting real-world sensor streams due to the scarcity of paired sensor-text data, the authors developed a hierarchical captioning pipeline to systematically generate textual descriptions capturing statistical, structural, and semantic aspects of raw sensor data. This enabled the creation of the largest known sensor-language dataset, covering over 59.7 million hours of wearable recordings from 103,000+ individuals. SensorLM extends well-known multimodal architectures like CLIP and CoCa into a unified sensor-language framework. Through rigorous evaluations in domains like human activity recognition and healthcare, SensorLM outperforms current methods across zero-shot, few-shot, and cross-modal tasks, demonstrating strong generalization, scaling properties, and efficient learning with limited labels.

Article content

What it means

SensorLM;s zero-shot and few-shot abilities could drastically reduce the need for labeled data, accelerating deployment in fields like personalized health monitoring, elderly care, and activity tracking. The hierarchical captioning technique could influence future dataset construction in other sensing modalities. Moreover, by integrating sensor data into the same multimodal space as text and vision, SensorLM paves the way for more intuitive human-AI interaction, where users can query sensor states or trends in plain language. This research signals a move toward foundation models for ambient intelligence, making real-world sensing more explainable, scalable, and adaptable.


🖇Digital Healthcare News

#GenAI and #BigTech in #Healthcare

Elon Musk’s xAI launched ‘Grok for Government’ for healthcare and science use.

Physician networking company Doximity released a free AI scribe, highlighting the subsector's lack of technical moat.

Regulatory Brief

UK NHS will develop AI technology to scan NHS systems to flag safety issues in real time and trigger crucial inspections earlier, as part of the UK government’s Plan for Change to shift NHS services from analogue to digital under the 10 Year Health Plan.

Aktiia received FDA 510(k) clearance for its over-the-counter cuffless blood pressure monitor, G0 Blood Pressure Monitoring System, also known as the Hilo Band.

Neu Health, a smartphone platform for Parkinson's disease and dementia, announced it received FDA 510(k) clearance for its smartphone-based tremor measurement module, which aims to quantify tremor in adults with mild to moderate Parkinson's disease.  

Pharma/Device Brief

Revolution Medicines collaborated with Iambic Therapeutics’ AI drug discovery platform, striking a multi-year technology and research collaboration.

Funding, Deals, Mergers & acquisitions

Ambience Healthcare, an ambient AI documentation company, raised $243M

OpenEvidence, an AI-enabled medical research aggregate platform for doctors, raised $210M

AI-enabled imaging company Aidoc raised $150M, partly to accelerate the development of CARE, a clinical-grade foundation model.

Slingshot AI raised $53M to launch an AI chatbot trained on sessions with human therapists

Mandolin, a platform that uses AI automation to enhance access to specialty drugs, raised $40M

Arbital Health, an AI platform to help manage value-based care contracts, raised $31M

Charta Health, an AI-enabled platform that automates billing and coding workflows, raised $22M

Consumer Digital Health & Other News

Samsung acquired digital health company Xealth, a platform that helps providers manage digital tools,


📙Longread of the Month

Not really a long read but definitely worth a thought

Article content

🦜Tweet of the Month

This is a funny one...

Article content

📊Chart of the Month

According to this McKinsey review, pharma leads in AI economic potential with high R&D acceleration opportunities (40-120% gains)

Article content

Liked what you read? Subscribe & Share! I would love to hear your feedback and thoughts. You can also connect with me via Twitter and LinkedIn!

To view or add a comment, sign in

More articles by Santosh Shevade

  • Things & Thinks-Issue LXXI

    📚Research Digest Hippocratic AI, Wellspan Health & multilingual reach What it is about This paper, co-written by teams…

  • AI Works in Demos. Reality Is Messier.

    Introduction: The Jagged Edge of Intelligence You may have noticed this several times..

  • Things & Thinks-Issue LXX

    📚Research Digest Thirty Days with an AI Scribe: Less Burnout, More Time for Care What it is about This multicenter…

    1 Comment
  • Things & Thinks-Issue LXIX

    📚Research Digest Epic's Comet: Scaling Generative Models for Predictive Healthcare What it is about This preprint, by…

  • Everyone Gets a Copilot. Now What?

    A year ago, enterprise leaders were scrambling to “get into AI.” Now, they’re busy distributing it.

    6 Comments
  • Things & Thinks-Issue LXVIII

    📚Research Digest Google Gemini's Insulin Resistance Literacy and Understanding Agent What it is about This research…

  • Plot Twist: Your Boring Industry Knowledge May Just Be the Hottest Skill in Tech

    Is the pendulum swinging away? Looks like we are going from "anyone can code" to "actually, maybe we need people who…

    4 Comments
  • Things & Thinks-Issue LXVI

    📚Research Digest Insights from Stanford’s Med-HELM Evaluation What it is about Med-HELM is a benchmarking study from…

    4 Comments
  • Pilots Don’t Scale Themselves. Pharma Needs a Smarter Lab.

    Over the past year, nearly every major pharmaceutical company has launched some form of generative AI initiative…

    2 Comments
  • Things & Thinks-Issue LXV

    📚Research Digest ➡️HealthBench What it is about This paper by OpenAI researchers introduces HealthBench, an…

    3 Comments

Others also viewed

Explore content categories