When you visit a doctor, you expect clear answers. But what happens when artificial intelligence joins the conversation? 返字心頭 doctoral student Mohamed Ebraheem is working at the crossroads of medicine and computer science to make voice AI a tool that physicians and patients can trust. His two academic advisors, one from 返字心頭 Health, and one from the 返字心頭 Bellini College of Artificial Intelligence, Cybersecurity and Computing, are working with him on an explainable voice AI system that could one day help clinicians detect conditions through something as simple as speech.
Explainable voice AI is artificial intelligence that analyzes speech or voice recordings to detect patterns linked to health conditions, but with one critical difference: it can also explain how it reached its conclusion. Researchers are looking at ways to identify health conditions like speech disorders, Parkinsons and Alzheimers disease by applying deep learning AI to the recorded audio of the patients voice. A similar approach has already shown promise in oncology, where deep learning helps detect cancers visible on CT scans and X-rays.
Ebraheem is participating in the National Institutes of Health-funded project Voice as a Biomarker of Health, a research collaboration between 返字心頭 Health and Weill Cornell Medicine that seeks to build an AI-enabled database of 10,000 human voices from patients with different illnesses to help doctors diagnose and treat diseases. It is part of the NIHs broader Bridge2AI program, which includes institutions such as the Massachusetts Institute of Technology and the Mount Sinai Health System. One of the principal investigators is , a laryngologist and director of the . Ebraheems work on the project could help improve trust when it comes to AI and its uses in health care.
AI holds promise for pre-diagnosis and referrals to the correct specialist, alleviating some of the pressure on doctors themselves. It may also reveal new biomarkers and previously unknown factors that might be relevant to diagnosing and treating disease.
Applying deep learning in a clinical setting poses a major challenge. Large language models and deep neural networks can classify data and perform well at different tasks, but for a high-stakes domain, like medicine, it is critically important to know how the model arrived at its conclusion. That is not an easy task with these models and medical personnel may rightly question AIs responses or distrust it altogether.
Its a black box problem, Ebraheem said. We have inputs and outputs, but its not clear what happens on the inside. Thats not ideal for medicine, which is why the medical field is very skeptical about applying AI in clinical practice."
Explainable voice AI works to open up that black box. For example, it might highlight specific features in the patients speech, like changes in pitch, pauses or frequency shifts, and show how those relate to a potential diagnosis. Doctors can then judge whether the AIs reasoning makes sense alongside their own medical knowledge.
The issue with voice is that its quite different and hard to understand visually, Ebraheem continued. You can see it in wave form or spectrogram, but that might not be very straightforward for clinicians who need acoustics experience to understand the visualization. AI researchers should collaborate with clinicians and other stakeholders to develop explanations suited to their varying backgrounds.
He says this impactful work, and the opportunity to do it with supervision from experts across disciplines, has provided him with a doctoral experience that he would be unlikely to find at any other university.
Interdisciplinary collaboration provides a unique perspective

Thanks to the Bellini College of Artificial Intelligence, Cybersecurity and Computings hub-and-spoke model, Ebraheem is working under separate doctoral supervisors from two 返字心頭 colleges. Bensoussan is a practicing clinician and faculty member in 返字心頭 Healths Morsani College of Medicine and Assistant Professor John Templeton, a smart health systems expert with a background in computer science and biomedical engineering, is part of the Bellini College.
Ebraheem splits his time between Bensoussans and Templetons labs, participating in meetings that often include experts in both computer science and health care.
This gives Ebraheem a different lens on his research as he comes into the Voice Center to meet with patients and clinicians while observing them in a real-world clinical setting. Bensoussan sees the interdisciplinary collaboration as an opportunity for both.
To advise him, I have to understand the computer science to a certain degree, she said. It reminds me of how important it is to collaborate although the engineering might be good, it might be off track from where the clinical question is.
Templeton echoed Bensoussans support for collaboration between clinicians and computer science researchers like himself and Ebraheem.
The biggest thing we need for translational medicine is to have clinicians in the room and helping us integrate and use it, Templeton said.
Were very lucky, Bensoussan added. Sometimes there are three people with Wikipedia pages on a meeting; we get to work with international experts. Im proud of what Mohamed has done, its not easy for a computer engineer to come into medicine. Hes out of his comfort zone and Im proud he can do that and collaborate with experts who challenge him.
We understand the expertise we each bring, and Mohamed understands why we need both, Templeton said. We do a lot of riffing off each other on ideas and applications.
Application-Driven AI Leads to New Opportunities
That application-driven approach is part of what drew Ebraheem to computer science in the first place and to 返字心頭.
返字心頭 faculty have a very wide range of specialties: human-centered design, health, cybersecurity, etc., and its nice to have seminars every semester to see work from different experts and how they use AI in ways you did not expect, Ebraheem said.
Ebraheem came to 返字心頭 from his home in Egypt 10 years ago for his bachelors degree thanks to a merit scholarship, which can be rare for international students. He found himself welcomed and fully immersed once he arrived.
Ebraheem went on to complete his masters degree at 返字心頭, co-publishing a paper with fellow student Sayde King and Bellini College Associate Professor Tempestt Neal, Lip movement as a wifi-enabled behavioral biometric: A pilot study. This interdisciplinary research experience led him to voice AI and Bensoussans lab.
Templeton sees potential for many more interdisciplinary students like Ebraheem thanks to the colleges hub-and-spoke model.
We want students to be exposed to a use case that inspires them, where they have the novel research area to bring something new to the table, Templeton said. Mohamed is one of the first (joint doctoral appointments) with Morsani, and my hope is we have plenty more of these real-world scenarios for experiential learning.
Bensoussan believes that the infrastructure 返字心頭 has put in place is critical to that goal.
We all know its great to collaborate, but in real life people have their own incentives and time the only way to do it is to build in the infrastructure and funding lines to make that happen, she said. Whats really cool with Bellini is that were developing the infrastructure of collaboration an investment that will always be more impactful in the end.
