Researchers develop AI system translating protein sequences into natural language-Xinhua

Researchers develop AI system translating protein sequences into natural language

Source: Xinhua

Editor: huaxia

2026-07-02 22:13:15

JERUSALEM, July 2 (Xinhua) -- Researchers have developed an AI system called "BetaDescribe" that translates complex protein sequences into readable natural-language descriptions, the Israel Institute of Technology said in a statement on Thursday.

This breakthrough, published in the U.S. scientific journal Proceedings of the National Academy of Sciences, is expected to significantly accelerate medical research and reduce costs in drug discovery and biotechnology, the researchers said.

Proteins are essential to biological functions and underpin medical advances such as the diabetes drug Ozempic.

However, despite the existence of billions of proteins in nature, scientists have identified the functions of only a small fraction, largely due to years of costly lab work.

"BetaDescribe" works like a specialized translator, turning raw biological data into clear, detailed descriptions of a protein's function, metabolic role, and medical potential, according to the researchers.

The AI's ability to rapidly generate evidence-based hypotheses about the functions of unknown proteins could significantly shorten the path from basic discovery to medical and industrial applications, the researchers concluded.