Article count:16428 Read by:87919360

Hottest Technical Articles
Exclusive: A senior executive of NetEase Games was taken away for investigation due to corruption
OPPO is going global, and moving forward
It is reported that Xiaohongshu is testing to directly direct traffic to personal WeChat; Luckin Coffee is reported to enter the US and hit Starbucks with $2, but the official declined to comment; It is reported that JD Pay will be connected to Taobao and Tmall丨E-commerce Morning News
Yu Kai of Horizon Robotics stands at the historical crossroads of China's intelligent driving
Lei Jun: Don't be superstitious about BBA, domestic brands are rising in an all-round way; Big V angrily criticized Porsche 4S store recall "sexy operation": brainless and illegal; Renault returns to China and is building a research and development team
A single sentence from an overseas blogger caused an overseas product to become scrapped instantly. This is a painful lesson. Amazon, Walmart, etc. began to implement a no-return and refund policy. A "civil war" broke out between Temu's semi-hosted and fully-hosted services.
Tmall 3C home appliances double 11 explosion: brands and platforms rush to
Shareholders reveal the inside story of Huayun Data fraud: thousands of official seals were forged, and more than 3 billion yuan was defrauded; Musk was exposed to want 14 mothers and children to live in a secret family estate; Yang Yuanqing said that Lenovo had difficulty recruiting employees when it went overseas in the early days
The app is coming! Robin Li will give a keynote speech on November 12, and the poster reveals a huge amount of information
It is said that Zhong Shanshan asked the packaged water department to sign a "military order" and the entire department would be dismissed if the performance did not meet the standard; Ren Zhengfei said that it is still impossible to say that Huawei has survived; Bilibili reported that employees manipulated the lottery丨Leifeng Morning News
Account Entry

Shocking the scientific community! DeepMind AI solves the "protein folding" problem and overcomes the great challenge of biology in the past 50 years

Latest update time:2020-12-01
    Reads:


Nature: It will change everything!

Author | Bei Shuang

AI has once again made a major breakthrough in the field of biological sciences!

On November 30th, US time, DeepMind, an artificial intelligence company under Google's parent company Alphabet, publicly announced that it has successfully solved protein folding prediction, a major problem in the biological community for 50 years.

The AI ​​system that solved this problem was AlphaFold, which shocked the scientific community when it was launched in 2018.

DeepMind stated in its official blog: The latest version of AlphaFold has been recognized by the authoritative protein structure prediction evaluation organization (Critical Assessment of protein Structure Prediction, CASP) in terms of accurately predicting protein folding structure through amino acid sequence.

As soon as the news came out, it immediately appeared on the cover of Nature magazine, with a direct commentary title: "It will change everything!".

At the same time, Google CEO and Chief Executive Officer Sundar Pichai, Stanford professor Fei-Fei Li, Musk and many other technology giants also retweeted their congratulations as soon as possible!

So what kind of research is this major breakthrough that shocked the technology, biology and scientific communities?

1


AlphaFold: Overcoming a 50-year biological problem

First of all, we need to understand why we need to predict protein folding structure?

As we all know, proteins are essential to life. Almost all diseases, including cancer and dementia, are related to the function of proteins. The function of proteins is determined by their 3D structure.

Christian Anfinsen, winner of the 1972 Nobel Prize in Chemistry, once proposed that the 3D structure of a protein can be calculated and predicted based on its 1D amino acid sequence.

But a practical challenge is that a protein's 3D structure can fold in billions of ways before it is formed.

American molecular biologist Cyrus Levinthal pointed out that if brute force is used to calculate all possible configurations of proteins, it may take longer than the time of the universe. A typical protein may have 10 300 possible configurations.

Therefore, from 1972 to the present, how to accurately predict how proteins fold has been a major challenge in the biological community.

However, the major challenge that has plagued the biological community for 50 years was successfully overcome by DeepMind yesterday. The company's latest AlphaFold system achieved an overall median score of 92.4GDT in the 14th CASP evaluation.

This means that the mean error (RMSD) of AlphaFold's predictions is only 1.6 angstroms (1 angstrom equals 0.1nm), equivalent to the width of an atom.

More importantly, even for the most challenging proteins - free modeling proteins, AlphaFold's median score reached 87.0 GDT

The prediction accuracy of the free modeling class in CASP continues to improve (GDT)

Two examples of free modeling-like protein targets

In this regard, Professor John Moult, President of CASP, said at a press conference,

DeepMind's AlphaFold system has achieved unparalleled accuracy in protein structure prediction. The grand challenges in computer science over the past 50 years have been largely solved.

It should be noted that CASP is the most authoritative organization in the world for evaluating protein structure prediction technology. It was founded by Professors John Moult and Krzysztof Fidelis in 1994 and conducts blind review every two years. Among them, GDT (Global Distance Test) is the main indicator used by CASP to measure prediction accuracy, and its range is from 0-100.

In simple terms, GDT can be roughly considered as the percentage of amino acid residues within a threshold distance to the correct position, and a GDT of around 90 can be considered competitive with the results obtained by experimental methods.

In this regard, Arthur D. Levinson, founder and CEO of CALICO, spoke highly of it:

AlphaFold is the culmination of a generation of products that predict protein structures with incredible speed and accuracy. This leap forward demonstrates how computational methods will transform biological research and hold great promise for accelerating the drug discovery process.

2


The AI ​​mechanism behind AlphaFold

A folded protein can be viewed as a "spatial graph" where residues are nodes and edges are densely connected together.

This diagram represents the neural network model architecture of the AlphaFold system. The model operates on both protein sequences and amino acid residues - iteratively passing information between the two representations to generate structure.

This process is important for understanding the physical interactions within proteins as well as their evolutionary history.

For the latest version of AlphaFold, the researchers created an attention-based neural network system that was trained end-to-end to try to interpret the structure of this graph while reasoning about the implicit graph it built. It refines this graph structure by using multiple sequence alignment (MSA) and representations of amino acid residue pairs.

By iterating this process, the system can make accurate predictions about the basic physical structure of proteins and is able to determine highly accurate structures within a few days. In addition, AlphaFold can also use internal confidence levels to predict which parts of each predicted protein structure are reliable.

The data used by the AlphaFold system comes from a large database of about 170,000 protein structures and protein sequences of unknown structures. When training, it uses about 128 TPU v3 cores (roughly equivalent to 100-200 GPUs) and runs for only a few weeks. This is a relatively small amount of computation in the context of most of the most advanced large models used in machine learning today.

3


Second-generation AlphaFold

DeepMind co-founder and CEO Demis Hassabis said: "DeepMind's ultimate vision has always been to build general AI to accelerate the pace of scientific discovery and help us better understand the world around us."

This time, the AlphaFold system has overcome a major problem that has been solved for 50 years, which means that DeepMind has taken another solid step towards this vision.

AlphaFold became a blockbuster when it was first launched in 2018. In the CASP competition, the "Olympic Games for Protein Structure Prediction" at the time, AlphaFold achieved the highest accuracy among all contestants, and was 8 times that of the second place.

After two years of hard work, DeepMind updated AlphaFold based on the new deep learning structure system, breaking its own record again - jumping from 60GDT to 92.4GDT.

Compared with other similar AIs, AlphaFold's accuracy is also far ahead.

The DeepMind development team said that AlphaFold can achieve unprecedented accuracy because its research method was inspired by the fields of biology, physics and machine learning, and the research results on protein folding over the past half century played an important role.

As an AI tool for the scientific community, AlphaFold's application scenarios and value have already become apparent.

As the epidemic continued to spread this year, DeepMind researchers used AlphaFold to predict several protein structures of the coronavirus SARS-CoV-2, including ORF3a, ORF8, etc.

Despite the challenging nature of this protein structure and the paucity of related sequences, AlphaFold achieved high accuracy in both predictions compared to the experimentally determined structures.

In addition to deepening our understanding of known diseases, AlphaFold's application potential will also extend to unknown areas of biology.

Since DNA specifies the sequence of amino acids that make up a protein structure, researchers reading protein sequences from nature on a large scale may have to count hundreds of millions of proteins in the Universal Protein Database (UniProt). What’s more, only about 170,000 of these proteins have 3D structures.

AI technologies like AlphaFold can help researchers discover yet-to-be-identified proteins.

Reference link:

  • https://deepmind.com/blog/article/alphafold-a-solution-to-a-50-year-old-grand-challenge-in-biology

  • https://www.cnbc.com/2020/11/30/deepmind-solves-protein-folding-grand-challenge-with-alphafold-ai.html


Previous recommendations




Latest articles about

Database "Suicide Squad" 
Exclusive: Yin Shiming takes over as President of Google Cloud China 
After more than 150 days in space, the US astronaut has become thin and has a cone-shaped face. NASA insists that she is safe and healthy; it is reported that the general manager of marketing of NetEase Games has resigned but has not lost contact; Yuanhang Automobile has reduced salaries and laid off employees, and delayed salary payments 
Exclusive: Google Cloud China's top executive Li Kongyuan may leave, former Microsoft executive Shen Bin is expected to take over 
Tiktok's daily transaction volume is growing very slowly, far behind Temu; Amazon employees exposed that they work overtime without compensation; Trump's tariff proposal may cause a surge in the prices of imported goods in the United States 
OpenAI's 7-year security veteran and Chinese executive officially announced his resignation and may return to China; Yan Shuicheng resigned as the president of Kunlun Wanwei Research Institute; ByteDance's self-developed video generation model is open for use丨AI Intelligence Bureau 
Seven Swordsmen 
A 39-year-old man died suddenly while working after working 41 hours of overtime in 8 days. The company involved: It is a labor dispatch company; NetEase Games executives were taken away for investigation due to corruption; ByteDance does not encourage employees to call each other "brother" or "sister" 
The competition pressure on Douyin products is getting bigger and bigger, and the original hot-selling routines are no longer effective; scalpers are frantically making money across borders, and Pop Mart has become the code for wealth; Chinese has become the highest-paid foreign language in Mexico丨Overseas Morning News 
ByteDance has launched internal testing of Doubao, officially entering the field of AI video generation; Trump's return may be beneficial to the development of AI; Taobao upgrades its AI product "Business Manager" to help Double Eleven丨AI Intelligence Bureau 

 
EEWorld WeChat Subscription

 
EEWorld WeChat Service Number

 
AutoDevelopers

About Us Customer Service Contact Information Datasheet Sitemap LatestNews

Room 1530, Zhongguancun MOOC Times Building,Block B, 18 Zhongguancun Street, Haidian District,Beijing, China Tel:(010)82350740 Postcode:100190

Copyright © 2005-2024 EEWORLD.com.cn, Inc. All rights reserved 京ICP证060456号 京ICP备10001474号-1 电信业务审批[2006]字第258号函 京公网安备 11010802033920号