Harvard Medical School's Church listens in to how proteins talk; learning their language

Machine learning accelerates the design of synthetic proteins with desired functions, facilitating future therapeutic, diagnostic and biotechnology applications

Synthetic biologists have taken the evolution of proteins into their own hands by changing some that occur in nature or even by synthesizing them from scratch. Such engineered proteins are used as highly efficacious drugs, components of synthetic gene circuits that sense biological signals, or in the production of high-value chemicals in ways that are more effective and sustainable than petroleum-based methods.

To engineer them, they use two very different approaches. In "directed evolution", they randomly vary the linear sequence of amino acid building blocks encoding a natural protein and screen for variants with the desired activity; or they use "rational design" to model proteins based on their actual 3D structures to identify amino acids that likely will impact protein function. However, directed evolution can only cover a small part of the enormous space of possible protein sequences, while rational design approaches are limited by the relative scarcity of painstakingly resolved 3D protein structures. CAPTION Wyss Institute researchers have created an approach to engineer proteins that uses deep learning, and moves a lot of laborious laboratory experiments to the computer.  CREDIT Klinsley Stocum{module In-article}

Now, a research team led by George Church, Ph.D. at Harvard's Wyss Institute for Biologically Inspired Engineering and Harvard Medical School (HMS) has created a third approach to engineering proteins that uses deep learning to distill the fundamental features of proteins directly from their amino acid sequence without the need for additional information. The approach robustly predicts the functions of both natural and de novo designed proteins, and moves a lot of laborious laboratory experiments to the computer, achieving up to two orders of magnitude cost reduction compared to existing approaches. The study is published in Nature Methods.

Church is a Founding Core Faculty member of the Wyss Institute and Lead of its Synthetic Biology platform. He also is the Robert Winthrop Professor of Genetics at Harvard Medical School and Professor of Health Sciences and Technology at Harvard University and the Massachusetts Institute of Technology (MIT).

"Instead of extensively characterizing proteins to understand their design principles, we used a neural network to learn those rules in an unbiased way, by systematically looking for patterns in a vast trove of raw protein sequences in public databases," said Surojit Biswas, one of the three co-first authors on the study who is a graduate student in Church's group. "The neural network learned a lot of the rules that we as humans have previously come to know through many painstaking studies, and beyond that, it also discovered new features in proteins."

The neural network approach, which the researchers named "unified representation" (UniRep), can be likened to learning a language where the learner builds a semantic understanding of how complex sentences are constructed from strings of letters and words. In protein language, UniRep was trained to predict the next amino acid in a protein sequence starting from its first one by exploring all the possibilities in protein sequences contained in public databases. Importantly, while proceeding through the remainder of the protein, one amino acid at a time, UniRep makes and draws on an internal "summary" of the sequence it has seen so far in the protein, which the team calls its "hidden state", to take into account its sequence and structural features. Feeding that information, and results from many other proteins, back into its algorithm, UniRep gradually revises the way it constructs hidden states, which improves its predictive capabilities over time. In the language analogy, the learner will be able to predict the next word of a sentence he is reading with increasing likelihood, based on a constantly improving understanding of syntax and choice of words.

"We trained UniRep on about 24 million protein sequences for roughly 3 weeks to enable it to predict sequences and their relationship to features like protein stability, secondary structure, and accessibility of internal sequences to surrounding solvents within proteins it had never seen before," said Grigory Khimulya, who was a student at Harvard College and is also a co-first author along with Biswas and Ethan Alley. "UniRep accurately described these features in proteins from very different protein families whose structures had been well-characterized in previous studies, even in synthetic proteins that don't have a counterpart in nature."

The team took UniRep a step further and used it as a tool to predict how single amino acid substitutions impact the function of proteins. The neural network robustly quantified the effects of single amino acid mutations in eight different proteins with diverse biological functions including enzyme catalysis, DNA binding, molecular sensing. Also, using the Aequorea victoria green fluorescent protein (GFP) as a model, they tasked UniRep to analyze 64,800 variants of the protein, each carrying 1-12 mutations, which demonstrated that it could accurately anticipate how the distribution and relative burden of mutations changed the protein's brightness.

"Compared to other strategies, our data-driven approach reaches state-of-the-art or superior performance in predicting multiple properties of proteins at costs much lower than other methods," said Church. "This makes it a truly empowering tool for protein engineers in many areas."

"This new deep-learning-based computational approach to protein engineering has the potential to accelerate the design of synthetic proteins with functions tailored for any desired application, whether it before for therapeutics, diagnostics, biomanufacturing, biocatalysis, or any other application. It literally can change the way we carry out molecular design in the future," said Wyss Founding Director Donald Ingber, M.D., Ph.D., who is also the Judah Folkman Professor of Vascular Biology at HMS and the Vascular Biology Program at Boston Children's Hospital, as well as Professor of Bioengineering at Harvard's John A. Paulson School of Engineering and Applied Sciences.

Strathclyde quantum technology partnership secures £4.6 million

A University of Strathclyde-led quantum technology partnership, which is aiming to develop some of the world's most powerful computers, has secured funding worth a total of £4.6 million.

Researchers at Strathclyde and photonics and quantum technology company M Squared are collaborating to develop advanced computing technology, which could strengthen banking security, enhance traffic management and support aerospace security. The partners aim to work with fintech and aerospace businesses to deliver significant advancements in the UK's quantum capabilities.

The research has gained £2.6 million in Prosperity Partnership funding from the Engineering and Physical Sciences Research Council (EPSRC) as part of UK Research and Innovation, while M Squared has invested a further £1.7 million, and Strathclyde has contributed £300,000.

Future applications of the technology developed in this partnership include accelerated drug design for improved healthcare, novel materials for aerospace and manufacturing and the speed-up of optimization problems including scheduling and logistics for enhanced traffic management or improved efficiency in energy distribution across the National grid.{module In-article}

Prosperity Partnership grants are awarded to projects that build links between the UK's research base and leading industry partners. The project will complement the work of the UK-wide Quantum Technology Hubs, which explore the properties of quantum mechanics and how they can be harnessed for use in technology. Strathclyde is the only university that is a partner in all four of the hubs.

Strathclyde's lead in the Prosperity Partnership project is Dr. Jonathan Pritchard, a Research Fellow in the University's Department of Physics.

He said: "We are excited to have secured significant funding to develop this new experimental platform, which is currently not supported within the quantum technology hubs but which offers the potential to be truly disruptive in its ability to scale in future to extremely large numbers of qubits.

"This is a great opportunity, both for Strathclyde and for the UK as a whole, to establish new capability by working directly with global leaders in supplying commercial laser systems to quantum computing activities.

"This project will develop a new platform for quantum computing, based on scalable arrays of neutral atoms, and will work together with partners from industries including security and defense, aerospace and fintech to establish applications where this new quantum computer can deliver an advantage."

Dr. Graeme Malcolm, CEO, and founder at M Squared said: "This partnership with the University of Strathclyde is a demonstration of the scientific community harnessing the enormous benefits of collaboration between world-leading research institutions and advanced industry in the quest to develop frontier technologies.

"At M Squared we are thrilled to be at the heart of this groundbreaking work which the new funding will help to accelerate. Cross-sector investment is proving critical for progressing the UK's quantum computing capabilities, and we are extremely proud to be at the forefront.

"Glasgow has all the requisite components to pioneer the coming quantum era on the world stage and play a globally significant role in shaping the future of these defining years for the trajectory of this technology."

Business Secretary Andrea Leadsom said: "Cyber-attacks can have a particularly nasty impact on businesses, from costing them thousands of pounds in essential revenue to reputational harm.

"Cyber-criminals operate in the shadows, with the severity, scale, and complexity of breaches constantly evolving. It's critical that we are ahead of the game and developing new technologies and methods to confront future threats, supporting our businesses and giving them peace of mind to deliver their products and services safely.

"Investing in our world-leading researchers and businesses to develop better defense systems makes good business and security sense."

UK Research and Innovation Chief Executive, Professor Sir Mark Walport, said: "Our citizens and businesses must be able to access digitally secure products and services that are not vulnerable to cyber threats.

"The investments announced today will help to ensure the UK has a robust system in place to withstand cyber threats and create a safer future online, increasing trust and productivity in our economy."

EPSRC Executive Chair Professor Lynn Gladden said: "By combining expertise from across academia and industry, the Prosperity Partnerships will break new ground in areas of fundamental research that also provide major commercial opportunities.

"The partnerships announced today demonstrate the critical role that collaboration between UK researchers and industry partners will play in developing the revolutionary technologies of tomorrow."

Where more conventional computers operate on a binary system of zeroes and ones, quantum computers work on a system of qubits, which can be zeroes and ones simultaneously. As the number of qubits increases, the power of these computers rises dramatically, with each additional atom doubling the capacity of the computer. The new project is targeting 100 qubits, offering more computational power than even the largest available supercomputer.

Blanket of light may give better quantum supercomputers

Researchers from DTU Physics describe in an article in Science, how -- by simple means -- they have created a 'carpet' of thousands of quantum-mechanically entangled light pulses

Quantum mechanics is one of the most successful theories of natural science, and although its predictions are often counterintuitive, not a single experiment has been conducted to date of which the theory has not been able to give an adequate description.

Along with colleagues at bigQ (Center for Macroscopic Quantum States - a Danish National Research Foundation Center of Excellence), center leader Prof. Ulrik Lund Andersen is working on understanding and utilizing macroscopic quantum effects:

"The prevailing view among researchers is that quantum mechanics is a universally valid theory and therefore also applicable in the macroscopic day-to-day world we normally live in. This also means that it should be possible to observe quantum phenomena on a large scale, and this is precisely what we strive to do in the Danish National Research Foundation Center of Excellence bigQ," says Ulrik Lund Andersen. 

In a new article in the most prestigious international journal Science, the researchers describe how they have succeeded in creating entangled, squeezed light at room temperature. A discovery that could pave the way for less expensive and more powerful quantum supercomputers. CAPTION Artwork illustrating the cluster state generated in our work.  CREDIT Jonas S. Neergaard-Nielsen{module In-article}

Their work concerns one of the most notoriously difficult quantum phenomena to understand: entanglement. It describes how physical objects can be brought into a state in which they are so intricately linked that they can no longer be described individually.

If two objects are entangled, they must be seen as a unified whole regardless of how far from each other they are. They will still behave as one unit--and if the objects are measured individually, the results will be correlated to such a degree that it cannot be described based on the classical laws of nature. This is only possible using quantum mechanics.

Entanglement is not restricted to pairs of objects. In their efforts to observe quantum phenomena on a macroscopic scale, the researchers at bigQ managed to create a network of 30,000 entangled pulses of light arranged in a two-dimensional lattice distributed in space and time. It is almost like when a myriad of colored threads are woven together into a patterned blanket.

The researchers have produced light beams with special quantum mechanical properties (squeezed states) and woven them together using optical fiber components to form an extremely entangled quantum state with a two-dimensional lattice structure--also called a cluster state.

"As opposed to traditional cluster states, we make use of the temporal degree of freedom to obtain the two-dimensional entangled lattice of 30.000 light pulses. The experimental setup is surprisingly simple. Most of the effort was in developing the idea of the cluster state generation," says Mikkel Vilsbøll Larsen, the lead author of the work.

Creating such an extensive degree of quantum physical entanglement is--in itself--interesting basic research,

The cluster state is also a potential resource for creating an optical quantum computer. This approach is an interesting alternative to the more widespread superconducting technologies, as everything takes place at room temperature.

Also, the long coherence time of the laser light can be utilized--meaning that it is maintained as a precisely defined light wave even over very long distances.

An optical quantum computer will therefore not require costly and advanced refrigeration technology. At the same time, its information-carrying light-based qubits in the laser light will be much more durable than their ultra-cold electronic relatives used in superconductors.

"Through the distribution of the generated cluster state in space and time, an optical quantum computer can also more easily be scaled to contain hundreds of qubits. This makes it a potential candidate for the next generation of larger and more powerful quantum computers," adds Ulrik Lund Andersen.