MIT biologists glean insight into repetitive protein sequences

Super computational analysis reveals that many repetitive sequences are shared across proteins and are similar in species from bacteria to humans.

About 70 percent of all human proteins include at least one sequence consisting of a single amino acid repeated many times, with a few other amino acids sprinkled in. These “low-complexity regions” are also found in most other organisms. MIT researchers used a technique called dot-plot matrix, which is a way to visually represent amino acid sequences, to compare protein sequences known as “low-complexity regions” across many different species. Credits:Image: Courtesy of the researchers, and edited by MIT News.

The proteins that contain these sequences have many different functions, but MIT biologists have now come up with a way to identify and study them as a unified group. Their technique allows them to analyze similarities and differences between LCRs from different species and helps them to determine the functions of these sequences and the proteins in which they are found.

Using their technique, the researchers analyzed all of the proteins found in eight different species, from bacteria to humans. They found that while LCRs can vary between proteins and species, they often share a similar role — helping the protein in which they’re found to join a larger-scale assembly such as the nucleolus, an organelle found in nearly all human cells.

“Instead of looking at specific LCRs and their functions, which might seem separate because they’re involved in different processes, our broader approach allows us to see similarities between their properties, suggesting that maybe the functions of LCRs aren’t so disparate after all,” says Byron Lee, an MIT graduate student.

The researchers also found some differences between LCRs of different species and showed that these species-specific LCR sequences correspond to species-specific functions, such as forming plant cell walls.

Lee and graduate student Nima Jaberi-Lashkari are the lead authors of the study, which appears today in eLife. Eliezer Calo, an assistant professor of biology at MIT, is the senior author of the paper.

Large-scale study

Previous research has revealed that LCRs are involved in a variety of cellular processes, including cell adhesion and DNA binding. These LCRs are often rich in single amino acids such as alanine, lysine, or glutamic acid.

Finding these sequences and then studying their functions individually is a time-consuming process, so the MIT team decided to use bioinformatics — an approach that uses computational methods to analyze large sets of biological data — to evaluate them as a larger group.

“What we wanted to do is take a step back and instead of looking at individual LCRs, try to take a look at all of them and to see if we could observe some patterns on a larger scale that might help us figure out what the ones that have assigned functions are doing, and also help us learn a bit about what the ones that don't have assigned functions are doing,” Jaberi-Lashkari says.

To do that, the researchers used a technique called dot plot matrix, which is a way to visually represent amino acid sequences, to generate images of each protein under study. They then used computational image processing methods to compare thousands of these matrices at the same time.

Using this technique, the researchers were able to categorize LCRs based on which amino acids were most frequently repeated in the LCR. They also grouped LCR-containing proteins by the number of copies of each LCR type found in the protein. Analyzing these traits helped the researchers to learn more about the functions of these LCRs.

As one demonstration, the researchers picked out a human protein, known as RPA43, that has three lysine-rich LCRs. This protein is one of many subunits that make up an enzyme called RNA polymerase 1, which synthesizes ribosomal RNA. The researchers found that the copy number of lysine-rich LCRs is important for helping the protein integrate into the nucleolus, the organelle responsible for synthesizing ribosomes.

Biological assemblies

In a comparison of the proteins found in eight different species, the researchers found that some LCR types are highly conserved between species, meaning that the sequences have changed very little over evolutionary timescales. These sequences tend to be found in proteins and cell structures that are also highly conserved, such as the nucleolus.

“These sequences seem to be important for the assembly of certain parts of the nucleolus,” Lee says. “Some of the principles that are known to be important for higher-order assembly seem to be at play because the copy number, which might control how many interactions a protein can make, is important for the protein to integrate into that compartment.”

The researchers also found differences between LCRs seen in two different types of proteins that are involved in nucleolus assembly. They discovered that a nucleolar protein known as TCOF contains many glutamine-rich LCRs that can help scaffold the formation of assemblies, while nucleolar proteins with only a few of these glutamic acid-rich LCRs could be recruited as clients (proteins that interact with the scaffold).

Another structure that appears to have many conserved LCRs is the nuclear speckle, which is found inside the cell nucleus. The researchers also found many similarities between LCRs that are involved in forming larger-scale assemblies such as the extracellular matrix, a network of molecules that provides structural support to cells in plants and animals.

The research team also found examples of structures with LCRs that seem to have diverged between species. For example, plants have distinctive LCR sequences in the proteins that they use to scaffold their cell walls, and these LCRs are not seen in other types of organisms.

The researchers now plan to expand their LCR analysis to additional species.

“There’s so much to explore because we can expand this map to essentially any species,” Lee says. “That gives us the opportunity and the framework to identify new biological assemblies.”

The research was funded by the National Institute of General Medical Sciences, National Cancer Institute, the Ludwig Center at MIT, a National Institutes of Health Pre-Doctoral Training Grant, and the Pew Charitable Trusts.

FAU lands NASA grant to inspire local high school students

Researchers from Florida Atlantic University’s College of Engineering and Computer Science have received a three-year $478,699 grant from the National Aeronautics and Space Administration (NASA) for a project titled, “MAA Experiential Learning Opportunities for South Florida Underserved High School Students.”

FAU was one of only eight institutions in the nation to be awarded NASA’s Minority University Research and Education (MUREP) award for the MUREP Aerospace Academy (MAA). Through cooperative agreement awards, MAA funding allows minority-serving institutions to develop exciting new avenues to inspire local high school students in the STEM (science-technology-engineering-mathematics) fields.

Administered through NASA’s Office of STEM Engagement, MUREP supports the training and development of students and faculty at minority-serving institutions in STEM fields. MUREP awards are established to strengthen and develop these institutions’ research capacity and infrastructure in areas of strategic importance and value to NASA’s mission and national priorities.

With this NASA grant, FAU is partnering with Florida Memorial University to offer students from Boyd Anderson High School in Broward County and Lake Worth Community High School in Palm Beach County, which serves predominantly black and Hispanic student populations, experiential learning opportunities, field trips, study tours, engagement with STEM professionals, and near-peer mentoring. NASA themes will be explored, including earth observations and aeronautics.

“We are incredibly excited that Florida Atlantic University was selected by NASA as one of only eight recipients in the United States to receive their Minority University Research and Education award,” said Stella Batalama, Ph.D., dean, FAU College of Engineering and Computer Science. “We remain committed to our mission to produce top-notch students and graduates in the STEM fields. This NASA grant will enable professors Arockiasamy and Nagarajan to provide exceptional training and research opportunities for underrepresented high school students in collaboration with outstanding partner institutions.”

The project is spearheaded by Madasamy Arockiasamy, Ph.D., principal investigator, professor, and director, Center for Infrastructure and Constructed Facilities, in FAU’s Department of Civil, Environmental, and Geomatics Engineering. His research expertise includes non-contact infrastructure condition monitoring, ocean, wind, and wave energy utilization, offshore/coastal structures, advanced high-strength composites, and sustainability and climate change impact on infrastructure.

“Data show that less than 3 percent of minority students nationwide have degrees in engineering or are employed in this field,” said Arockiasamy. “We are very grateful for receiving this NASA grant, which will help aspiring engineers to explore space and aeronautics, and importantly, to help us increase the number of America’s minority students entering into STEM-related programs and professions.”

Sudhagar Nagarajan, Ph.D., associate chair and associate professor, FAU’s Department of Civil, Environmental and Geomatics Engineering, is serving as co-principal investigator of the project. His research expertise includes non-contact infrastructure modeling, terrestrial, aerial, and space-borne laser scanning, multi-sensor-based registration and change detection, and drones for mapping. 

“Experiential learning is an essential part of high school and higher education at all levels. Internships, field experiences, and cooperative learning all help to prepare students for a successful transition into the workforce,” said Nagarajan.

Arockiasamy and Nagarajan plan to partner with various organizations throughout Florida for the MAA Experiential Learning Opportunities for South Florida Underserved High Students, including the South Florida Water Management District, Lockheed Martin, Aerojet Rocketdyne, NASA Centers – Kennedy Space Center, Glenn Research Center, Ames Research Center, Florida Space Grant Consortium, and the Gangals Nonprofit Organization.

Q&A: Why are dark matter halos of ultra-diffuse galaxies so … odd?

A UC Riverside physicist explains 

study co-led by physicists at UC Riverside and UC Irvine has found that dark matter halos of ultra-diffuse galaxies are very odd, raising questions about physicists’ understanding of galaxy formation and the structure of the universe. Hai-Bo Yu is a theoretical physicist at UC Riverside. (Samantha Tieu)

Ultra-diffuse galaxies are so-called because of their extremely low luminosity. The distribution of baryons — gas and stars — is much more spread out in ultra-diffuse galaxies compared to “normal” galaxies with similar masses. 

In the following Q&A, Hai-Bo Yu, an associate professor of physics and astronomy at UCR, shares his thoughts on the findings he and UCI’s Manoj Kaplinghat, a long-term collaborator of Yu’s, have published in The Astrophysical Journal about newly discovered ultra-diffuse galaxies and their dark matter halos.

Yu and Kaplinghat were joined in the research by Demao Kong of Tufts University, and Filippo Fraternali and Pavel E. Mancera Piña of the University of Groningen in the Netherlands. First author Kong will join UCR this fall.

The research was supported by grants from the National Science Foundation, Department of Energy, John Templeton Foundation, National Aeronautics and Space Administration, Netherlands Research School for Astronomy, and ASTRON, the Netherlands Institute for Radio Astronomy.

Q. What is a dark matter halo?

A dark matter halo is the halo of invisible matter that permeates and surrounds a galaxy or a cluster of galaxies. Although dark matter has never been detected in laboratories, physicists are confident dark matter, which makes up 85% of the universe’s matter, exists. 

Q. You’ve found dark matter halos of the ultra-diffuse galaxies are very odd. What is odd about them and what are you comparing them to?

The ultra-diffuse galaxies we studied are much less massive compared to, say, the Milky Way. They contain a lot of gas, however, and they have much higher gas mass than total stellar mass, which is opposite to what we see in the Milky Way. The ultra-diffuse galaxies also have large sizes.

The distribution of dark matter in these galaxies can be inferred from the motion of gas particles. What really surprises us is that the presence of baryonic matter itself, predominantly in the form of gas, is nearly sufficient to explain the measured velocity of gas particles and leaves little room for dark matter in the inner regions, where most of the stars and gas are located. 

This is very surprising because, in the case of normal galaxies, whose masses are similar to those of ultra-diffuse galaxies, it’s the opposite: dark matter dominates over baryonic matter. To accommodate this result, we conclude that these dark matter halos must have much lower “concentrations.” That is, they contain much less mass in their inner regions, compared to those of normal galaxies. In this sense, dark matter halos of the ultra-diffuse galaxies are “odd.”

At first glance, one would expect that such low-concentration halos are so rare that the ultra-diffuse galaxies would not even exist. After looking into the data from state-of-the-art numerical simulations of cosmic structure formation, however, we found the population of low-concentration halos is higher than the expectation.  

Q. What was involved in doing the study?

This is collaborative work. Filippo Fraternali and his student Pavel E. Mancera Piña are experts on the gas dynamics of galaxies. They discovered that ultra-diffuse galaxies rotate more slowly than normal galaxies with similar masses. We worked together to interpret measurement data of the gas motion of these galaxies and infer their dark matter distribution. Furthermore, we analyzed data from simulations of cosmic structure formation and identified dark matter halos that have similar properties as those inferred from the ultra-diffuse galaxies.

Q. Your findings raise questions about our understanding of galaxy formation/structure formation of the universe. How?

We have many questions regarding the formation and evolution of these newly discovered galaxies. For example, ultra-diffuse galaxies contain a lot of gas and we do not know how this gas is retained during galaxy formation. Further, our results indicate that these galaxies may be younger than normal galaxies. The formation of the ultra-diffuse galaxies is not well understood, and more work is needed. 

Q. What makes ultra-diffuse galaxies so interesting?

These are amazing objects to study because of their surprising properties, as discussed in our work. The newly discovered ultra-diffuse galaxies provide a new window for further testing our understanding of galaxy formation, probably even the nature of dark matter.