Tags Posts tagged with "David McCandlish"

David McCandlish

David McCandlish, center, with postdoctoral researchers Anna Posfai and Juannan Zhou. Photo by Gina Motisi, 2020/ CSHL

By Daniel Dunaief

If cancer were simple, scientists would have solved the riddle and moved on to other challenges.

Often, each type of the disease involves a combination of changes that, taken together, not only lead to the progression of cancer, but also to the potential resistance to specific types of treatment.

Using math, David McCandlish, Assistant Professor at Cold Spring Harbor Laboratory, is studying how the combination of various disruptions to the genome contribute to the development of cancer.

McCandlish recently published a study with colleagues at Cold Spring Harbor Laboratory in the journal Proceedings of the National Academy of Sciences.

David McCandlish. Photo by Gina Motisi, 2020/CSHL

The research didn’t explore any single type of cancer, but, rather applied the method looking for patterns across a range of types of cancers. The notion of understanding the way these genetic alterations affect cancer is a “key motivating idea behind this work,” McCandlish said.

So far, the method has identified several candidates that need further work to confirm.

“Cancer would be a lot easier to treat if it was just one gene,” said Justin Kinney, Associate Professor at CSHL and a collaborator on the work. “It’s the combination that makes it so hard to understand.”

Ultimately, this kind of research could lead researchers and, eventually, health care professionals, to search for genetic biomarkers that indicate the likely effect of the cancer on the body. This disease playbook could help doctors anticipate and head off the next moves with various types of treatments.

“This could potentially lead to a more fundamental understanding of what makes cancer progress and that understanding would very likely open up new possibilities in cancer treatments,” Kinney said.

To be sure, at this point, the approach thus far informs basic research, which, in future years, could lead to clinical improvements.

“We are working on this method, which is very general and applicable to many different types of data,” McCandlish said. “Applications to making decisions about patients are really down the road.”

McCandlish described how he is trying to map out the space that cancer evolves in by understanding the shape of that space and integrating that with other information, such as drug susceptibility or survival time.

“We are trying to ask: how do these variables behave in different regions of this space of possibilities?” he said.

McCandlish is making this approach available to scientists in a range of fields, from those scientists interpreting and understanding the effects of mutations on the development of cancer to those researchers pursuing a more basic appreciation of how such changes affect the development and functioning of proteins.

“This is accessible to a wide array of biologists who are interested in genetics and, specifically in genetic interactions,” said McCandlish.

The main advance in this research is to take a framework called maximum entropy estimation  and improve its flexibility by using math to capture more of the underlying biological principals at work. Maximum entropy estimation is based on the idea of inferring the most uniform distribution of behaviors or outcomes with the least information that’s compatible with specific aspects of experimental observations.

Using this philosophy, scientists can derive familiar probability distributions like the bell curve and the exponential distribution. By relaxing these estimates, scientists can infer more complicated shapes.

This more subtle approach enhances the predictive value, which captures the distributions of data better, McCandlish explained. “We’re trying to capture and model cancer progression in a new and more expressive way that we hope will be able to tell us more about the underlying biology.”

The idea for this paper started when McCandlish, Kinney and  Jason Sheltzer, a former fellow at Cold Spring Harbor Laboratory and a current Assistant Professor of Surgery at Yale School of Medicine, discussed the possibilities after McCandlish attended a talk by Wei-Chia Chen, a post doctoral researcher in Kinney’s lab.

Chen will continue to pursue questions related to this effort when he starts a faculty position in the physics department at National Chung Cheng University in Taiwan this spring.

Chen will use artificial intelligence to handle higher dimensional data sets, which will allow him “to implement effective approximations” of the effect of specific combinations of genetic alterations, Kinney said.

Kinney believes teamwork made this new approach, which the high-impact, high-profile journal PNAS published, possible.

“This problem was an absolutely collaborative work that none of us individually could have done,” Kinney said. He described the work as having a “new exploratory impact” that provides a way of looking at the combination of genomic changes that “we haven’t had before.”

Working at Cold Spring Harbor Laboratory, which McCandlish has done since 2017, enables collaborations across different disciplines.

“We have this quantitative biology group, we also have people working on neuroscience, cancer, and plant biology,” McCandlish added.

McCandlish is also currently also working with Professor Zachary Lippman and his graduate student Lyndsey Aguirre to understand how multiple mutations interact to influence how the fruit on tomato plants develop.

“The idea is that there are these huge spaces of genetic possibilities where you can combine different mutations in different ways,” McCandlish explained. “We want to find those key places in that space where there’s a tipping point or a fork in the road. We want to be able to identify those places to follow up or to ask what’s special about this set of mutations that makes it such a critical decision point.”

A native of Highland Park, New Jersey, McCandlish was interested in math and science during his formative years. 

As for the work, McCandlish appreciates how it developed from the way these collative researchers interacted.

“This would never have happened if we weren’t going to each other’s talks,” he said.

Peter Koo. Photo by ©Gina Motisi, 2019/ CSHL

By Daniel Dunaief

We built a process that works, but we don’t know why. That’s what one of the newest additions to Cold Spring Harbor Laboratory hopes to find out.

Researchers have applied artificial intelligence in many areas in biology and health care. These systems are making useful predictions for the tasks they are trained to perform. Artificial intelligence, however, is mostly a hands-off process. After these systems receive training for a particular task, they learn patterns on their own that help them make predictions.

How these machines learn, however, has become as much of a black box as the human brains that created these learning programs in the first place. Deep learning is a way to build hierarchical representations of data, explained Peter Koo, an assistant professor at the Simons Center for Quantitative Biology at CSHL, who studies the way each layer transforms data and the next layer builds upon this in a hierarchical manner.

Koo, who earned his doctorate at Yale University and performed his postdoctoral research at Harvard University, would like to understand exactly what the machines we created are learning and how they are coming up with their conclusions.

“We don’t understand why [these artificial intelligence programs] are making their predictions,” Koo said. “My postdoctoral research and future research will continue this line of work.”

Koo is not only interested in applying deep learning to biological problems to do better, but he’s also hoping to extract out what knowledge these machines learn from the data sets to understand why they are performing better than some of the traditional methods.

“How do we guide black box models to learn biologically meaningful” information? he asked. “If you have a data set and you have a predictive model that predicts the data well, you assume it must have learned something biologically meaningful,” he suggested. “It turns out, that’s not always the case.”

Deep learning can pick up other trends or links in the data that might not be biologically meaningful. In a simplistic example, an artificial intelligence weather system that tracked rain patterns during the spring might conclude, after seven rainy Tuesdays, that it rains on Tuesdays, even if the day of the week and the rain don’t have a causative link.

“If the model is trained with limited data that is not representative, it can easily learn patterns that are correlative in the training data,” Koo said. He tries to combat this in practice by holding out some data, which is called validating data. Scientists use it to evaluate how well the model generalizes to new data.

Koo plans to collaborate with numerous biologists at Cold Spring Harbor Laboratory, as well as other quantitative biologists, like assistant professors Justin Kenney and David McCandlish.

In an email, Kenney explained that the Simons Center is “very interested in moving into this area, which is starting to have a major impact on biology just as it has in the technology industry.”

The quantitative team is interested in high-throughput data sets that link sequence to function, which includes assays for protein binding, gene expression, protein function and a host of others. Koo plans to take a “top down” approach to interpret what the models have learned. The benefit of this perspective is that it doesn’t set any biases in the models.

Deep learning, Koo suggested, is a rebranding of artificial neural networks. Researchers create a network of simple computational units and collectively they become a powerful tool to approximate functions.

A physicist by training, Koo taught himself his expertise in deep learning, Kenney wrote in an email. “He thinks far more deeply about problems than I suspect most researchers in this area do,” he  wrote. Kenney is moving in this area himself as well, because he sees a close connection between the problem of how artificial intelligence algorithms learn to do things and how biological systems mechanistically work.

While plenty of researchers are engaged in the field of artificial intelligence, interpretable deep learning, which is where Koo has decided to make his mark, is a considerably smaller field.

“People don’t trust it yet,” Koo said. “They are black box models and people don’t understand the inner workings of them.” These systems learn some way to relate input function to output predictions, but scientists don’t know what function they have learned.

Koo chose to come to Cold Spring Harbor Laboratory in part because he was impressed with the questions and discussions during the interview process.

Koo, daughter Evie (left) and daughter Yeonu (right) during Halloween last year. Photo by Soohyun Cho

He started his research career in experimental physics. As an undergraduate, he worked in a condensed matter lab of John Clarke at the University of California at Berkeley. He transitioned to genomics, in part because he saw a huge revolution in next-generation sequencing. He hopes to leverage what he has learned to make an impact toward precision medicine. 

Biological researchers were sequencing all kinds of cancers and were trying to make an impact toward precision medicine. “To me, that’s a big draw,” Koo said, “to make contributions here.”

A resident of Jericho, Koo lives with his wife, Soohyun Cho, and their 6-year-old daughter Evie and their 4-year old-daughter Yeonu.

Born and raised in the Los Angeles area, he joined the Army Reserves after high school, attended community college and then transferred to UC Berkeley to get his bachelor’s degree in physics.

As for his decision to join Cold Spring Harbor Laboratory, Koo said he is excited with the opportunity to combine his approach to his work with the depth of research in other areas. 

“Cold Spring Harbor Laboratory is one of those amazing places for biological research,” Koo said. “What brought me here is the quantitative biology program. It’s a pretty new program” that has “incredibly deep thinkers.”