Cancer Care

What Netflix can teach us about treating cancer

Cancer researchers dream of offering personalised treatments to patients. Can they get there using the same math that drives Netflix recommendations?

Two years ago, former President Barack Obama announced the Precision Medicine initiative in his State of the Union Address. The initiative aspired to a “new era of medicine” where disease treatments could be specifically tailored to each patient’s genetic code.

This resonated soundly in cancer medicine. Patients can already manage their cancer with therapies that target the specific genes that are altered in their particular tumor. For example, women with a type of breast cancer caused by the amplification of gene HER2 are often treated with a therapeutic called herceptin. Because these targeted therapeutics are specific to cancer cells, they tend to have fewer side effects than traditional cancer treatments with chemotherapy or radiation.

However, such treatments are not available for most cancer patients. In many cancers, the specific genetic alterations that are responsible for a cancer remain unknown. To create individualized cancer treatments, we must know more about the functional genetic alterations.

With data on cancer genetics growing rapidly, mathematics and statistics can now help unlock the hidden patterns in this data to find the genes that are responsible for an individual’s cancer. With this knowledge, physicians can select appropriate treatments that block the action of these genes to personalize therapies for individual patients. My research aims to improve precision medicine in cancer – by building on the same methods that have been used to find patterns in Netflix movie ratings.

Sifting through the data

Today, there is unprecedented public access to cancer genetics data. These data come from generous patients who donate their tumor samples for research. Scientists then apply sequencing technologies to measure the mutations and activity in each of the 20,000 genes in the human genome.

All these data are a direct result of the Human Genome Project in 2003. That project determined the sequence for all the genes that make up healthy human DNA. Since the completion of that project, the cost of sequencing the human genome has more than halved every year, surpassing the growth of computing power described in Moore’s Law. This cost reduction enables researches to collect unprecedented genetics data from cancer patients.

Most scientific studies on cancer genetics performed worldwide release their data to a centralised, public database provided by the US National Institutes of Health National Library of Medicine. The NIH National Cancer Institute and National Human Genome Research Institute have also freely released genetic data from over 11,000 tumors in 33 cancer types through a project called The Cancer Genome Atlas.

Every biological function – from extracting energy from food to healing a wound – results from activity in different combinations of genes. Cancers hijack the genes that enable people to grow to adulthood and that protect the body from the immune system. Researchers dub these the “hallmarks of cancer.” This so-called gene dysregulation enables a tumor to grow uncontrollably and form metastases in distant organs from the original tumor site.

Researchers are actively using these public data to find the set of gene alterations that are responsible for each tumor type. But this problem is not as simple is identifying a single dysregulated gene in each tumor. Hundreds, if not thousands, of the 20,000 genes in the human genome are dysregulated in cancer. The group of dysregulated genes varies in each patient’s tumor, with smaller sets of commonly reused genes enabling each cancer hallmark.

Precision medicine relies on finding the smaller groups of dysregulated genes that are responsible for biological function in each patient’s tumor. But, genes may have multiple biological functions in different contexts. Therefore, researchers must uncover a set of “overlapping” genes that have common functions in a set of cancer patients.

Linking gene status to function requires complex mathematics and immense computing power. This knowledge is essential to predict of outcome to therapies that would block the function of these genes. So, how can we uncover those overlapping features to predict individual outcomes for patients?

What Netflix can teach us

Fortunately for us, this problem has already been solved in computer science. The answer is a class of techniques called “matrix factorization” – and you’ve likely already interacted with these techniques in your everyday life.

In 2009, Netflix held a challenge to personalise movie ratings for each Netflix user. On Netflix, each user has a distinct set of ratings of different movies. While two users may have similar tastes in movies, they may vary wildly in specific genres. Therefore, you cannot rely on comparing ratings from similar users.

Instead, a matrix factorization algorithm finds movies with similar ratings among a smaller group of users. The group of users will vary for each movie. The computer associates each user with a group of movies to a different extent, based upon their individual tastes. The relationships among users are referred to as “patterns.” These patterns are learned from the data, and may find common rankings unforeseen by movie genre alone – for example, users may share a preference for a particular director or actor.

Genevieve Stein-O'Brien, CC BY
Genevieve Stein-O'Brien, CC BY

The same process can work in cancer. In this case, the measurements of gene dysregulation are analogous to movie ratings, movie genres to biological function and users to patients’ tumors. The computer searches across patient tumors to find patterns in gene dysregulation that cause the malignant biological function in each tumor.

From movies to tumors

The analogy between movie ratings and cancer genetics breaks down in the details. Unless they are minors, Netflix users are not constrained in the movies they watch. But, our bodies instead prefer to minimise the number of genes used for any single function. There are also substantial redundancies between genes. To protect a cell, one gene may easily substitute for another to serve a common function. Gene functions in cancer are even more complex. Tumors are also highly complex and rapidly evolving, depending upon random interactions between the cancer cells and the adjacent healthy organ.

To account for these complexities, we have developed a matrix factorization approach called Coordinated Gene Activity in Pattern Sets – or CoGAPS for short. Our algorithm accounts for biology’s minimalism by incorporating as few genes as possible into the patterns for each tumor.

Different genes can also substitute for one another, each serving a similar function in a different context. To account for this, CoGAPS simultaneously estimates a statistic for the so-called “patterns” of gene function. This allows us to compute the probability of each gene being used in each biological function in a tumor.

For example, many patients take a targeted therapeutic called cetuximab to prolong survival in colorectal, pancreatic, lung and oral cancers. Our recent work found that these patterns can distinguish gene function in cancer cells that respond to the targeted therapeutic agent cetuximab from those that do not.

The future

Unfortunately, cancer therapies that target genes usually cannot cure a patient’s disease. They can only delay progression for a few years. Most patients then relapse, with tumors that are no longer responsive to the treatment.

Our own recent work found that the patterns that distinguish gene function in cells that are responsive to cetuximab include the very genes that give rise to resistance. Emerging immunotherapies are promising and appear to cure some cancers. Yet, far too often, patients with these treatments also relapse. New data that track the cancer genetics after treatment is essential to determine why patients no longer respond.

Along with these data, cancer biology also requires a new generation of scientists who can bridge mathematics and statistics to determine the genetic changes occurring over time in drug resistance. In other fields of mathematics, computer programs are able to forecast long-term outcomes. These models are used commonly in weather prediction and investment strategies.

In these fields and my own previous research, we have found that updates to the models from large datasets – such as satellite data in the case of weather – improve long-term forecasts. We have all seen the effect of these updates, with weather predictions improving the closer that we are to a storm.

Just as tools from computer science used can be adapted to both movie recommendations and cancer, the future generation of computational scientists will adopt prediction tools from an array of fields for precision medicine. Ultimately, with these computational tools, we hope to predict tumors’ response to therapy as commonly as we predict the weather, and perhaps more reliably.

Elana Fertig, Assistant Professor of Oncology Biostatistics and Bioinformatics, Johns Hopkins University.

This article first appeared on The Conversation.

We welcome your comments at
__('Sponsored Content') BY 

Harvard Business School’s HBX brings the future of business education to India with online programs

HBX is not only offering courses online, but also connecting students to the power of its network.

The classic design of the physical Harvard Business School (HBS) classroom was once a big innovation – precisely designed teaching amphitheaters laid out for every student to participate from his or her seat with a “pit” in the center of the room from which professors orchestrate discussions analyzing business cases like a symphony lead. When it came to designing the online experience of HBX—the school’s digital learning initiative—HBS faculty worked tirelessly to blend these tenets of the HBS classroom pedagogy with the power of new technology. With real-world problem solving, active learning, and social learning as its foundation, HBX offers immersive and challenging self-paced learning experiences through its interactive online learning platform.

Reimagining digital education, breaking the virtual learning mold

Typically, online courses follow a one-way broadcast mode – lectures are video recorded and reading material is shared – and students learn alone and are individually tested. Moving away from the passive learning model, HBX has developed an online platform that leverages the HBS ‘case-based pedagogy’ and audio-visual and interaction tools to make learning engaging.

HBX courses are rarely taught through theory. Instead, students learn through real-world problem-solving. Students start by grappling with a business problem – with real world data and the complexity in which a business leader would have to make a decision – and learn the theory inductively. Thus even as mathematical theories are applied to business situations, students come away with a greater sense of clarity and perspective, whether it is reading a financial report, understanding why a brand’s approach to a random sample population study may or may not work, or how pricing works.

HBX Platform | Courses offered in the HBX CORe program
HBX Platform | Courses offered in the HBX CORe program

“Learning about concepts through real-life cases was my favorite part of the program. The cases really helped transform abstract concepts into observable situations one could learn from. Furthermore, it really helped me understand how to identify situations in which I could use the tools that HBX equipped me with,” says Anindita Ravikumar, a past HBX participant. India’s premier B-school IIM-Ahmedabad has borrowed the very same pedagogy from Harvard. Learning in this manner is far more engaging, relatable, and memorable.

Most lessons start with a short 2-3 minute video of a manager talking about the business problem at hand. Students are then asked to respond on how they would handle the issue. Questions can be in the form of either a poll or reflections. Everyone’s answers are then visible to the ‘classroom’. In the words of Professor Bharat Anand, Faculty Chair, HBX, “This turns out to be a really important distinction. The answers are being updated in real-time. You can see the distribution of answers, but you can also see what any other individual has answered, which means that you’re not anonymous.” Students have real profiles and get to know their ‘classmates’ and learn from each other.

HBX Interface | Students can view profiles of other students in their cohort
HBX Interface | Students can view profiles of other students in their cohort

Professor Anand also says, “We have what we call the three-minute rule. Roughly every three minutes, you are doing something different on the platform. Everyone is on the edge of their seats. Anyone could be called on to participate at any time. It’s a very lean forward mode of learning”. Students get ‘cold-called’ – a concept borrowed from the classroom – where every now and then individuals will be unexpectedly prompted to answer a question on the platform and their response will be shared with other members of the cohort. It keeps students engaged and encourages preparedness. While HBX courses are self-paced, participants are encouraged to get through a certain amount of content each week, which helps keep the cohort together and enables the social elements of the learning experience.

More than digital learning

The HBS campus experience is valued by alumni not just for the academic experience but also for the diverse network of peers they meet. HBX programs similarly encourage student interactions and opportunities for in-person networking. All HBXers who successfully complete their programs and are awarded a credential or certificate from HBX and Harvard Business School are invited to the annual on-campus HBX ConneXt event to meet peers from around the world, hear from faculty and business executives, and also experience the HBS campus near Cambridge.

HBXers at ConneXt, with Prof. Bharat Anand
HBXers at ConneXt, with Prof. Bharat Anand

Programs offered today

HBX offers a range of programs that appeal to different audiences.

To help college students and recent graduates prepare for the business world, HBX CORe (Credential of Readiness) integrates business essentials such as analytics, economics, and financial accounting. HBX CORe is also great for those interested in an MBA looking to strengthen their application and brush up their skills to be prepared for day one. For working professionals, HBX CORe and additional courses like Disruptive Strategy, Leading with Finance, and Negotiation Mastery, can help deepen understanding of essential business concepts in order to add value to their organizations and advance their careers.

Course durations range from 6 to 17 weeks depending on the program. All interested candidates must submit a free, 10-15 minute application that is reviewed by the HBX admissions team by the deadlines noted on the HBX website.

For more information, please review the HBX website.

This article was produced by the Scroll marketing team on behalf of HBX and not by the Scroll editorial team.