Groundbreakers: 91³Ō¹Ļās Data Sciences Institute to help researchers find answers to their biggest questions
When 91³Ō¹Ļ astronomer Bryan Gaensler looks up at the night sky, he doesnāt just see stars ā he sees data. Big data.
So big, in fact, that his current research tracking the baffling āfast radio burstsā (FRBs) that bombard Earth from across the universe requires the capture of more data per second than all of Canadaās internet traffic.
āThis is probably the most exciting thing in astronomy right now, and itās a complete mystery,ā says Gaensler, director of 91³Ō¹Ļās Dunlap Institute for Astronomy & Astrophysics and Canada Research Chair in Radio Astronomy. āRandomly, maybe once a minute, thereās this incredibly bright flash of radio waves ā like a one-millisecond burst of static ā from random directions all over the sky.
āWe now know that theyāre from very large distances, up to billions of light-years, so they must be incredibly powerful to be able to be seen this far away.ā
91³Ō¹Ļ is a world leader in finding FRBs, using the multi-university CHIME radio telescope in British Columbiaās Okanagan region and a 91³Ō¹Ļ supercomputer. Yet, despite the impressive technology, many daunting challenges remain.
āItās a massive computational and processing problem that is holding us back,ā he says. āWe are recording more than the entire internet of Canada, every day, every second. And because thereās no hard drive big enough or fast enough to actually save that data, we end up throwing most of it away. We would obviously like to better handle the data, so that needs better equipment and better algorithms and just better ways of thinking about the data.ā
With the creation of 91³Ō¹Ļās (DSI), Gaensler and his colleagues now have a new place to turn to for help. The institute, , is designed to help the universityās wealth of academic experts in a variety of disciplines team up with statisticians, computer scientists, data engineers and other digital experts to create powerful research results that can solve a wide range of problems ā from shedding light on interstellar mysteries to finding life-saving genetic therapies.
āThe way forward is to bring together new teams of astronomers, computer scientists, artificial intelligence experts and statisticians who can come up with fresh approaches optimized to answer specific scientific questions that we currently donāt know how to address,ā Gaensler says.
The Data Sciences Institute is just one of nearly two dozen (ISI) launched by 91³Ō¹Ļ to address complex, real-world challenges that cut across fields of expertise. Each initiative brings together a flexible, multidisciplinary team of researchers, students and partners from industry, government and the community to take on a āgrand challenge.ā
āWeāre bringing together individuals at the intersection of traditional disciplinary fields and computational and data sciences,ā says Lisa Strug, director of the Data Sciences Institute and a professor in the departments of statistical sciences and computer science in the Faculty of Arts & Science, and a senior scientist at the Hospital for Sick Children research institute.
She notes that 91³Ō¹Ļ boasts world-leading experts in fields such as medicine, health, social sciences, astrophysics and the arts, and āsome of the top departments in the world in the cognate areas of data science like statistics, mathematics, computer science and engineering.ā
Data science techniques can be brought to bear on a near-infinite variety of academic questions ā from climate change to transportation, planning to art history. In literature, Strug says, many works from previous centuries are now being digitized, allowing data-based analysis right down to, say, sentence structure.
āNew fields of data science are emerging every day,ā says Strug, who oversees data-intensive genomics research in complex diseases such as cystic fibrosis that has led to the promise of new drugs to treat the debilitating lung disease. āWe have so much computational disciplinary strength we can leverage to define and advance these new fields.
āWe want to make sure that faculty have access to the cutting-edge tools and methodology that enable them to push the frontiers of their field forward. They may be answering questions they wouldnāt have been able to ask before, without that data and without those tools.ā
A key function of the DSI is the creation and funding of Collaborative Research Teams (CRTs) of professors and students from a variety of disciplines who can work together on important projects with stable support.
Gaensler, who already has statisticians on his team, says heās looking to the CRTs to greatly expand the scope of his work.
āWe have just done the low-hanging fruit,ā he says. āThere are many deeper problems that we havenāt even started on.ā
Similarly, Laura Rosella, an associate professor at the Dalla Lana School of Public Health, says the collaborative teams will be a major asset for the university.
āWeāre going to dedicate funding to these multi-disciplinary trainees and post-docs so we can start building a critical mass of people that can actually translate between these disciplines,ā she says. āTo solve problems, you need this connecting expertise.ā
Rosella played a key role in how Ontario dealt with COVID-19 in the early part of 2021. By analyzing anonymous cellphone data along with health information, she and her interdisciplinary team were able to see where people were moving and congregating, and then predict in advance likely clusters of the disease that would appear up to two weeks later. Her work helped support the provinceās highly successful strategy of targeting so-called āhotspots.ā
āWeāve been able to work with diverse data sources in order to generate insights that are used for
high-level pandemic preparedness and planning, in ways that werenāt possible before,ā says Rosella, who sits on Ontarioās COVID-19 . āAnd weāve also brought in new angles to the data around the social determinants of health that have shone a light on the policy measures that are needed to truly address disparities in COVID rates.ā
Rosellaās population risk tools also include one for diabetes, which health systems can use to estimate the future burden of the disease and guide future planning. This includes inputs about the built environment. For example, if people can walk to a new transit stop, Rosella says, the increased exercise may have an impact on diabetes or other diseases. Potentially, even satellite imaging data could be brought into the prediction mix, she says.
In addition to advancing research in a given field, the Data Sciences Institute is also seeking to advance equity.
That includes tackling societal inequalities uncovered by data research ā including how socio-economic factors can determine who is more likely to get COVID-19 ā and the way the research itself is being conducted.
For example, Strug says most genomics studies have focused on participants of European origin, even though the genetic risk factors for various diseases can differ between different ethnicities.
āWe must make sure we develop and implement the models, tools and research designs ā and bring diverse sources of data together ā to ensure our understanding of disease risk is applicable to all,ā Strug says.
Many algorithms, or the data they use to make predictions, contain unconscious bias that may skew results ā which is why Strug says transparency is vital both to support equity and to ensure studies can be reproduced properly.
Gaensler says itās critical to ensure diversity among researchers, too.
āMy department looks very different from the faces that I see on the subway,ā he says. āItās not a random sampling of Canadian society ā itās very male, white and old, and thatās a problem we need to work on.ā
Strug hopes the Data Sciences Institute will ultimately become a nucleus for researchers across the university ā and beyond.
āThereās never been one entrance to the university to guide people, so itās so important for us to be that front door,ā she says.
āWe will make every effort to stay abreast of the different fantastic things that are happening in data sciences and be able to direct people to the right place, as well as provide an inclusive, welcoming and inspiring academic home.ā