Hackathon
The workshop hackathon will leverage data captured in EDRN’s LabCAS biomarker data commons—as well as other public data resources—for hands-on application. The hackathon will include both programming/hacking and ideation/brainstorming about combining diverse datasets for specific "big questions" that may have been considered out of reach just a few years ago. This expanded scope opens the hackathon to both bioinformatics experts and investigators of all levels who are interested in learning more about the application and use of various bioinformatics, AI/ML tools, or methods—all in support of cancer biomarker research.
Note: the hackathon is intended for in-person attendees; online attendees are free to try things out on their own.
Goals for the Biomarker Workshop
- Downloading and using the data
- Applying ML methodology to large cancer biomarker datasets
- Exploring combining datasets (either public or multi-modal)
- Investigating zero-shot models (models trained on one model to be used on another)
After the workshop, further exploration with the three larger collections of workshop data will continue.
Instructions for Attendees
- Review the hackathon data descriptions
- Install Aspera to download data
- Accessing Google Colab
- Signup to join a hacking team (in-person only)
Agenda
Time | Duration | Topic | Facilitator |
---|---|---|---|
8:35 AM |
0:15 |
Overview of Hackathon: Data collections, challenges, opportunities, tools and methodology |
Ashish Mahabal, Ph.D., Caltech |
8:50 AM |
0:15 |
Tutorial: Accessing and using data |
Heather Kincaid, Jet Propulsion Laboratory |
9:05 AM |
0:20 |
Tutorial: Overview of the hackathon data collections |
Erin Fowler, Moffitt Cancer Center |
9:25 AM |
0:20 |
Ashish Mahabal, Ph.D., Caltech |
|
9:45 AM |
0:05 |
Logistics for breakout groups |
Ashish Mahabal, Ph.D., Caltech |
9:50 AM |
0:45 |
Exercise/Discussion: Multi-Modal and Transfer Learning |
Facilitated teams |
10:35 AM |
0:15 |
Coffee break |
|
10:50 AM |
0:45 |
Exercise/discussion: Zero-shot |
Facilitated teams |
11:35 AM |
1:00 |
Next Steps and Future Plans for Hackathon: Report from Teams |
Ashish Mahabal and Dan Crichton |
12:35 PM |
~ |
Adjourn |
Dan Crichton, Jet Propulsion Laboratory, Caltech |
Data Collections
A number of large datasets for complex problems have been made available.
Tools and Resources
- GitHub repository
- LabCAS Overview
- Sign up for a LabCAS account
- Download data from LabCAS
- LabCAS Help Pages
- Facilitators for each class of problem
- Software tools, including Jupyter Notebooks
- AI and Biostatistics Glossary of Terms
Questions?
You can reach the EDRN AI workshop planning panel by email.