The Challenge
ProCogia were engaged by a multinational pharmaceutical company to design and build a solution that designs and evaluates sgRNA (single guide RNA) candidates for CRISPR/Cas9 (Clustered Regularly Interspaced Short Palindromic Repeats) based screens. The solution required integrating functionality from R and Python components in an easy-to-use R package.
Procogia’s Approach
- The solution was delivered by our Bioinformatics team. We used our expertise in Bioinformatics, genome-wide screening, R, Python, and machine learning. We evaluated existing approaches and advised on and evaluated methodology using publicly available and proprietary multi-omics data to ensure the results were scientifically sound.
- We developed an R package that handles user input and output while leveraging automated conda environment calls to seamlessly handle operations in Python modules. Calls to Python were implemented using reticulate and conda environments were managed using basilisk.
- We built a machine learning model based on previously published scientific literature to predict sgRNA efficiency and efficacy. The trained model utilizes genetics and epigenetics information obtained from multi-omics datasets to make these predictions.
- We utilized ground truth data to develop, train, and test the machine learning model for predictive scoring of sgRNA candidates.
The Results
- We delivered an R package that follows Bioconductor guidelines and best practices.
- The complete package, including trained model and Python components, were delivered in a portable, easy-to-use R package.
- Due to the novelty of the tool, we are working with the client to prepare the tool for scientific publication and Bioconductor submission to make it available to the wider community.
Services Used
Bioinformatics
We deliver scientific results that drive clinical and translational research decisions. Our Bioinformatics team has extensive experience designing, optimizing, executing and analyzing pre-clinical and clinical research projects using next-generation sequencing technologies.
Data Consultancy
We provide data consultancy to organizations to optimize your investment in people, processes, and technology. This is typically through data strategy engagements, roadmaps, transformations, and independent technology advice.
Data Science
We use open source technology to leverage the full potential of your data. Predictive and prescriptive results are actioned using AI and Machine Learning (ML).
Related Blogs
Let’s Connect
What can we help you with?
T: +1 425-624-7532
Alternatively, simply fill in this form and we’ll be in touch.