Skip to content

Welcome to the Pupil Bio Data Analysis repository! This repository contains code and scripts used for the bioinformatics challenge related to identifying somatic mutations and performing quality control for cancer sample analysis.

Notifications You must be signed in to change notification settings

Harshith-Reddy-CK/pupil_bio_data_analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

6 Commits
Β 
Β 
Β 
Β 

Repository files navigation

Pupil Bio Data Analysis 🧬

Welcome to the Pupil Bio Data Analysis repository! This repository contains code and scripts used for the bioinformatics challenge related to identifying somatic mutations and performing quality control for cancer sample analysis.

⚑ Overview of Tasks

πŸ§‘β€πŸ”¬ Task 1: Coverage Analysis and Biomarker Identification

This task involves calculating coverage metrics and identifying biomarkers for tissue differentiation.

πŸ§ͺ Task 2: Quality Control, Alignment, and Mutation Calling

This task covers quality control checks, alignment to the human genome, and identifying somatic mutations using custom scripts.

πŸ–₯️ Scripts Folder

The scripts folder contains Python and R scripts that automate the following tasks:

  • πŸ§ͺ Quality control analysis using FastQC and other tools.
  • 🧬 Alignment of sequencing data using tools such as Bowtie2 and BWA.
  • πŸ”¬ Mutation calling and somatic mutation identification using custom Python scripts leveraging Samtools and bcftools.
  • πŸ“Š Plots and visualizations for quality control and biomarker identification.

The scripts are organized into separate folders for each task, allowing users to easily navigate through the pipeline.

πŸš€ Usage

To use the scripts, follow these steps:

  1. Clone the repository:
    git clone https://github.com/Harshith-Reddy-CK/pupil_bio_data_analysis.git
    cd pupil_bio_data_analysis
    
    
    
    

πŸ’Ύ Supplementary and Output Files The repository currently contains scripts for performing the tasks outlined above. Due to the size limitations of GitHub, large files, including supplementary data and output files, are not hosted directly on this repository.

To request the large files (e.g., BAM files, VCF files, output data), you can contact the repository owner or download them from the provided Dropbox or Google Drive links below:

πŸ“₯ Download Links Please reach out to me if you would like access to these files.

About

Welcome to the Pupil Bio Data Analysis repository! This repository contains code and scripts used for the bioinformatics challenge related to identifying somatic mutations and performing quality control for cancer sample analysis.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published