Clone of https://github.com/YtrebergPatelLab/EpistasisStats, I'm the original author.
This folder contains the scripts used for analysis. This includes:
- Loglikelihood ratio analysis: Generates simulated set of lambas by greatest likelihood
- Model building/selection: Performs model selection using data curated from the SKEMPIv2.0 and ProTherm4 datasets
- Validation: Performs a leave-10-percent-systems-out validation process on the aforementioned model selection procedure.
Contains the data used for input and generated by the analysis scripts
- loglikedata: Data generated by the loglikelihood ratio test, used to generate figure 3
- Mappings: Amino acid data to map the single letter code to attributes, like charge.
- Model_build_results: Resulting best model and delta r-squared data.
- processed: The processed datasets from SKEMPIv2.0 and ProTherm4
- raw: The raw datasets from SKEMPIv2.0 and ProTherm4
- validation_results: The results from the validation procedure
- validation_results_parsed: The split and parsed validation results
Contains figures present in the manuscript and the scripts used to generate them. Self explanatory, for more details see the included README.md