KRSA Analysis: Detailed Workflow Guide

Comprehensive Step-by-Step Procedure

Preparation Phase

1. Software Installation

Install R from the R Project website
- Verify installation: R --version
Install Git from the Git website
- Verify installation: git --version
Install the Just task runner. Recommended methods:
- macOS: using Homebrew
```
brew install just
```
- Linux: using snap or your distribution's package manager
- Windows: Use one of the recommended methods from the website
Install the Quarto CLI from the Quarto website
Install an IDE. Recommended options: VSCode, RStudio
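
The installation checks above can be run in one pass. The following is a minimal sketch, assuming a POSIX shell; the helper name check_tool is hypothetical and not part of the repository. It only reports whether each command is on your PATH and does not check versions.

```shell
#!/bin/sh
# Report whether a command-line tool is available on PATH.
# check_tool is a hypothetical helper name used for illustration.
check_tool() {
  if command -v "$1" >/dev/null 2>&1; then
    echo "found: $1"
  else
    echo "missing: $1"
  fi
}

# The four tools this guide asks you to install.
for tool in R git just quarto; do
  check_tool "$tool"
done
```

Any line that starts with "missing:" points at a tool that still needs to be installed or added to your PATH.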

2. Repository Setup

Navigate to the template repository
Click "Use Template" to create your own repository

Clone your new repository

```
git clone https://github.com/your-username/your-repository.git
cd your-repository
```

Environment Configuration

3. R Environment Preparation

Note

This step usually takes the most time. If in doubt, try running renv::hydrate() and renv::update().

Open R in the repository directory

Initialize and restore project dependencies

```
# Restore project-specific packages
renv::restore()

# Verify installation
renv::status()
```

Data Preparation

4. Data File Management

Locate your experimental data files:
- Each experiment has two data files. Each file follows a naming scheme that identifies its contents:
  - Signal Intensity File: This file contains the signal data after subtracting the background intensity. It can be identified by the string SigmBg in the filename.
  - Signal Saturation File: This file records the times when the detected signal intensity was higher than the maximum detectable signal. It can be identified by the string SignalSaturation in the filename.
Place the files in the kinome_data/ directory
- The layout inside the kinome_data/ directory is entirely up to personal preference. The convention is to separate files by chip (STK and PTK) into different directories and to give the files an identifying prefix that matches the analysis report prefix (see below).
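
The naming scheme above can be checked from the command line before you fill in the report parameters. This is a rough sketch, assuming a POSIX shell and that the marker strings appear in the filename exactly as written; classify_file is a hypothetical helper, not something the repository provides.

```shell
#!/bin/sh
# Classify a kinome data file by the marker string in its filename.
# classify_file is a hypothetical helper name used for illustration.
classify_file() {
  case "$1" in
    *SigmBg*)           echo "signal" ;;
    *SignalSaturation*) echo "saturation" ;;
    *)                  echo "unknown" ;;
  esac
}

# List every file under kinome_data/ with its detected type.
# Only the basename is checked, so directory names cannot cause a match.
if [ -d kinome_data ]; then
  find kinome_data -type f | while IFS= read -r f; do
    printf '%s\t%s\n' "$(classify_file "$(basename "$f")")" "$f"
  done
fi
```

Each experiment should show exactly one signal file and one saturation file; anything listed as unknown is worth a second look.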

Analysis Setup

5. Creating a New Analysis

Use Just to create a new analysis file

```
# Basic usage
just new-analysis my_experiment

# Specify chip type (PTK or STK)
just new-analysis my_experiment STK

# Customize prefix
just new-analysis my_experiment STK custom_prefix
```

6. Configuration

Open the newly created .Rmd file

Edit YAML frontmatter:

```
params:
   title: "Add Title Here" # Add the title of the report
   subtitle: "Customer Name Here" # Add the customer/collaborator name
```

Update file paths for input data

```
params:
   signal_file: "kinome_data/your/file/name/here"
   saturation_file: "kinome_data/your/file/name/here"
```

Review analysis parameters

```
params:
   threshold: 2 # This is cosmetic and is used to mark significant Z scores
   # This is used to identify and manage multiple experiments in the same repository
   prefix: "kinome"
   pairwise: FALSE # Set to TRUE if you are comparing one well to another well without replicates
```

Run the analysis until the group comparison chunk.

In many cases, it is either hard or impossible to identify the group comparisons in advance. Similarly, unless you generated the data yourself, the naming scheme of the groups and sample names might not be what you expect.

Thus, it becomes important to find out this information and set up the correct comparisons in this report.

For this reason, we recommend running the report chunk by chunk until the Group Comparison chunk.

This gives you the groups variable, which holds the extracted group names and lets you identify the groups and the comparisons to make.
Fill out the comparisons variable with the required comparisons.

With the comparisons in hand, edit the comparisons chunk to include them. Note that the second group in each pair is treated as the control group, against which the first group is compared.
```
# Define Groups to be compared
comparisons <- list(
  COMP1 = c(groups[[1L]], groups[[4L]]),
  COMP2 = c(groups[[1L]], groups[[3L]]),
  COMP3 = c(groups[[2L]], groups[[4L]]),
  COMP4 = c(groups[[4L]], groups[[3L]])
)
```
These groups are defined in terms of the groups variable. This ensures that the group names are sourced correctly.
Copy the childA chunk as many times as there are comparisons.

The childA chunk is the main copy of the code that performs the group comparison analysis. Copy it as many times as needed. Each new copy needs two changes: the label statement in the chunk options and the random variable in the chunk itself. Name these sequentially, using a letter of the alphabet as the suffix of child. Thus, the second chunk is labeled childB with the corresponding random variable set to B, the following chunk is childC, and so on.
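
To see which labels you need for a given number of comparisons, the naming rule above can be written out as a short shell function. This is only an illustration of the sequence; chunk_labels is a hypothetical helper, and the sketch assumes at most 26 comparisons.

```shell
#!/bin/sh
# Print the chunk label for each of N comparisons: childA, childB, ...
# chunk_labels is a hypothetical helper name used for illustration.
chunk_labels() {
  n=$1
  i=1
  for letter in A B C D E F G H I J K L M N O P Q R S T U V W X Y Z; do
    # Stop once a label has been printed for every comparison.
    [ "$i" -gt "$n" ] && break
    echo "child$letter"
    i=$((i + 1))
  done
}
```

For the four comparisons in the example above, `chunk_labels 4` prints childA through childD, one per line.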

Execution

7. Run Complete Analysis

```
# Run entire workflow
just all

# Or run specific components
just render   # Generate reports
just uka      # Run Universal Kinase Analysis
just creeden  # Creedenzymatic analysis
```

It is important to run the steps in this order. The render step generates the files consumed by both the uka and creeden steps, and the creeden step also relies on the files generated by the uka step.

Failing to follow the order may result in cryptic errors.
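
If you run the components individually, one way to enforce the order is a small wrapper that stops at the first failing step. This is a sketch assuming a POSIX shell; run_pipeline is a hypothetical name, and the three recipe names are the ones listed above.

```shell
#!/bin/sh
# Run the three steps in dependency order, stopping at the first failure.
# run_pipeline is a hypothetical helper name used for illustration.
run_pipeline() {
  for step in render uka creeden; do
    echo "running: just $step"
    just "$step" || { echo "step failed: $step" >&2; return 1; }
  done
}
```

Calling `run_pipeline` from the repository root then runs render, uka, and creeden in sequence, and a failure in an earlier step prevents later steps from running against missing files.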

Output Management

8. Interpreting Results

Examine rendered PDF reports
Check results/ for CSV files
Review figures/ for generated plots

Troubleshooting

9. Common Issues

Dependencies: renv can be unpredictable at times. If you see errors when loading libraries, make sure that you have run renv::hydrate().
Incorrect Chip Type: If you see an error implying that no peptides match, you might have set the wrong chip_type in the frontmatter. Ensure that the chip_type parameter is set correctly.

10. Verification Steps

Confirm R and Just versions
Check renv.lock file
Verify data file integrity
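
One possible way to verify data file integrity is to record checksums when the files first arrive and compare against them later. The sketch below assumes a POSIX shell and uses cksum because it is widely available; the function names and the manifest filename are hypothetical, and a stronger hash tool can be substituted if you have one.

```shell
#!/bin/sh
# Print a checksum line for every file under a directory, in sorted order.
# write_manifest and verify_manifest are hypothetical helper names.
write_manifest() {
  find "$1" -type f | sort | while IFS= read -r f; do
    cksum "$f"
  done
}

# Succeed only if the directory still matches a previously saved manifest.
verify_manifest() {
  write_manifest "$1" | diff - "$2" >/dev/null
}
```

For example, save a manifest once with `write_manifest kinome_data > kinome_data.manifest`, then run `verify_manifest kinome_data kinome_data.manifest` before each analysis. Keep the manifest outside the data directory so it is not included in its own listing.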

Best Practices

Always use renv for dependency management
Commit your renv.lock file to version control
Keep input data files unchanged

Getting Help

Reach out to lab support
