[GDR-3409] Add prism data - first phase #78

Open
j-smola wants to merge 6 commits into main from GDR-3409

Conversation

@j-smola (Contributor) commented Jun 2, 2026

Description

What changed?

Related JIRA issue: GDR-3409

Why was it changed?

To allow using PRISM data in examples across gDRplatform

Checklist for sustainable code base

  • I added tests for any code changed/added
  • I added documentation for any code changed/added
  • I made sure naming of any new functions is self-explanatory and consistent

Logistic checklist

  • Package version bumped
  • Changelog updated

Screenshots (optional)

@j-smola j-smola requested review from bczech, darsoo and gladkia June 2, 2026 12:00
@j-smola j-smola requested a review from a team as a code owner June 2, 2026 12:00

@gemini-code-assist gemini-code-assist Bot left a comment

Code Review

This pull request updates the gDRtestData package to version 1.11.4, introducing the PRISM public dataset and expanding the documentation for several DepMap datasets by adding loading instructions, detailed descriptions, and updating keywords to 'real_data'. The review feedback identifies minor grammatical typos in the documentation and generated .Rd files, specifically recommending the pluralization of 'genes as columns' and '774 cell lines'.

Comment thread R/dataset.R Outdated
#' package = "gDRtestData"))}
#'
#' @source \url{https://depmap.org/portal/data_page/?tab=allData}
#' @format Matrix with cell lines as rows and genes as column

medium

There is a grammatical typo in the @format description. 'genes as column' should be pluralized to 'genes as columns'.

#' @format Matrix with cell lines as rows and genes as columns

Contributor Author

Changed.

Comment thread R/dataset.R Outdated
#' PRISM Broad Screen Cancer Cell Line Viability Dataset
#'
#' The single-agent screening data from the PRISM platform -
#' with one drug and 774 cell line.

medium

There is a grammatical typo in the description. '774 cell line' should be pluralized to '774 cell lines'.

#' with one drug and 774 cell lines.

Contributor Author

Changed.

}
\source{
\url{https://depmap.org/portal/data_page/?tab=allData}
Matrix with cell lines as rows and genes as column

medium

There is a grammatical typo in the \format section. 'genes as column' should be pluralized to 'genes as columns'.

Matrix with cell lines as rows and genes as columns

Contributor Author

Changed.

}
\description{
The single-agent screening data from the PRISM platform -
with one drug and 774 cell line.

medium

There is a grammatical typo in the description. '774 cell line' should be pluralized to '774 cell lines'.

with one drug and 774 cell lines.

Contributor Author

Changed.

@gladkia (Contributor) left a comment

GJ!

Comment thread R/dataset.R
#' @section Description:
#' - Rows: Cell line identifiers
#' - Columns: Metadata columns (see below)
#' - Values: Cell line annotations and patient information
Contributor

There are no patients here; these are in vitro data. The individuals are donors.

Suggested change
- #' - Values: Cell line annotations and patient information
+ #' - Values: Cell line annotations and donor demographics

| **Rows** | Individual cell lines (~1,000 models) |
| **Rows** | Individual cell lines |
| **Columns** | Metadata columns (see below) |
| **Values** | Cell line annotations and patient information |
Contributor

Same as above.

Suggested change
- | **Values** | Cell line annotations and patient information |
+ | **Values** | Cell line annotations and metadata |

Comment thread R/dataset.R
#' \item{clid}{Unique Broad Institute cell line identifier.}
#' \item{CellLineName}{Publicly recognized cancer cell line name.}
#' \item{Tissue}{Primary tissue of origin.}
#' \item{parental_identifier}{Identifier for parental strain/patient source.}
Contributor

This is not a patient source; it is a parental cell line identifier.

Suggested change
- #' \item{parental_identifier}{Identifier for parental strain/patient source.}
+ #' \item{parental_identifier}{Identifier for the parental cell line.}

Comment thread R/dataset.R
#' ### Row Metadata (rowData)
#' Feature metadata describing the treatment compounds across assays:
#' \describe{
#' \item{Gnumber}{Broad Institute global substance identifier (Genentech/Broad ID).}
Contributor

Gnumber is a Genentech identifier, not Broad Institute.

Suggested change
- #' \item{Gnumber}{Broad Institute global substance identifier (Genentech/Broad ID).}
+ #' \item{Gnumber}{Genentech compound identifier.}

4 participants