pyproteonet.metrics.differential_expression.find_des

pyproteonet.metrics.differential_expression.find_des(dataset: Dataset, molecule: str, columns: str | List[str], nominator_samples: List[str], denominator_samples: List[str], max_pvalue=0.05, min_fc=2, is_log: bool = False) Tuple[DataFrame, DataFrame, DataFrame]

Find differentially expressed molecules usign a t-test and multitest correction (benjamin hochberg).

Parameters:
  • dataset (Dataset) – The dataset to find differentially expressed molecules in.

  • molecule (str) – The molecule type to find differentially expressed molecules for.

  • columns (Union[str, List[str]]) – The value column(s) containing the abundance values of potentially differentially expressed molecules.

  • nominator_samples (List[str]) – List of samples names to use as nominator when computing fold change.

  • denominator_samples (List[str]) – List of samples names to use as denominator when computing fold change.

  • max_pvalue (float, optional) – P value to as significance threshold . Defaults to 0.05.

  • min_fc (int, optional) – Minimum fold change required to be considered as differentially expressed. Works for both increase and decrease in abundance (e.g. a min. fold change of 2 results in both a fold change of 2 and 0.5 being considered as potentially differentially expressed.). Defaults to 2.

  • is_log (bool, optional) – Whether the column values are logarithmized. Defaults to False.

Returns:

Tuple of three dataframes representing differential expressions, p-values and fold changes.

Return type:

Tuple(pd.DataFrame, pd.DataFrame, pd.DataFrame)