pyproteonet.io.io.read_mapped_dataframe

pyproteonet.io.io.read_mapped_dataframe(df: DataFrame, molecule: str, sample_columns: List[str], molecule_columns: List[str] | None = None, id_column: str | None = None, result_column_name: str = 'abundance', mapping_column: str | None = None, mapping_sep: str = ',', mapping_molecule: str = 'protein', mapping_name='peptide-protein') → Dataset

Reads a dataframe containing a mapping as column with comma separated ids of the mapped molecule.

Parameters:

df (pd.DataFrame) – The input dataframe.
molecule (str) – The name of the molecule column in the dataframe.
sample_columns (List[str]) – The list of sample columns in the dataframe.
molecule_columns (Optional[List[str]], optional) – The list of molecule columns in the dataframe. Defaults to None.
id_column (Optional[str], optional) – The name of the ID column in the dataframe. Defaults to None.
result_column_name (str, optional) – The name of the result column in the dataframe. Defaults to “abundance”.
mapping_column (Optional[str], optional) – The name of the mapping column in the dataframe. Defaults to None.
mapping_sep (str, optional) – The separator used in the mapping column. Defaults to “,”.
mapping_molecule (str, optional) – The name of the molecule in the mapping column. Defaults to “protein”.
mapping_name (str, optional) – The name of the mapping. Defaults to “peptide-protein”.

Returns:

The converted Dataset object.

Return type:

Dataset