pyproteonet.imputation.multistep.multistep_top3_impute

pyproteonet.imputation.multistep.multistep_top3_impute(dataset: Dataset, molecule: str, column: str, mapping: str, partner_column: str, all_missing_percentile: float = 0.1) → Series

Simple imputation method based on top3 aggregation, with other aggregation approaches used as fallback options.

Parameters:
  • dataset (Dataset) – The dataset to be imputed.

  • molecule (str) – The molecule type to be imputed.

  • column (str) – The value column with missing values to be imputed.

  • mapping (str) – The mapping to use for aggregation.

  • partner_column (str) – The partner column to use for aggregation.

  • all_missing_percentile (float, optional) – The percentile to use as fallback value when aggregation fails. Defaults to 0.1.

Returns:

The imputed values.

Return type:

pd.Series
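
Example:

The following is a minimal pandas-only sketch of the general idea behind top3 imputation with a fallback. It is not the pyproteonet implementation. The helper names (top3_with_fallback, impute_missing), the table layout (molecule_id, value), and the reading of all_missing_percentile as a quantile fraction of all observed partner values are assumptions made for illustration only. Top3 aggregation here means the mean of the (up to) three largest observed partner values mapped to each molecule.

```python
import numpy as np
import pandas as pd


def top3_with_fallback(partner_values: pd.DataFrame,
                       all_missing_percentile: float = 0.1) -> pd.Series:
    """Hypothetical helper: one estimate per molecule from its partner values.

    partner_values is assumed to have the columns 'molecule_id' and 'value'.
    """
    observed = partner_values.dropna(subset=["value"])
    # Top3: mean of the (up to) three largest observed partner values.
    top3 = observed.groupby("molecule_id")["value"].agg(
        lambda v: v.nlargest(3).mean()
    )
    # Assumed fallback: a low quantile of all observed partner values, used
    # for molecules whose partners are all missing.
    fallback = observed["value"].quantile(all_missing_percentile)
    all_ids = pd.Index(partner_values["molecule_id"].unique(), name="molecule_id")
    return top3.reindex(all_ids).fillna(fallback)


def impute_missing(target: pd.Series, estimate: pd.Series) -> pd.Series:
    """Hypothetical helper: fill only the missing entries of the target column."""
    return target.fillna(estimate)


# Toy data: peptide-level partner values mapped to three proteins.
peptides = pd.DataFrame({
    "molecule_id": ["P1", "P1", "P1", "P1", "P2", "P2", "P3"],
    "value": [10.0, 8.0, 6.0, 2.0, 4.0, np.nan, np.nan],
})
# Protein-level column with missing values to be imputed.
proteins = pd.Series({"P1": np.nan, "P2": 5.0, "P3": np.nan}, name="abundance")

estimate = top3_with_fallback(peptides, all_missing_percentile=0.1)
imputed = impute_missing(proteins, estimate)
print(imputed)
```

In this toy data, P1 is filled from its three largest peptide values, P2 keeps its measured value, and P3 has no observed peptides, so it receives the assumed quantile fallback. A call to the documented function itself would pass the dataset together with the molecule, column, mapping, and partner_column names from the signature above.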