Crandore Hub

lfproQC

Quality Control for Label-Free Proteomics Expression Data

Label-free bottom-up proteomics expression data is often affected by data heterogeneity and missing values. Normalization and missing value imputation are commonly used techniques to address these issues and make the dataset suitable for further downstream analysis. This package provides an optimal combination of normalization and imputation methods for the dataset. The package utilizes three normalization methods and three imputation methods.The statistical evaluation measures named pooled co-efficient of variance, pooled estimate of variance and pooled median absolute deviation are used for selecting the best combination of normalization and imputation method for the given dataset. The user can also visualize the results by using various plots available in this package. The user can also perform the differential expression analysis between two sample groups with the function included in this package. The chosen three normalization methods, three imputation methods and three evaluation measures were chosen for this study based on the research papers published by Välikangas et al. (2016) <doi:10.1093/bib/bbw095>, Jin et al. (2021) <doi:10.1038/s41598-021-81279-4> and Srivastava et al. (2023) <doi:10.2174/1574893618666230223150253>.This work has published by Sakthivel et al. (2025) <doi:10.1021/acs.jproteome.4c00552>.

README

<!-- README.md is generated from README.Rmd. Please edit that file -->

## About

<!-- badges: start -->

<!-- badges: end -->

The goal of lfproQC R package is to provide an optimal combination of
normalization and imputation methods for the label-free proteomics
expression dataset.

## Installation

You can install the development version of lfproQC from
[GitHub](https://github.com/) with:

``` r
# install.packages("devtools")
devtools::install_github("kabilansbio/lfproQC", build_manual = TRUE, build_vignette = TRUE)
```

## Example

This is a basic example for finding the best combinations of
normalization and imputation method for the label-free proteomics
expression dataset:

``` r
library(lfproQC)
## basic example code with the example dataset and data groups
yeast <- best_combination(yeast_data, yeast_groups, data_type = "Protein")
yeast$`Best combinations`
#>   PCV_best_combination PEV_best_combination PMAD_best_combination
#> 1     rlr_knn, rlr_lls              vsn_lls               rlr_lls
```

**The overall workflow for using the ‘lfproQC’ package**

<img src="vignettes/images/flow1.png" width="800px" />

Versions across snapshots

VersionRepositoryFileSize
1.4.1 rolling linux/jammy R-4.5 lfproQC_1.4.1.tar.gz 1.5 MiB
1.4.1 rolling linux/noble R-4.5 lfproQC_1.4.1.tar.gz 1.5 MiB
1.4.1 rolling source/ R- lfproQC_1.4.1.tar.gz 1.5 MiB
1.4.1 latest linux/jammy R-4.5 lfproQC_1.4.1.tar.gz 1.5 MiB
1.4.1 latest linux/noble R-4.5 lfproQC_1.4.1.tar.gz 1.5 MiB
1.4.1 latest source/ R- lfproQC_1.4.1.tar.gz 1.5 MiB
1.4.1 2026-04-26 source/ R- lfproQC_1.4.1.tar.gz 1.5 MiB
1.4.1 2026-04-23 source/ R- lfproQC_1.4.1.tar.gz 1.5 MiB
1.4.0 2025-04-20 source/ R- lfproQC_1.4.0.tar.gz 1.5 MiB

Dependencies (latest)

Imports

Suggests