Background: With the exponential growth in available biomedical data, there is a need for data integration methods that can extract information about relationships between the data sets. However, these data sets might have very different characteristics. For interpretable results, data-specific variation needs to be quantified. For this task, Two-way Orthogonal Partial Least Squares (O2PLS) has been proposed. To facilitate application and development of the methodology, free and open-source software is required. However, this is not the case with O2PLS. Results: We introduce OmicsPLS, an open-source implementation of the O2PLS method in R. It can handle both low- and high-dimensional datasets efficiently. Generic methods for inspecting and visualizing results are implemented. Both a standard and faster alternative cross-validation methods are available to determine the number of components. A simulation study shows good performance of OmicsPLS compared to alternatives, in terms of accuracy and CPU runtime. We demonstrate OmicsPLS by integrating genetic and glycomic data. Conclusions: We propose the OmicsPLS R package: a free and open-source implementation of O2PLS for statistical data integration. OmicsPLS is available at https://cran.r-project.org/package=OmicsPLSand can be installed in R via install.packages("OmicsPLS").

Integrating omics datasets with the OmicsPLS package / el Bouhaddani S.; Uh H.-W.; Jongbloed G.; Hayward C.; Klaric L.; Kielbasa S.M.; Houwing-Duistermaat J.. - In: BMC BIOINFORMATICS. - ISSN 1471-2105. - ELETTRONICO. - 19:(2018), pp. 371.1-371.9. [10.1186/s12859-018-2371-3]

Integrating omics datasets with the OmicsPLS package

Houwing-Duistermaat J.
2018

Abstract

Background: With the exponential growth in available biomedical data, there is a need for data integration methods that can extract information about relationships between the data sets. However, these data sets might have very different characteristics. For interpretable results, data-specific variation needs to be quantified. For this task, Two-way Orthogonal Partial Least Squares (O2PLS) has been proposed. To facilitate application and development of the methodology, free and open-source software is required. However, this is not the case with O2PLS. Results: We introduce OmicsPLS, an open-source implementation of the O2PLS method in R. It can handle both low- and high-dimensional datasets efficiently. Generic methods for inspecting and visualizing results are implemented. Both a standard and faster alternative cross-validation methods are available to determine the number of components. A simulation study shows good performance of OmicsPLS compared to alternatives, in terms of accuracy and CPU runtime. We demonstrate OmicsPLS by integrating genetic and glycomic data. Conclusions: We propose the OmicsPLS R package: a free and open-source implementation of O2PLS for statistical data integration. OmicsPLS is available at https://cran.r-project.org/package=OmicsPLSand can be installed in R via install.packages("OmicsPLS").
2018
Integrating omics datasets with the OmicsPLS package / el Bouhaddani S.; Uh H.-W.; Jongbloed G.; Hayward C.; Klaric L.; Kielbasa S.M.; Houwing-Duistermaat J.. - In: BMC BIOINFORMATICS. - ISSN 1471-2105. - ELETTRONICO. - 19:(2018), pp. 371.1-371.9. [10.1186/s12859-018-2371-3]
el Bouhaddani S.; Uh H.-W.; Jongbloed G.; Hayward C.; Klaric L.; Kielbasa S.M.; Houwing-Duistermaat J.
File in questo prodotto:
File Dimensione Formato  
2018_Article_.pdf

accesso aperto

Tipo: Versione (PDF) editoriale
Licenza: Licenza per Accesso Aperto. Creative Commons Attribuzione (CCBY)
Dimensione 1.52 MB
Formato Adobe PDF
1.52 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/879839
Citazioni
  • ???jsp.display-item.citation.pmc??? 15
  • Scopus 46
  • ???jsp.display-item.citation.isi??? 38
social impact