We run local instances of several services focused on Metabolomics, Lipidomics, Statistical analysis and Chemometrics. As the hardware resources are limited, these tools are available only within campus LAN.
All services are running in separate Docker containers:
MetaboAnalyst in a container.... from https://github.com/xia-lab/MetaboAnalyst_Docker
# modify Docker file # prefer Java 8 (Oracle flavor) ... ENV METABOANALYST_VERSION 4.93 ENV METABOANALYST_LINK https://www.dropbox.com/s/9xo4yy3gzqsvyj9/MetaboAnalyst-4.93.war?dl=0 ENV METABOANALYST_FILE_NAME MetaboAnalyst.war ...
# Build the Dockerfile docker build -t metab_docker . # Run the Dockerfile in the interactive mode docker run -ti --rm --name METAB_DOCKER -p 8080:8080 metab_docker # Execute R script loading libraries etc. inside the container Rscript /metab4script.R # Deploy Java cargo inside the container java -jar /opt/payara/payara-micro.jar --deploymentDir /opt/payara/deployments # Look for running MetaboAnalyst at http://localhost:8080/MetaboAnalyst/
# Dockerfile FROM opencpu/base MAINTAINER OK LABEL Description = "ChemRICH 0.1.1 container" ... # prepare Ubuntu for compilations as needed ... # setup Java, jdk contains jre, set PATH RUN apt-get -y install openjdk-11-jdk # check where is Java #RUN update-java-alternatives -l #RUN java -version ENV JAVA_HOME="/usr/lib/jvm/java-1.11.0-openjdk-amd64" ENV PATH $JAVA_HOME/bin:$PATH # configure Java for R RUN R CMD javareconf RUN R -e "install.packages('rJava', repos='http://cran.rstudio.com/')" # install all R packages via R script, wait few hours # XLConnect works with Java 8 till Java 11, no more ADD install_package.R /install_package.R RUN Rscript install_package.R # these packages require special attention # RCurl needs re-installation if ....rcurl.so... error appears RUN R -e "install.packages('devtools', repos='http://cran.rstudio.com/')" RUN R -e "install.packages('RCurl', repos='http://cran.rstudio.com/')" RUN R -e "install.packages('unix', repos='http://cran.rstudio.com/')" # run local installation of the package COPY ChemRICH_0.1.1.tar.gz /ChemRICH_0.1.1.tar.gz RUN R -e "install.packages('ChemRICH_0.1.1.tar.gz', repos = NULL)" # opencpu needs more time for POST and more memory, upload modified configurations (timelimit.post": 900, etc.) COPY defaults.conf /usr/local/lib/R/site-library/opencpu/config/defaults.conf COPY server.conf /etc/opencpu/server.conf # make sure Java can be found in rApache and other daemons not looking in R ldpaths # otherwise RJava loading error will appear RUN echo "/usr/lib/jvm/java-1.11.0-openjdk-amd64/lib/server/" > /etc/ld.so.conf.d/rJava.conf RUN /sbin/ldconfig # add R script if needed in the interactive session ADD run-opencpu-server.R /run-opencpu-server.R # start the service CMD service cron start && /usr/lib/rstudio-server/bin/rserver && apachectl -DFOREGROUND # just in case # ENTRYPOINT ["bin/bash"]
Build and run the container
docker run -t -p 80:80 -p 8004:8004 opencpu/rstudio # help: https://hub.docker.com/r/opencpu/rstudio # help: https://opencpu.github.io/server-manual/opencpu-server.pdf # browse # http://localhost:8004/ocpu/library/ChemRICH/www/ # http://localhost:8004/ocpu/info
This web generator helps to summarize contributions of individual authors and prepare the Author contributions paragraph for a scientific journal
The purpose of this test was to select an appropriate hardware for LC-MS data processing using MS-DIAL 4.20.
Machine A - old office computer
Machine B - Workstation
Machine C - Virtual machine (OpenStack)#
Machine | MS-DIAL threads | Intensity threshold | Library loading... | Peak detection... | Alignment... | Total time | Peak spots |
---|---|---|---|---|---|---|---|
A | 4 | 20,000 cps | 20 min | 90 min | 260 min | 6.2 hrs | 23,840 |
B | 4 | 20,000 cps | 20 min | 34 min | 37 min | 1.5 hrs | 23,840 |
B | 8 | 20,000 cps | 20 min | 27 min | 42 min | 1.5 hrs | 23,840 |
B | 16 | 20,000 cps | 20 min | 23 min | 40 min | 1.4 hrs | 23,840 |
B | 16 | 20,000 cps | 1 min* | 23 min | 36 min | 1.0 hrs | 23,840 |
C | 16 | 20,000 cps | 23 min | 14 min | 46 min | 1.4 hrs | 23,840 |
C | 16 | 20,000 cps | 1 min* | 14 min | 45 min | 1.0 hrs | 23,840 |
C | 16 | 1,000 cps | 23 min | 30 min | 920 min | 16.2 hrs | 27,026 |
Dedicated physical workstation with many CPUs (Threadripper or better) is the best option for processing large datasets with large libraries using MS-DIAL software.