GSoC ‘21 Report | Javed Ali | NRNB | GeneCloudOmics
Mentors
Dr. Mohamed Helmy
Thuy Tien Bui
Introduction
The Google summer of code program has been a great and fun learning experience to me over the past months,
during which I was developing new features to the interactive webapp GeneCloudOmics .
My project idea was to make the exisiting webapp UI uniform, adding some new features to help the Researchers in analysis. I also
added report generation of the analysis which can be used by Researchers for publication.
What is GeneCloudOmics
It is a web server for transcriptome data analysis and visualization. It supports the analysis of microarray and RNASeq data and performs ten different bio-statistical analyses that cover the common analytics for gene expression data.
Furthermore, it gives the users access to several bioinformatics tools to perform 12 different bioinformatics analyses on gene/protein datasets.
It is designed as a one-stop server that helps the users perform all tasks through an intuitive graphical user interface (GUI) that waves the hassle of coding, installing tools, packages or libraries and dealing with operating systems compatibility and versioning issues, some of the complications that make data analysis tasks more challenging for biologists.
GeneCloudOmics is an open-source tool and the website is free and open to all users and there is no login requirement.
GeneCloudOmics provides the user with the following bio-statistical analyses: Pearson and Spearman rank correlations, PCA, k-means and hierarchical clustering, Shannon entropy and noise (square of the coefficient of variation), t-SNE, random forest and SOM analyses.
All analyses include proper high-resolution visualization
Project Goals
Done
- Improving UI of the platform.
- Adding missing buttons or UI elemets in the webapp.
- Adding alternate way(copy paste box) of adding the data for Gene/Protein Analysis.
- Adding option to import the data from GEO Database automatically using accession number.
- Adding support of GEO database import for Microarray
- Adding option to download all the analysis as PNG/PDF
- Adding option to add the analysis to a report in pdf format.
- Adding option to download the final report after the analysis.
- Performing SOM analysis for research work.
Contributions to GeneCloudOmics before GSoC:
- Added Docker Support: #48
- Created & hosted a static webpage for AbioTrans(GeneCloudOmics) on git-pages.(link)
- Completed a Manual Test for the GeneCloudOmics using this Open Source Data and submitted the corresponding report.
Commits during GSoC period along with Goals
- Adding UI improvements and submit buttons
- Adding textbox for accession numbers in gene set and protein set analysis.
- Adding GEO Import from Accession Numnber
- Adding support for RnaSeq in GEO Import
- Adding download of plots in Transcriptome analysis
- Adding download of report as PDF
** Pull Request during gsoc period**
https://github.com/buithuytien/GeneCloudOmics/pull/61
Working Screengrabs:
- UI Improvements
Before
After
- Text Box for Gene/Protien Input view
Before
After
- GEO Data Import
- Dowload PNG/PDF
- Adding to report
- Genrating Report
Future Aspects
There are more feature which can be added:
- Improve SOM analysis for Research purpose.
- Add report export options like email or save to drive.
Important Links
- GeneCloudOmics live portal
- GeneCloudOmics github repo
- GSoC GeneCloudOmics issue
- GSoC @NRNB
- GSoC @NRNB GitHub
- GitHub Javed Ali
- GSoC Project Link
Contact
- Email: javedali9582@gmail.com