At the request of one of our clients, we recently evaluated the combination of ADAM and Apache Spark for processing large-scale genomics data. The steep increase in the amount of available genomics data creates a need for tools that can store and handle large volumes of it while keeping the necessary computational steps feasible within a reasonable amount of time. The combination of ADAM and Apache Spark proved successful.
This poster will be presented at Bio-IT World 2015, one of the largest bioinformatics conferences in the world. Also read our other poster on multi-omics analysis in tranSMART.