Hadoop MapReduce
On parallelizing graph theoretical approaches for identifying causal genes and pathways from very large biological networks
In this work, Hadoop's distributed storage system has been used to store the molecular interaction network. Graph parallel processing techniques of Hadoop MapReduce, in conjunction with graph theoretical approaches have been utilized to improve the accuracy of detecting causal genes and execution time on benchmark data.