In this article, I would like to talk about visualizing sample diversity and about using Shannon entropy to measure it.

My background is in bioinformatics, so I will use a normalized RNA-seq dataset in this article. Alternatively, you can use any other dataset that contains multiple features per sample and a numerical value for each feature. You can also generate a random synthetic dataset.
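If you do not have an RNA-seq dataset at hand, a minimal sketch like the one below could generate a synthetic one and compute a Shannon entropy value per sample. The function name `shannon_entropy`, the log-normal distribution, the matrix dimensions, and the choice of log base 2 are illustrative assumptions, not a prescribed pipeline.

```python
import numpy as np


def shannon_entropy(values, base=2.0):
    """Shannon entropy of one sample, treating its feature values as weights.

    Hypothetical helper for illustration: values are rescaled to proportions
    p_i that sum to 1, and H = -sum(p_i * log(p_i)) is returned in the given base.
    """
    values = np.asarray(values, dtype=float)
    total = values.sum()
    if total <= 0:
        return 0.0  # no signal at all: define the entropy as zero
    p = values[values > 0] / total  # drop zeros, since 0 * log(0) is taken as 0
    return float(-(p * np.log(p)).sum() / np.log(base))


# Synthetic stand-in for a normalized expression matrix (assumption):
# rows are samples, columns are features (e.g. genes), all values positive.
rng = np.random.default_rng(seed=0)
n_samples, n_features = 6, 1000
expression = rng.lognormal(mean=1.0, sigma=1.5, size=(n_samples, n_features))

# One entropy value per sample; higher means the signal is spread more evenly.
entropies = np.array([shannon_entropy(row) for row in expression])
print(entropies)
```

With base 2, the entropy of a sample ranges from 0 (all signal in a single feature) up to log2 of the number of features (all features equally abundant), which makes it a convenient one-number summary of diversity to plot per sample.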

Each time my raw sequencing data has been preprocessed and normalized, I look forward to my favorite part: visualization. For me this is the most interesting and inspiring part…

Ksenia Troshchenkova

My fav drink is gene and tonic. Adding informatics to my bio.
