Arun Seetharam bio photo

Arun Seetharam

Bioinformatician

Email Twitter Github

Synergistically Developed Resources

Overview

Educational

Bioinformatics Workbook

Bioinformatics and biological data analysis are an important aspect of biological research. Researchers in the genomic era ask larger questions that routinely require dealing with large amounts of data that require familiarity with medium to large scale computing. We created a GitHub based bioinformatics workbook bioinformaticsworkbook.org, that can serve as an online knowledge repository for scientists. This workbook covers variety of problems, with the walk-through examples.

Website bioinformaticsworkbook.org

If you use the workbook for your research, please cite us:

Arun Seetharam, Siva Chudalayandi, Rick Masonbrink, Usha Muppirala, Adam Rivers, Maryam Sayadi, & Andrew Severin. (2019, October 11). ISUgenomics/bioinformatics-workbook: 2019-10-11 Release of the Bioinformatics Workbook (Version v1.0.0). Zenodo DOI

Workshop on Basic UNIX

This exercise is designed to provide the basic skills required for working in the UNIX environment, using plenty of relevant examples, specifically for biologists. If you are using your personal computer, make sure that you have downloaded the files required for the workshop. This exercise will provide you information regarding navigation, files and directory creation/modification and some administrative things related to file permissions.

Workshop on Phylogenomics

This phylogenomics tutorial is divided into 3 major sections. The first section deals with identification of orthologs from closely related plasmodium species. Second section is about multiple sequence alignment and construction of super matrix. The final section will show simple phylogenetic tree construction using the Maximum Likelihood and Bayesian analyses.

Web resources

NAM-genomes.org: is consortium website containing information regarding the maize NAM founder lines genome assembly and annotation. The website includes links to raw data and final results distributed via MaizeGDB. The genomes have been made public on Jan 9th, 2020.

GenomeQC is a user-friendly and interactive platform that generates descriptive summaries with intuitive graphics for genome assemblies and structural annotations. It also benchmarks user supplied assemblies and annotations against the publicly available reference genomes of their choice.

SeriolaDB is a website designed as a centralized knowledge base for Seriola researchers and producers. It contains genomics, genetics, breeding and phenotypic data along with protocols and tools to translate this information in to practical applications for culture activities. SeriolaDB uses GMOD Tripal and Drupal’s luggage extension to manage the genomics data and empower efficient web development by multiple users.

SCNbase is a website designed as a centralized knowledge base for Soybean Cyst Nematode (SCN) researchers and producers. It contains genomics, genetics, population data along tools to translate this information in to practical applications. SCNBase uses GMOD Tripal and Drupal’s luggage extension to manage the genomics data and empower efficient web development by multiple users.

Software

pyrpipe: is python package to easily develop RNA-Seq analyses workflow by integrating popular RNA-Seq analysis programs in an object oriented manner.

SequelTools: is a fast and easy to install command-line program that provides a collection of tools for working with multiple SMRTcells of BAM format PacBio Sequel raw sequence data.

phylostratr: A fully automated R framework predict and explore the age of genes using phylostratigraphic methods. The program selects species for balanced representation, retrieving sequences, building databases, inferring phylostrata and returning diagnostics.

fagin: a synteny-based pipeline in the R framework. fagin determines the genomic context of each query gene in a focal species compared to homologous sequence in target species.