RESOURCES

https://www.genome.gov/10005107/the-encode-project-encyclopedia-of-dna-elements/
https://genome.ucsc.edu/encode/
https://www.ncbi.nlm.nih.gov/geo/
https://www.ncbi.nlm.nih.gov/gap
https://www.kaggle.com/datasets?sortBy=relevance&group=featured&search=cancer https://www.kaggle.com/xiaotawkaggle/inhibitors
https://www.insilicodb.com/
ClinVar - a database of relationships asserted between genetic variation and observed health status
SURVIVOR - Toolset for SV simulation, comparison and filtering
article - dbVar structural variant cluster set for data analysis and variant comparison
Cancer Epitopes CSHL - given an SRA ID, prioritize and quantify variants with respect to immunogenicity (single score) + variant annotation
Master_gff3_parser - Convert sequence IDs between ucsc/refseq/genbank
here - Human Genome Resources at NCBI
RefSeqGene - but this is probably the least ambiguous for grabbing enhancer regions!
markdown-here - Google Chrome, Firefox, and Thunderbird extension that lets you write email in Markdown and render it before sending.
getfasta - extracts sequences from a FASTA file for each of the intervals defined in a BED/GFF/VCF file.
Master_gff3_parser - Convert sequence IDs between ucsc/refseq/genbank
giab_data_indexes - This repository contains data indexes from NIST's Genome in a Bottle project.
awsgiab - Genome in a Bottle on AWS
tutorial on using SRA and dbGaP
biotools - A list of useful bioinformatics resources
Application for Participation in August (14-16) 2017 NCBI-Style Bioinformatics Hackathon at NLM (National Library of Medicine in Bethesda)
articles - describes some of Ben Busby's experiences in running hackathons.
hackathons - lists NCBI code results

COLLABORATIVE RESEARCH EVENTS (aka “Hackathons”)

SVAI uses “collaborative research events” (CREs) to apply the power of the computational biology community to the needs of targeted patient communities. Like a movie production company, CREs pool the talents of a diverse set of participants for a short period of time to create a well defined deliverable. CRE deliverables increase knowledge about the disease of interest and can be leveraged in future CREs. For example the July 2017 CRE lasted 3 days and centered on NF2. During that time, several groups made especially significant progress.

The “AutoNF2 Team” extracted interaction networks from complex genomic data using “deep learning” by feeding an Autoencoder a vector of gene expressions (FPKM) to doing dimensionality reduction. Large network weights indicated which feature was important. Using this approach, the team postulated “miR-200a as a potential circulating marker for NF2”
The “C3 Team” created 7 mutational signature clusters of brain cancer using 290 glioblastomas and 286 gliomas from TCGA. Cluster analysis suggested that drugs targeting homologous repair, e.g. PARP inhibitors, could be used to treat the NF2 patient.

The p1RCC CRE will apply these NF2 CRE methods to the p1RCC dataset.

Resources