Blog

2nd Viral Bioinformatics and Genomics Training Course (1st – 5th August 2016)

August 10, 2016
3 Comments

We have shared our knowledge on Viral bioinformatics and genomics with yet another clever and friendly bunch of researchers. Sixteen delegates from across the world joined us for a week of intensive training. The line-up of instructors changed slightly due to the departure of Gavin Wilkie earlier in the year. Instructors: Joseph Hughes (Course Organiser) […]

April 25, 2016
1 Comment

If you are using BaseSpace for sample entry but demultiplexing your data manually, you may have been frustrated that there is no facility to download your sample names and index tag data from BaseSpace as a sample sheet. This means you have to enter the same data twice – with the possibility of errors creeping […]

April 25, 2016
4 Comments

Sequence runs on NGS instruments are typically carried out with multiple samples pooled together. An index tag (also called a barcode) consisting of a unique sequence of between 6 and 12bp is added to each sample so that the sequence reads from different samples can be identified. On the Illumina MiSeq, the process of demultiplexing […]

April 20, 2016
4 Comments

I use NCBI Entrez Direct UNIX E-utilities regularly for sequence and data retrieval from NCBI. These UNIX utils can be combined with any UNIX commands. It is available to download from the NCBI website: ftp://ftp.ncbi.nlm.nih.gov/entrez/entrezdirect/ A few useful examples for NCBI edirect utilities. Download a sequence in fasta format from NCBI using accession number esearch -db […]

April 20, 2016
1 Comment

Here are my slides from a session on NGS data formats and analyses that I gave as part of the EPIZONE Workshop on Next Generation Sequencing applications and Bioinformatics in Brussels in April 2016. It covers file formats such as FASTA, FASTQ, SAM, BAM, and VCF, and also goes over IUAPAC nucleotide ambiguity codes, read names, quality […]

April 4, 2016
No Comment

In this blog we describe how to import lists of sample data with defined index tags into BaseSpace, and provide templates for TruSeqLT and TruSeqHT libraries. We have found this saves a lot of time and eliminates errors associated with manual entry. The Illumina NextSeq500 sequencer requires all users to complete sample data entry on […]

April 1, 2016
No Comment

These are the steps I used to create a batch of bootable BioLinux Live USB sticks – with persistent data so that any data files created/downloaded would be preserved. This was used for a course so that each stick had the same NGS data and the same additional (non-BioLinux) programs pre-installed and already configured. Step 1 […]

April 1, 2016
1 Comment

Imagine having two different servers called darwin and linnaeus. Imagine that darwin is a great server with loads of RAM for doing de-novo assembly and that linnaeus has loads of nodes so a great server for splitting up jobs and running lots of jobs in parallel. To make good use of all these resources, it […]

April 1, 2016
No Comment

Sharing large files with collaborators has rarely been a problem, we usually just compress them and put them on our web server and then send the link to our collaborator who can then download the file. However, we have struggled to find a solution to receive large files. We usually run out of space in […]

February 15, 2016
1 Comment

Count-based differential expression analysis of sequencing data is one of the best known pipeline in bioinformatics analysis. In this pipeline, the vital step is to estimate the reads count of each genomic features. After counting the features, the differential expression(DE) analysis tools are used for getting the differential expression list of genomic features. It has been […]

2nd Viral Bioinformatics and Genomics Training Course (1st – 5th August 2016)

How to generate a Sample Sheet from sample/index data in BaseSpace

How to demultiplex Illumina data and generate fastq files using bcl2fastq

NCBI Entrez Direct UNIX E-utilities

NGS Data Formats and Analyses

How to Import data for libraries with index tags into BaseSpace

How to make a BioLinux Live USB Stick – with persistent data storage

Submitting a job to run on another server and retrieving the results

Setting up an Amazon ftp server to receive big files

featureCounts or htseq-count?