For this workshop, we have also downloaded all relevant annotation files from public databases (UCSC Genome Browser and GENCODE) and placed them in the /work3/NRPB1219 folder.
The Repeating Elements information provided by the UCSC Genome Browser is split by chromosome, i.e. one data file per chromosome. We will use a shell script to demonstrate batch downloading the data files and joining them into a single file for ease of manipulation.
cd ~/Data
wget --no-check-certificate https://raw.githubusercontent.com/ycl6/MethylationWorkshop2014/master/download_rmsk.sh
sh download_rmsk.sh
Use ls to check that the file is in the "Data" folder.
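For readers curious how a batch download and join can be written, the following is a minimal sketch of the general technique. It is not the contents of download_rmsk.sh. The function names fetch_rmsk and join_rmsk, the base URL argument, and the per-chromosome file naming pattern <chrom>_rmsk.txt.gz are all assumptions made for illustration.

```shell
#!/bin/sh
# Hypothetical sketch of a batch download-and-join; NOT the actual download_rmsk.sh.
# Assumption: each chromosome has a gzipped file named <chrom>_rmsk.txt.gz.

# fetch_rmsk BASE_URL CHROM...
# Download one gzipped file per chromosome into the current directory.
fetch_rmsk() {
    base_url=$1
    shift
    for chrom in "$@"; do
        wget -q "${base_url}/${chrom}_rmsk.txt.gz" || return 1
    done
}

# join_rmsk OUTPUT CHROM...
# Decompress each per-chromosome file and append it to OUTPUT,
# in the order the chromosomes are listed.
join_rmsk() {
    out=$1
    shift
    : > "$out"    # create or truncate the output file
    for chrom in "$@"; do
        gzip -dc "${chrom}_rmsk.txt.gz" >> "$out" || return 1
    done
}
```

With these definitions loaded, a call such as `fetch_rmsk "$BASE_URL" chr1 chr2` followed by `join_rmsk rmsk.txt chr1 chr2` would leave a single rmsk.txt in the current directory, where BASE_URL is a placeholder for the download location and rmsk.txt is an assumed output name. Listing the chromosomes explicitly keeps the joined file in a predictable order.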