Bash: A Command Language Interpreter, Part 2
Write a bash script that splits the saccharomyces_cerevisiae.gff file from the previous example into two files. • Onecontainingrecordsforgeneswithverifiedopenreadingframes(ORFs).
• Andanothercontainingtherestofthegenes.
All the information to do this is contained in saccharomyces_cerevisiae.gff, you only need to determine how to extract it.
Remember that the url for the file is https://downloads.yeastgenome.org/curation/chromosomal_feature/saccharomyces_ cerevisiae.gff.
Recall that a GFF file contains both a table of features and FASTA formatted genome sequences. The description of the format is described at https://github.com/The-Sequence-Ontology/Specifications/blob/master/gff3.md.
Submitting
Feel free to discuss your answers with each other and ask for help. Google is also very helpful to understand what commands do.
When you are finished upload your annotated script to Canvas. Remember to include the information/references of any resources you used to complete the assignment (apart from manual pages and help messages). This includes fellow students.
1of1
BIO/MAT/MBB 355 2020.A2