Computational Concepts
in Biology

This article is about a lecture that is held at the University of Vienna in the winter term. It is a part of the Master's programme in Computational Science, which can be accessed by graduates of a Bachelor's programme in Computer Science, Mathematics or Natural Sciences.

Written by Claus Volko
Vienna, Austria, Europe

Contact: cdvolko (at) gmail (dot) com
Homepage: www.cdvolko.net

At the University of Vienna, an interesting new Master's programme was introduced only a couple of years ago (in 2013). This Master's programme is called "Computational Science" and it is highly interdisciplinary. To be admitted for this Master's programme, you need to have a Bachelor's degree in Computer Science, Mathematics, Biology, Physics, Chemistry, Astronomy, Geology or a related field. The Master's programme has a minimum duration of two years and afterwards, you can enroll for a PhD programme.

"Computational Science" is all about research in natural sciences that is done using computers and most of all self-written computer programs. It it thus an ideal study programme for people who are both into computers as well as natural sciences. Depending on what type of Bachelor's degree students have, they either have to attend basic lectures in mathematics, basic lectures in computer science or advanced lectures in these fields. In addition, they have to attend lectures about all of the aforementioned branches of science - physics, chemistry, biology, geology and astronomy, especially lectures about computational approaches to scientific problems. Moreover, students have to complete a Master's thesis.

The lecture "Computational Concepts in Biology I" is a new lecture that had not been held at the University of Vienna before this new Master's programme was freshly introduced. It is an obligatory lecture for all students of the Master's programme. It is held in the winter term, two academic hours a week. Since in the winter term 2017/2018 it was held in the late afternoon, I was able to attend this lecture although I am already working in the software industry and not a student any more. There were a couple of times when I decided not to go to the lecture since it was very cold outside, so I prefered to stay at home, but most of the times, I was there. In this article I am going to tell you about what I learned in the lecture. In addition, I would like to mention that there is also a lecture called "Computational Concepts in Biology II", which is held in the summer term; we will see whether I will find time to attend it as well.

The lecturers involved in "Computational Concepts in Biology I" are Thomas Rattei, Bojan Zagrovic, Andrea Tanzer, Gerhard Ecker, Arndt von Haeseler, Ivo Hofacker and Christoph Bock. Most of them are employees of the University of Vienna. The exception is Christoph Bock who is working at the CeMM, a research institute that belongs to the Austrian Academy of Sciences.

Computational Biology is quite a broad field and it basically consists of two components: Bioinformatics and Computational Systems Biology. The lecture "Computational Concepts in Biology I" is more about the former subfield, while the latter subfield will be dealt with in the lecture "Computational Concepts in Biology II".

Bioinformatics is actually primarily about nucleic acids (DNA, RNA) and proteins. Nucleic acids make up the substance in which the genetic information of a cell is stored. They are located in the cell's nucleus, which is why they are called nucleic acids. Proteins are the product that results of transcription and translation of the nucleic acids. Inside an organism, proteins mainly serve two purposes: First, some of them are enzymes that make biochemical reactions possible. Second, some of them are so-called structural proteins, which means that they contribute to the constitution of the body.

There are huge databases with DNA sequences and these databases have to be processed by computer programs. That's what bioinformatics is all about. One application in particular is sequence alignment. The purpose is to discover relationships between genetic sequences. For example, it may be that two organisms of different species are related to each other, but they slightly differ in some of their DNA sequences. With sequence alignment algorithms such as the Needleman-Wunsch and the Smith-Waterman algorithms, it is possible to discover relationships between different DNA sequences. This allows researchers to speculate about the phylogeny, i. e. how these organisms are related to each other. In this context, the tools BLAST and FASTA are also well-known.

What I found especially interesting was Bojan Zagrovic's part of the lecture. He deals with computational biophysics of proteins. Proteins are large molecules composed of hundreds or even thousands of amino acids. It is difficult to predict the function of a protein in an organism just from its amino acid sequence unless one manages to visualize the protein in 3D. But to discover the correct folding of protein, enormous computational power is needed. According to Zagrovic, with today's computational power it is already hard work for the computer just to correctly simulate the folding behaviour of a protein for a period of a hundred nanoseconds. That is also why distributed computing is often used for this purpose. If you are a Windows user, you can download and install the program "Folding@home" from Stanford University on your computer and run it whenever you have nothing else to do. In this way you can actively support research in computational biophysics of proteins without actually doing anything but providing computational power.

Andrea Tanzer's part of the lecture was a basic introduction to modern molecular biology and genetics, which was not new for me, not only because of my medical studies but because I had already learned about these things at grammar school. For some of the other students, it was quite a tough part of the lecture.

Gerhard Ecker talked about pharmacoinformatics. It is interesting that artificial intelligence is already being used to identify potential drug candidates.

The lecture "Computational Concepts in Biology II" will be more about Computational Systems Biology, so the lecturers said. Computational Systems Biology is the science of creating computer models of biological processes and simulate entire organisms and ecosystems on the computer.

I am already curious what we are going to learn in this lecture in the upcoming summer term, and I am happy and grateful that I was able to attend the lecture in the winter term although I am officially not a student any more.

I also maintain a website about Computational Systems Biology and Artificial Life, the so-called "Web Portal on Computational Biology", which can be accessed by the following URL: http://www.computational-biology.life/

Claus Volko