Protein sequences from different organisms may be viewed as texts written in different languages. The mapping of protein sequences to their structure, dynamics, and function then becomes analogous to the mapping of words to meaning in natural languages. This analogy can be exploited by applying statistical language modeling and text classification techniques to biological sequences, thereby generating testable hypotheses about the fundamental building blocks of a "protein sequence language". The biology-language analogy enables novel applications of language technologies in the biology domain, but it also overlaps to a great extent with other existing computational biology and bioinformatics applications. The purpose of the Biological Language Conference is to facilitate scientific exchange between researchers who use the language analogy directly and researchers who use other approaches.