Gene prediction refers to algorithmically identifying stretches of DNA sequences that are biologically functional. In the old days, gene prediction was a very painstaking and difficult process. Today, thanks to comprehensive genome sequencing and powerful computational resources, gene prediction is largely a computational problem.
Gene prediction is used to find a functional sequence. In other words, a region of the DNA which is coding for a protein or mRNA. Regulatory regions, regions of DNA that regulate gene expression, are also considered functional. Gene prediction does not tell us which genes code for which proteins.
There are two primary approaches for predicting genes:
• Intrinsic approach – Ab Initio
• Extrinsic approaches – homology-based