An open reading frame (ORF) is a continuous stretch of nucleotide sequence that starts with a start codon and continues without interruption until a stop codon. Such a segment has the capacity to be translated into a polypeptide. Multiple reading frames exist in a given DNA or RNA molecule, and an ORF refers to one frame that could encode a protein.
Production and Detection
In double‑stranded DNA there are six possible reading frames, three on each strand. An ORF begins at an initiation codon, commonly AUG in RNA or ATG in DNA, and ends immediately before a termination codon such as UAA, UAG or UGA. Bioinformatics tools scan genomic sequences to identify ORFs, which helps in predicting protein‑coding genes. However, not every detected ORF corresponds to a functional gene; random sequences may produce long ORFs by chance. Gene annotation therefore combines ORF prediction with additional evidence such as conservation, expression data and known protein domains.
Short ORFs, sometimes called sORFs, encode micropeptides and often escaped early gene predictions. Advanced sequencing and ribosome profiling have revealed that many small ORFs are translated and can play roles in regulation. Viruses often use overlapping ORFs to maximize coding capacity in compact genomes. Overlap allows different proteins to be encoded within the same nucleotide sequence, offset by one or more nucleotides.
Examples and Significance
In bacteriophage lambda the cI repressor gene and cro gene overlap, using different reading frames. Hepatitis B virus employs multiple overlapping ORFs to encode surface antigen, core antigen and polymerase from a small genome. Mitochondrial genomes also feature compact ORFs due to limited space. Identifying ORFs aids in genome annotation, comparative genomics and understanding how organisms encode proteins. It also underpins the design of expression constructs in molecular cloning, ensuring that inserted sequences maintain the correct frame.
An ORF represents a potential protein‑coding region defined by start and stop codons. While many ORFs correspond to functional genes, others are incidental. Careful analysis is needed to determine which ORFs are truly translated and contribute to cellular or viral function.
Related Terms: Start Codon, Stop Codon, Coding Sequence, Reading Frame, Translation
