Genetics Language Models

Applications

Targets

Research

Genomic language model predicts protein co-regulation and function

In their paper, the authors show that genomic language models can be trained on top of protein language models (ESM2) "on millions of metagenomic scaffolds to learn the latent functional and regulatory relationships between genes." Their findings reveal "a promising approach to encode functional semantics and regulatory syntax of genes in their genomic context and uncover complex relationships between genes in a genomic region."
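To make the idea of modelling genes in their genomic context more concrete, here is a minimal sketch of how a contig could be prepared for a masked-prediction objective over per-gene embeddings. It is an illustration under stated assumptions, not the authors' code: the masking objective, the 15% mask rate, the zero-vector mask, the 4-dimensional toy vectors (stand-ins for real ESM2 embeddings), and the function name `mask_contig` are all hypothetical.

```python
import random

MASK_RATE = 0.15  # assumed masking fraction; not stated in the text above


def mask_contig(gene_embeddings, mask_rate=MASK_RATE, rng=None):
    """Hide a random subset of per-gene embeddings behind a zero vector.

    gene_embeddings: list of equal-length float lists, one per gene,
    in genomic order. Returns (masked_inputs, targets), where targets
    maps each masked position to the original embedding a model would
    be asked to predict from the surrounding genes.
    """
    rng = rng or random.Random(0)
    n = len(gene_embeddings)
    dim = len(gene_embeddings[0])
    n_mask = max(1, round(n * mask_rate))
    positions = sorted(rng.sample(range(n), n_mask))

    masked = [list(vec) for vec in gene_embeddings]  # copy, keep the input intact
    targets = {}
    for pos in positions:
        targets[pos] = list(gene_embeddings[pos])
        masked[pos] = [0.0] * dim  # hypothetical mask representation
    return masked, targets


# Toy contig: 20 genes, each with a 4-dimensional stand-in for a protein embedding.
toy_rng = random.Random(42)
contig = [[toy_rng.gauss(0, 1) for _ in range(4)] for _ in range(20)]
masked_contig, targets = mask_contig(contig)
```

In a real pipeline the toy vectors would be replaced by embeddings computed from each gene's protein sequence, and a sequence model would be trained to recover the entries in `targets` from `masked_contig`.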

RegLM - a toolkit for training HyenaDNA-based autoregressive language models on DNA sequences

Developments

In their paper, the authors present a model capable of generating cis-regulatory elements (CREs), such as promoters and enhancers, that can regulate the expression of genes. These are useful for biomanufacturing as well as therapeutic applications.
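Autoregressive generation of a DNA sequence can be sketched with a toy example. The snippet below is not the RegLM API and does not use HyenaDNA: a simple k-mer count model stands in for the neural network, and the training sequences, function names, and parameters are made up for illustration. It only shows the generation loop, in which each new nucleotide is sampled conditioned on the sequence produced so far.

```python
import random
from collections import Counter, defaultdict

ALPHABET = "ACGT"


def fit_kmer_model(sequences, k=3):
    """Count which nucleotide follows each k-mer context in the training data."""
    counts = defaultdict(Counter)
    for seq in sequences:
        for i in range(len(seq) - k):
            counts[seq[i:i + k]][seq[i + k]] += 1
    return counts


def generate(counts, prompt, length, k=3, rng=None):
    """Extend `prompt` one nucleotide at a time until it reaches `length`."""
    rng = rng or random.Random(0)
    seq = prompt
    while len(seq) < length:
        context = seq[-k:]
        options = counts.get(context)
        if options:
            # Sample the next base in proportion to how often it followed this context.
            bases = list(options)
            weights = [options[b] for b in bases]
            seq += rng.choices(bases, weights=weights)[0]
        else:
            # Unseen context: fall back to a uniform choice.
            seq += rng.choice(ALPHABET)
    return seq


# Made-up training sequences, purely for illustration.
training = ["TATAAAGGCGCGTATAAACC", "GGTATAAAGCGCGCTATAAA"]
model = fit_kmer_model(training)
sample = generate(model, prompt="TAT", length=30)
```

A neural autoregressive model replaces the count table with a learned next-token distribution, but the sampling loop has the same shape.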

<img width="1151" alt="image" src="https://github.com/user-attachments/assets/ada05f41-b21f-4e16-a1c6-0b80f658dc6c">