This paper is now widely used as a foundational source for benchmarking genomic language models. In it, Prof. Baillie demostrated the potential of clustering methodologies to identify genes implicated in biological pathways, including viral response.