Deborah Siegele

Associate Professor

Fax: 979-845-2891
Email:
siegele@bio.tamu.edu

Office:
3258 TAMU
Biological Sciences Building West
Room 233A
979-862-4022

Lab:
Biological Sciences Building West
Room 233
979-862-4159

Joined the Department in 1992

  • B.A., 1976, Northwestern University, Biochemistry.
  • Ph.D., 1989, University of Wisconsin, Madison, Molecular Biology.
  • Postdoctoral research: Harvard Medical School.

Associations:

American Society of Microbiology
Genetics Society of America
Sigma Xi

The value of society’s investment in science is strongly dependent on the ability of future scientists to build on previous results. As the pace of scientific productivity has accelerated through genomics and other Big Data technologies, scientists are increasingly dependent on computational tools to find information relevant to their own research. My group is working on projects to make published data more reusable through biocuration, which is the association of biological data with annotations that can be used in computer-based data mining and analyses.

The work involves using controlled vocabularies, such as ontologies, to provide both consistent terminology and a structured data format for the capture of biological information. An ontology consists of a controlled vocabulary of defined terms with unique identifiers and precise relationships to each other.

We are using the widely used Gene Ontology to make functional annotations to the gene products of bacteria and their viruses (bacteriophage). In addition, with collaborators at the Institute of Genomic Sciences at the University of Maryland Medical School, we have developed a new ontology for capturing phenotype information about all microorganisms: the Ontology for Microbial Phenotypes (OMP).

Phenotypes are the observable characteristics of an organism that result from the expression of a particular genotype in a particular environment. For example, eye color, number of seeds per pod, and coat color are phenotypic traits that can be observed in flies, legumes, and cats, respectively. Phenotype-genotype associations in well-studied model organisms can be a powerful tool for predicting biological function in less well-studied organisms. However, to use known phenotypic information in one organism to predict possible phenotypes in other organisms requires that phenotype information is stored in a consistent, computable format for ease of data integration and mining.

Until recently, phenotypic information has largely been captured as free text descriptions in primary research papers. The ambiguities in natural language confound attempts to retrieve information across sources. For example, “serotype” and “serovar” both refer to the same phenotype, but a simple text-based computer query with either word alone would miss the other. Or, a single term may be ambiguous: “sporulation” can be used to describe the general process of spore formation, but a search just for “sporulation” would return results for sporulation to survive adverse conditions (such as endospore formation in Gram-positive bacteria) or sporulation for the purpose of reproduction (such as found in the Actinomyces).

Currently, there are phenotype and anatomy ontologies in common use for many eukaryotic organisms, including fungi. However, none of the existing ontologies is appropriate to comprehensively capture phenotypes for Bacteria or Archaea or their phages or for comparisons across microbial species. This was the impetus for developing OMP. The OMP ontology and annotations of bacterial and fungal phenotypes can be accessed via a wiki-based ontology browser: microbialphenotypes.org. OMP releases can be downloaded from Github.

  1. Ramsey, J, McIntosh, B, Renfro, D, Aleksander, SA, LaBonte, S, Ross, C et al.. Crowdsourcing biocuration: The Community Assessment of Community Annotation with Ontologies (CACAO). PLoS Comput Biol. 2021;17 (10):e1009463. doi: 10.1371/journal.pcbi.1009463. PubMed PMID:34710081 PubMed Central PMC8553046.
  2. Wu, PI, Ross, C, Siegele, DA, Hu, JC. Insights from the reanalysis of high-throughput chemical genomics data for Escherichia coli K-12. G3 (Bethesda). 2021;11 (1):. doi: 10.1093/g3journal/jkaa035. PubMed PMID:33561236 PubMed Central PMC8022724.
  3. Hudson, MA, Siegele, DA, Lockless, SW. Use of a Fluorescence-Based Assay To Measure Escherichia coli Membrane Potential Changes in High Throughput. Antimicrob Agents Chemother. 2020;64 (9):. doi: 10.1128/AAC.00910-20. PubMed PMID:32631824 PubMed Central PMC7449193.
  4. Siegele, DA, LaBonte, SA, Wu, PI, Chibucos, MC, Nandendla, S, Giglio, MG et al.. Phenotype annotation with the ontology of microbial phenotypes (OMP). J Biomed Semantics. 2019;10 (1):13. doi: 10.1186/s13326-019-0205-5. PubMed PMID:31307550 PubMed Central PMC6631659.
  5. Xia, J, Chiu, LY, Nehring, RB, Bravo Núñez, MA, Mei, Q, Perez, M et al.. Bacteria-to-Human Protein Networks Reveal Origins of Endogenous DNA Damage. Cell. 2019;176 (1-2):127-143.e24. doi: 10.1016/j.cell.2018.12.008. PubMed PMID:30633903 PubMed Central PMC6344048.
  6. Giglio, M, Tauber, R, Nadendla, S, Munro, J, Olley, D, Ball, S et al.. ECO, the Evidence & Conclusion Ontology: community standard for evidence information. Nucleic Acids Res. 2019;47 (D1):D1186-D1194. doi: 10.1093/nar/gky1036. PubMed PMID:30407590 PubMed Central PMC6323956.
  7. Chibucos, MC, Siegele, DA, Hu, JC, Giglio, M. The Evidence and Conclusion Ontology (ECO): Supporting GO Annotations. Methods Mol Biol. 2017;1446 :245-259. doi: 10.1007/978-1-4939-3743-1_18. PubMed PMID:27812948 PubMed Central PMC6377151.
  8. Chibucos, MC, Zweifel, AE, Herrera, JC, Meza, W, Eslamfam, S, Uetz, P et al.. An ontology for microbial phenotypes. BMC Microbiol. 2014;14 :294. doi: 10.1186/s12866-014-0294-3. PubMed PMID:25433798 PubMed Central PMC4287307.
  9. Lim, B, Miyazaki, R, Neher, S, Siegele, DA, Ito, K, Walter, P et al.. Heat shock transcription factor σ32 co-opts the signal recognition particle to regulate protein homeostasis in E. coli. PLoS Biol. 2013;11 (12):e1001735. doi: 10.1371/journal.pbio.1001735. PubMed PMID:24358019 PubMed Central PMC3866087.
  10. Hu, JC, Sherlock, G, Siegele, DA, Aleksander, SA, Ball, CA, Demeter, J et al.. PortEco: a resource for exploring bacterial biology through high-throughput data and analysis tools. Nucleic Acids Res. 2014;42 (Database issue):D677-84. doi: 10.1093/nar/gkt1203. PubMed PMID:24285306 PubMed Central PMC3965092.
Search PubMed