MPI-INF Logo
Campus Event Calendar

Event Entry

What and Who

Extracting Knowledge from High-Throughput Screening Data: Towards the Generation of Biophore Models

Marc Zimmermann
Fhg-Scai St. Augustin
Talk
AG 1, AG 2, AG 3, AG 4  
MPI Audience

Date, Time and Location

Thursday, 12 June 2003
16:00
-- Not specified --
46.1 - MPII
023
Saarbrücken

Abstract

Due to the pressure on finding new, innovative drug candidates in the pharmaceutical industry, significant technological progress has been made in the fields of combinatorial chemistry and high-throughput screening (HTS). While today HTS is crucial in the lead identification process, its typical outcome is of limited accuracy and contains systematic and statistical errors. As most screens result in many more than a few unique hit structures, there is an increasing demand for approaches interpreting those large data sets in order to extract relevant motifs for follow-up procedures.

Here we present the novel software tool HTSview, designed for the extraction of knowledge from HTS data. The tool combines cheminformatics concepts with data mining and visualization methods towards the identification of appropriate chemical series for optimization. Biophore models are generated that capture relevant motifs from an HTS experiment. In this context, we define a biophore as the fragment-based description of an ensemble of molecules with similar biological activities and structures, based on the feature tree descriptor (M. Rarey and J.S. Dixon, JCAMD, (12), 1998). Feature trees are fragment-based, non-linear, topological descriptors. They describe molecules by a tree structure representing their major chemical building blocks and their connection. Each node corresponds to a small molecular fragment.
The interactive software tool HTSview applies this descriptor to the analysis of HTS data. First, a training set is selected from actives and similar, but inactive compounds. Their feature trees are iteratively aligned onto the largest tree which is used as a reference. This results in a topological template with an invariant core and variable regions. The model consists of a mapping of all feature trees so that the most similar fragments are aligned. From these multiple alignments, biophore models are generated by using statistical methods to identify those features correlated with biological activity. The procedure results in the generation of models, which reveal important structure-activity relationships. Initial results indicate that the approach can be applied to the interpretation of HTS data.

Contact

Ruth Schneppen-Christmann
--email hidden
passcode not visible
logged in users only