Campus Event Calendar

Event Entry

What and Who

Clustering by common friends finds locally significant proteins mediating modules

Dr. Bill Andreopoulos
Biotechnologisches Zentrum, TU-Dresden, Germany
Talk
AG 1, AG 3, AG 4, AG 5, SWS, RG1, MMCI  
Public Audience
English

Date, Time and Location

Wednesday, 16 September 2009
09:00
45 Minutes
E1 4
019
Saarbrücken

Abstract

A challenge in applying density-based clustering algorithms to categorical datasets is that the 'cube' of attribute values has no ordering defined. We propose the HIERDENC algorithm, which builds a hierarchy representing the underlying cluster structure of the categorical dataset. HIERDENC minimizes the number of user-specified input parameters, is insensitive to the order of object input, and can handle outliers. We propose an indexing scheme for HIERDENC for scalable clustering of large datasets.

We present a simplification of HIERDENC, called the MULIC algorithm for multi-layered clustering. We apply this layered clustering algorithm to protein interaction networks, grouping proteins based on the similarity of their direct neighborhoods. We identify locally significant proteins, called mediators, which link different clusters. Clusters and mediators are organized in hierarchies, where clusters are mediated by and act as mediators for other clusters. We compare the clusters and mediators to known yeast complexes and find agreement with precision of 71% and recall of 61%.
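
The idea of grouping proteins by the similarity of their direct neighborhoods can be illustrated with a small sketch. This is not the MULIC or HIERDENC algorithm from the talk; it is a simplified stand-in, assuming Jaccard similarity between neighbor sets, a fixed threshold, and single-linkage merging. The function names, the toy network, and the mediator rule (a protein whose neighbors fall into two or more multi-member clusters) are all hypothetical choices made for illustration.

```python
from itertools import combinations


def neighborhoods(edges):
    """Map each protein to the set of its direct interaction partners."""
    nbrs = {}
    for u, v in edges:
        nbrs.setdefault(u, set()).add(v)
        nbrs.setdefault(v, set()).add(u)
    return nbrs


def jaccard(a, b):
    """Jaccard similarity of two neighbor sets (shared / total friends)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cluster_by_common_friends(edges, threshold=0.5):
    """Single-linkage grouping of proteins with similar neighborhoods.

    Assumption: two proteins are merged when the Jaccard similarity of
    their neighbor sets reaches `threshold` (a hypothetical parameter).
    """
    nbrs = neighborhoods(edges)
    parent = {p: p for p in nbrs}

    def find(p):
        # Union-find root lookup with path halving.
        while parent[p] != p:
            parent[p] = parent[parent[p]]
            p = parent[p]
        return p

    for u, v in combinations(sorted(nbrs), 2):
        if jaccard(nbrs[u], nbrs[v]) >= threshold:
            parent[find(u)] = find(v)

    clusters = {}
    for p in nbrs:
        clusters.setdefault(find(p), set()).add(p)
    return list(clusters.values())


def find_mediators(edges, clusters):
    """Proteins whose neighbors lie in two or more multi-member clusters.

    Assumption: this is an illustrative definition of a mediator, not
    the one used in the talk.
    """
    label = {p: i for i, c in enumerate(clusters) if len(c) > 1 for p in c}
    mediators = set()
    for p, ns in neighborhoods(edges).items():
        touched = {label[n] for n in ns if n in label}
        if len(touched) >= 2:
            mediators.add(p)
    return mediators


# Toy network (invented): a1 and a2 share the friends x and y,
# b1 and b2 share the friends z and w, and m touches both sides.
EDGES = [
    ("a1", "x"), ("a1", "y"), ("a2", "x"), ("a2", "y"),
    ("b1", "z"), ("b1", "w"), ("b2", "z"), ("b2", "w"),
    ("m", "x"), ("m", "z"),
]

CLUSTERS = cluster_by_common_friends(EDGES, threshold=0.5)
MEDIATORS = find_mediators(EDGES, CLUSTERS)
print(sorted(sorted(c) for c in CLUSTERS))
print(sorted(MEDIATORS))
```

In this toy network, a1 and a2 end up together because they have identical neighbor sets, even though they do not interact with each other directly; m stays on its own and is flagged because its two neighbors belong to different clusters.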

In other applications, we use HIERDENC to form triangles by complementing protein interaction networks with structural information. Triangles have ten-fold higher overlap with known yeast complexes than bicliques. Moreover, we visualize protein interaction networks; our success is measured by the percentage of edges collapsed into bicliques, which is as high as 90%. We also developed a large search database of biomedical images and text captions.

As ongoing work, we are applying HIERDENC to cluster large datasets of Force-Distance curves representing protein unfolding pathways.

Contact

Conny Liegl
302-70150

Conny Liegl, 09/03/2009 11:49 -- Created document.