top bar
QuickTopic free message boards logo
Skip to Messages

TOPIC:

An Iterative Improvement Procedure for Hierarchical Clustering

^     All messages            2-17 of 17  1-1 >>
17
louis vuitton handbags
12-06-2010
04:39 AM ET (US)
replica givenchy handbags | louis vuitton cosmic blossom
16
Deleted by topic administrator 10-07-2008 02:27 AM
15
Aleks
02-25-2008
10:05 AM ET (US)
Internet Marketing,promotion of money,eBay, of the reference.
Books of the program-all in one place.
So it is convenient.Email marketing software. Free Adsense Templates page.
14
Deleted by topic administrator 02-25-2008 11:10 AM
13
William
07-21-2006
11:33 PM ET (US)
I have been looking for discussions like this for a long time. Thank you! Visit yasmin lebon webpage devoted to yasmin lebon. aciphex cheap webpage devoted to aciphex cheap. and have fun!
12
Spam deleted by QuickTopic 12-07-2010 02:02 AM
11
Steve Scher
10-12-2004
04:31 PM ET (US)
When assigning non-uniform weights to preference a particular level of granularity, does the hierarchy provide a way to automatically identify "interesting" inherent levels of granularity?

Perhaps this would involve calculating where small weight-changes would have little vs major impact?
10
Gary Tedeschi
10-12-2004
03:06 PM ET (US)
I also agree with Robin, that a key question here is whether we suspect our data to be structured hierarchically (i.e., do we care about the tree); and if so, what is the appropriate number of clusters.

However, based on vague intuition, I think that if their method was compared to a method which combines a hierarchical approach with the flat k-means approach they would lose. For their procedure must maintain a hierarchical structure, whereas a combined approach is allowed to relax this constraint. Since the metric they use to compare methods is the k-means cost function, it seems that a method that is allowed to minimize this cost function directly without constraints during part of its procedure should beat their method.
9
Robin Hewitt
10-12-2004
02:56 PM ET (US)
Sanjeev,

This general topic is an interesting one. The wards-error term is a compactness measure, equivalent to moment of inertia in physical objects. There are some assumptions implicit in this - one being that feature dimensions are orthogonal, something that's often not true in practice.

You can, however, use other error measures, including non-metric ones, with many clustering methods. For example, you can use Shannon entropy within clusters as an error measure. What you want in this case is to minimize surprisal within your clusters. Although non-euclidean, this error measure will work with Wards method, since all you need is a way to measure the increase in total error with each agglomeration step. Like moment of inertia, information-content is a multiple-linkage similarity measure, so it tends to encourage compact (rather than stringy) clusters.

- Robin
8
Sanjeev Kumar
10-12-2004
02:25 PM ET (US)
This question is not directly related to this paper.
From the image segmentation and scene understanding point of view, norm in the cost function used here will probably correspond to euclidean distances in some feature space. Has there been any work to accomodate the importance of clusters representing some semantically meaningful objects? E.g. an image segment representing a person may consist of features which are not necesarily close to each other but together they form a meaningful cluster.
7
Sanjeev Kumar
10-12-2004
01:54 PM ET (US)
I agree with the view of Robin that typically we are not interested in whole range of k. How do we choose w(k) which reflects our prior preference about some particular range of k, considering that even if w(k) is uniform, more emphasis is on smaller values of k.

The present reordering scheme may not remain optimal for arbitrary choices of w(k) (except the example mentioned in paper), so it would be interesting to explore the family of w(k) for which efficient alternatives could be devised.
Edited 10-12-2004 01:58 PM
6
Hamed Masnadi-Shirazi
10-12-2004
01:43 PM ET (US)
I would like to know more about the "cost function" and what it measures. maybe you could explain this by telling us more about the "yeast data" experiment. What is it and what is it trying to achieve?
5
Louka Dlagnekov
10-12-2004
02:33 AM ET (US)
What exactly do they mean by "hillclimbing"?
4
Robin Hewitt
10-11-2004
04:59 PM ET (US)
I think the value of this method depends a lot on how important the tree is to you. Often the tree is just an artifact of the clustering method (e.g., Wards). Typically, one is only interested in a few levels because near the root, the groupings are too diverse to be meaningful and near the leaves you start losing track of things you want to keep together. For example, if you have 20 identical somethings, you don't care at all that this group can theoretically be split down 19 more times.

Given that, I think it would have been good for the authors to compare their method to standard industry pratices for combining Wards and K-means. They're working with such small datasets (n<1000) that it's very feasible to first cluster by Wards, stop at some level, then improve the result with K-means. You'd lose the hierarchy, but as mentioned, that's often of little interest. Or, they could split those clusters divisely to recover a new lower-portion of the tree if they wanted it, using their same cost metric. I would have liked to see a comparison of their method with this type of standard practice.

- Robin
3
Stephen Krotosky
10-11-2004
03:02 PM ET (US)
I'm curious to know what kinds of vision applications has this been used with. Clustering, of course, would be useful for a variety of vision-based tasks, but I'm finding it hard to think of something that would need specific hierarchical clustering.
2
Vincent Rabaud
10-09-2004
09:18 PM ET (US)
Intuitively, it seems pretty dependant on the weight you chose. How dependant is it ? Do they (Sanjoy...) have new results explaining why the average linkage initialization is more efficient. And, what is it for actually ? (If possible, I'd like something else than your scale stuff Andrew ;) )
Edited 10-09-2004 09:34 PM
Upgrade to PRO

Upload pictures, personalize your board, and more!

^     All messages            2-17 of 17  1-1 >>

Print | RSS Views: 1622 (Unique: 904 ) / Subscribers: 2 | What's this?