Welcome to the OR-Exchange, your site for questions and answers in operations research.

vote up 1 vote down
star

I am looking for a group-ranking algorithm (similar to Google's PageRank algorithm) that can help me solve the following problem:

In a large sparse graph every node can rate other nodes, an arc from node i to node j shows that i has rated j; and the arc-weight,r, which is a number in [0,1] shows how high the rating is. The graph is large (number of vertices>5000) and sparse (average number of adjacent arcs to each node is <5) many nodes do not have any adjacent arc coming in or going out and many only have incoming arcs, some only have outgoing arcs. Few have both incoming and outgoing arcs.

The question is how can we assign a ranking to each node such that the ranking is robust (a small group of nodes cannot push up their own ranking by down voting others)

I appreciate if you could please point out one or two valuable papers related to this question.

Note: If sparsity makes it difficult (or impossible) you can assume that the graph is dense.

Update It seems my problem definition was confusing. Here are some more explanations: Let's assume we have a web graph (although my problem is not related to the web at all). each page links to other pages. some pages are orphans (no incoming arc). But I want to rank the pages. Google does the same thing with their PageRank algorithm. Let's assume that the resulting Markov chain is irreducible and positive recurrent (it makes the problem much easier and better defined). The problem here is that each state rates only a subset of the state space.

The following are a number of related articles that I believe are very close:

  1. Methodologies and Algorithms for Group-Rankings Decision
  2. Country credit-risk rating aggregation via the separation-deviation model
  3. HITS algorithm via @hakankj

I am looking for a public domain algorithm similar to HITS

flag

2 Answers

vote up 1 vote down

Unfortunately I don't know any public domain versions of HITS.

Also, I don't know the copyright status of the SALSA method, but here is a comparison of HITS, PageRank and SALSA: Authority Rankings from HITS, PageRank, and SALSA: Existence, Uniqueness, and effect of Initializations. Another comparison is Najork's: Comparing the Effectiveness of HITS and SALSA.

This may be an impertinent suggestion, but have you tried a simple eigenvector analysis? A very simple example is Hilbert Wilf's ranking of football teams in his short paper Searching the web with eigenvectors. Anyhow, I like this paper.

Later edit: In case someone wonder, I am the Twitter hakankj that Mark linked to in his question.

link|flag
vote up 0 vote down

Don't know any references, but I'm wondering about the problem definition. Are you assuming that i can rank j only if the arc (i,j) exists? Or does existence of (i,j) mean i chose to rank j, while absence of (i,k) can either mean i could not rank k or i chose not to rank k?

link|flag
Hi sorry if the definition was confusing. I added a number of assumptions (lets assume the irreducibility in the MC) I am basically looking for some ideas along the lines of the PageRank algorithm. I have added a number of references that I have found so far. Thanks again – Mark Dec 28 at 21:13

Your Answer

Not the answer you're looking for? Browse other questions tagged or ask your own question.