Horn Christopher, Pimas Oliver, Granitzer Michael, Lex Elisabeth
2011
In this paper, we outline our experiments carried out at theTREC Microblog Track 2011. Our system is based on a plain text indexextracted from Tweets crawled from twitter.com. This index hasbeen used to retrieve candidate Tweets for the given topics. The resultingTweets were post-processed and then analyzed using three differentapproaches: (i) a burst detection approach, (ii) a hashtag analysis, and(iii) a Retweet analysis. Our experiments consisted of four runs: Firstly,a combination of the Lucene ranking with the burst detection, and secondly,a combination of the Lucene ranking, the burst detection, and thehashtag analysis. Thirdly, a combination of the Lucene ranking, the burstdetection, the hashtag analysis, and the Retweet analysis, and fourthly,again a combination of the Lucene ranking with the burst detection butin this case with more sophisticated query language and post-processing.We achieved the best MAP values overall in the fourth run.