﻿<?xml version="1.0" encoding="UTF-8"?>
<records>
  <record>
    <language>eng</language>
    <publisher>Science and Education Publishing</publisher>
    <journalTitle>American Journal of Software Engineering</journalTitle>
    <eissn>2379-528X</eissn>
    <publicationDate>2017-09-06</publicationDate>
    <volume>5</volume>
    <issue>1</issue>
    <startPage>20</startPage>
    <endPage>26</endPage>
    <doi>10.12691/ajse-5-1-3</doi>
    <publisherRecordId>AJSE2017513</publisherRecordId>
    <documentType>article</documentType>
    <title language="eng">A Distributed Multi-facet Search Engine of Microblogs Based on SolrCloud</title>
    <authors>
      <author>
        <name>Lan Huang</name>
        <email>lanhuang@yangtzeu.edu.cn</email>
        <affiliationId>1</affiliationId>
      </author>
      <author>
        <name>Juan Zhou</name>
        <affiliationId>1</affiliationId>
      </author>
    </authors>
    <affiliationsList>
      <affiliationName affiliationId="1">College of Computer Science, Yangtze University, Jingzhou, Hubei, China</affiliationName>
    </affiliationsList>
    <abstract language="eng">Microblog services, such as Twitter and Weibo in China, has become a new yet powerful information dissemination channel. More than 500 million tweets are sent every day. The extraordinary large number of messages brings new challenges to conventional search paradigms: a message might be relevant to the query in many aspects, for example the content, time and location of a message. Furthermore, there might be a large number of such relevant messages. In order to address these challenges, we designed a multi-facet distributed microblog search system using off-the-shelf open source frameworks including SolrCloud, Hadoop and Zookeeper. The system was then populated with real world messages collected from the most popular microblog website in China: Sina Weibo. We compared the performances of the standalone and the distributed version of the system. Empirical experimental results showed both effectiveness and efficiency of the proposed system in retrieving large scale microblog messages.</abstract>
    <fullTextUrl format="pdf">http://pubs.sciepub.com/ajse/5/1/3/ajse-5-1-3.pdf</fullTextUrl>
    <keywords language="eng">
      <keyword>solr</keyword>
      <keyword>SolrCloud</keyword>
      <keyword>multi-facet retrieval</keyword>
      <keyword>information retrieval</keyword>
      <keyword>microblog</keyword>
    </keywords>
  </record>
</records>