WikiBench: A distributed, Wikipedia
based web application benchmark
Master thesis by Erik-Jan van Baaren
Student number 1278967
erikjan@gmail.com
Under the supervision of:
Guillaume Pierre
Guido Urdaneta
Vrije Universiteit Amsterdam
Department of Computer Science
May 13, 2009

Abstract
Many different, novel approaches have been taken to improve throughput and scalability of distributed web application hosting systems and relational databases. Yet there are only a limited number of web application benchmarks available. We present the design and implementation of WikiBench, a distributed web application benchmarking tool based on Wikipedia. WikiBench is a trace based benchmark, able to create realistic workloads with thousands of requests per second to any system hosting the freely available Wikipedia data and software. We obtained completely anonymized, sampled access traces from the Wikimedia Foundation, and we created software to process these traces in order to reduce the intensity of their traffic while still maintaining the most important properties, such as inter-arrival times and the distribution of page popularity. This makes WikiBench usable for both small and large scale benchmarks. Initial benchmarks show a regular day of traffic with its ups and downs. By using median response times, we are able to show the effects of increasing traffic intensities on our system under test.

Contents

1 Introduction
2 Related Work
  2.1 TPC-W
  2.2 Web Polygraph
3 System Model
  3.1 Requirements
  3.2 WikiBench design
  3.3 TraceBench Design
  3.4 WikiBench Workflow
4 Workload Creation
  4.1 Changing the Request Rate
  4.2 A Hybrid Approach
5 Benchmark Results
6 Future Work
  6.1 Scaling up traffic and Flash Crowds
  6.2 Adjust read/write ratio
  6.3 Adjust the distribution of page popularity
  6.4 Indication of realism
  6.5 More advanced edits
7 Conclusion
1 Introduction

Although originally a place designed for researchers to easily exchange information, the world wide web has become one of the most important information infrastructures of modern society. We have seen a rapid transition from static HTML documents to advanced web applications like online email, social networks and online office tools such as word processing and spreadsheets. While such web applications became more and more common over the past years, only a limited number of web application benchmarking tools emerged.

Hosting advanced web applications can require lots of resources. It is therefore important to perform research on how to improve various aspects ([11], [12], [8], [16]) of such hosting systems. Web application benchmarks are especially important when doing such research, since they provide a configurable, reproducible and often realistic simulation of real web application usage. Benchmark tools aid the systematic research into the performance of web hosting systems and make it possible to compare different systems and different system setups.

There are a number of benchmark applications that are used today, like TPC-W, RUBBoS and RUBiS. These benchmarks have similar characteristics. TPC-W simulates a web store, RUBBoS is a simple bulletin board system, and RUBiS mimics an online auction site. Although these tools have proven to be useful to many researchers, they have limits in terms of data set size, scalability and functionality. For example, all three tools run on a single system. There is no built-in way to scale up to multiple systems. Although it can be subject to heated debate, we feel that the synthetic workloads these benchmarks create are unrealistic and lack configurability. Another important downside of these benchmarks is that they generate a constant load through a fixed number of emulated browsers. These emulated browsers all wait indefinitely for a server to answer a request, while in reality a visitor is only prepared to wait for a limited amount of time, like 4 to 8 seconds [7]. In addition, the number of visitors typically varies greatly depending on the time of day, while most benchmark tools generate a constant load over the entire time period. So these tools lack realism, flexibility and configurability: qualities that we think are very important when it comes to the development and testing of advanced web hosting setups.

To address some of the shortcomings of the currently available benchmark tools, we created WikiBench. WikiBench has a number of advantages compared to the previously discussed tools. First of all, WikiBench offers a high degree of realism, since it is entirely based on the Wikipedia software and data. We have obtained access traces from the Wikimedia Foundation. These traces contain detailed traffic logs of requests made to Wikipedia by its users. We are able to convert these access traces to benchmark workloads by using our specialized TraceBench tool. TraceBench can reduce the intensity of this traffic while maintaining important traffic properties, allowing us to create very realistic benchmark workloads with intensities ranging from very low up to the original traffic intensity of the trace file.

To match the server side software with the workload files, we use the open source MediaWiki application [1], the software used to run Wikipedia. This application is quite advanced and has been tested extensively. In addition, we have used publicly available snapshots from the Wikimedia Foundation to build a mirror of the English Wikipedia site. So we now have a real world web application with a large amount of data to serve.

Since Wikipedia has a large and constantly increasing amount of data and visitors, basing a benchmark tool on this data is not an easy task. We have designed WikiBench from the ground up to be an inherently distributed application. It can scale up from one machine for small benchmarks to many machines working together in a coordinated fashion.

All put together, we believe we have created a benchmarking tool that is able to create a workload which matches reality closely. By using real world server side software and data, we think the WikiBench benchmarking suite is a very realistic and flexible research tool. Initial benchmark results show a typical day of Wikipedia traffic and the relation between the request rate and the server response times. The intensity of this traffic is reduced with our TraceBench tool to fit our system under test.

The rest of this thesis is organized as follows. In Section 2 we discuss a number of existing web application benchmarks, and in Section 3 we describe the WikiBench design in detail. In Section 4 we focus on how we create realistic workloads of arbitrary size from the trace files. Section 5 discusses our initial results and Section 6 concludes.
2 Related Work

One of the more well known and intensively used tools to benchmark application hosting systems is TPC Benchmark W (TPC-W) [13]. Another tool we will discuss, intended for benchmarking caching servers, is called Web Polygraph [10]. Although not aimed at application benchmarking, it does have a number of interesting features.

2.1 TPC-W

TPC Benchmark W (TPC-W) is a transactional web benchmark [13]. The workload is performed in a controlled internet commerce environment that simulates the activities of a business oriented transactional web server. This is done by implementing a web store and a testing tool that browses through this store. TPC-W uses the concept of Emulated Browsers (EBs). Each EB runs in a separate thread and emulates a user's browser and the user actions. The EB uses Markov chains to find random paths in the TPC-W store. An EB has a random think time that emulates the time a user takes before clicking on the next link. This think time is distributed in such a way that the average think time is 7 seconds.

Load generation in TPC-W happens in a best-effort fashion. This means that the EBs wait for each request to be processed, no matter how long this takes. The result is that this benchmark tool can only vary the load by changing the number of concurrent emulated browsers, instead of changing the request rate, which would match reality much more closely.

TPC-W offers three different modes. These modes have different ratios between the amount of information requests (page reads) and the number of orders placed by customers. In the default mode, the ratio of read and order pages is 95%/5%. In browse mode, users mostly browse around, with 98% browsing and 2% ordering. In order mode, users mostly place orders, and order pages are visited 50% of the time. Order mode puts much more pressure on the database system of a TPC-W site, since the actions associated with the ordering of products are not cacheable. So the latter mode can be used to put more pressure on the database without having to increase the web server capacity. The TPC-W benchmark has been discontinued since 2005, but it is still used for testing distributed hosting systems.

Two other commonly used benchmark tools are RUBBoS [2] and RUBiS [3]. Both have similar characteristics to TPC-W. RUBBoS is a bulletin board benchmark modeled after an online news forum like Slashdot. RUBiS is a benchmarking tool based on an online auction site like eBay. Both RUBBoS and RUBiS make use of emulated user sessions, transition tables with probabilities and wait times.

Although website traffic does often have recurring and predictable patterns, we feel that the workloads of these tools are too limited. A larger problem we see is that the emulated browsers create a very stable and predictable workload on a system. These tools, for example, do not create spikes of traffic at the page level, while in reality pages get linked on large news forums like Digg or Slashdot. We call these traffic spikes flash crowds. A single page may, for short periods of time, easily receive many times more traffic than it gets on average. For a distributed hosting system it is easy to anticipate the stable traffic pattern and page popularity distribution that Markov chains create. But in reality a distributed hosting system will have to deal with more difficult traffic patterns, like the page level flash crowds described above. Such flash crowds, even small ones, change the spatial and temporal locality of the requests, possibly overloading parts of the system if no timely measures are taken.

Another problem we see with the emulated browsers in the described benchmarking tools is that they use a fixed number of concurrent emulated browsers to put load on the system under test. So the load put on a hosting system is defined by the number of emulated browsers instead of by the request rate. If the server is overloaded, the emulated browsers will all wait longer, which at some point decreases the load put on the server. Workload and response time are therefore directly related to each other. This is, however, not representative of real world traffic, where new and current visitors will keep hammering a hosting system with new requests. In other words, these benchmarks define server load by the number of concurrent emulated browsers, instead of by the number of requests per time unit a server can actually handle before it starts to react unacceptably slowly.

All three synthetic traffic generators also lack the traffic that is generated by web crawlers, malicious users and page scrapers. Such users can, for example, have a very constant think time and different page preferences than a regular visitor. A crawler will generally visit all pages in a site, while most regular users will only visit a small subset. A malicious user might submit loads of difficult search queries or extraordinary amounts of page edits to vandalize the content. Page scrapers might only download the page and not the related items, like images and style sheets. These users exist in the real world, so a system should be able to deal with them. On a large scale hosting system like that of Wikipedia, all this contributes only slightly to the traffic, so it is not unreasonable to leave it out of a synthetic workload.

Finally, all three benchmarks are not designed to scale. One needs to start multiple machines by hand and create tools to synchronize the start of these individual machines. These individual benchmark processes will not coordinate with each other, making it more difficult to detect errors. E.g. if one fails, the other machines will keep generating load without noticing this failure.
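To make the wait-for-reply behaviour described above concrete, the sketch below emulates a single TPC-W-style browser in Python. This is not TPC-W's actual implementation; the page names, transition probabilities and function names are invented for illustration only. The browser blocks on every request no matter how long the server takes, sleeps for an exponentially distributed think time with a 7 second mean, and then takes one Markov-chain step to the next page.

    import random
    import time
    import urllib.request

    # Illustrative Markov transition table: for each page, the candidate next
    # pages and their probabilities. Pages and numbers are made up for this sketch.
    TRANSITIONS = {
        "/home":        [("/search", 0.5), ("/bestsellers", 0.3), ("/cart", 0.2)],
        "/search":      [("/product", 0.7), ("/home", 0.3)],
        "/product":     [("/cart", 0.4), ("/search", 0.6)],
        "/bestsellers": [("/product", 0.8), ("/home", 0.2)],
        "/cart":        [("/order", 0.05), ("/home", 0.95)],
        "/order":       [("/home", 1.0)],
    }

    def next_page(current):
        # One Markov-chain step over the transition table.
        pages, weights = zip(*TRANSITIONS[current])
        return random.choices(pages, weights=weights)[0]

    def emulated_browser(base_url, duration_s, mean_think_s=7.0):
        # Closed-loop emulated browser: issue a request, wait for the answer no
        # matter how long it takes, think for an exponentially distributed time
        # (7 s mean on average), then follow a link chosen by the Markov chain.
        page = "/home"
        deadline = time.time() + duration_s
        while time.time() < deadline:
            urllib.request.urlopen(base_url + page).read()      # blocks until answered
            time.sleep(random.expovariate(1.0 / mean_think_s))  # average think time of 7 s
            page = next_page(page)

    # Example: one emulated browser exercising a (hypothetical) local store for a minute.
    # emulated_browser("http://localhost:8080", duration_s=60)

Because each emulated browser only issues a new request after the previous one has been answered and the think time has passed, the offered load drops automatically when the server slows down, which is exactly the limitation discussed above.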
2.2 Web Polygraph

Web Polygraph [10] is a tool for benchmarking HTTP intermediaries like proxy servers and other web caching products. Even though this benchmark tests HTTP intermediaries instead of HTTP servers, Web Polygraph is still interesting enough to mention, since its authors have taken great care to create realistic workloads. The Web Polygraph benchmark is based entirely on synthetic traffic workloads, which are created in conjunction with industry and various research groups. The creators of Web Polygraph have tried to create realistic workloads by analyzing and using many characteristics of real web traffic, like file type and size distribution and request inter-arrival times. From their research, it becomes clear that the creation of synthetic, realistic workloads is a research field in itself.

The Web Polygraph global architecture consists of virtual clients and servers. Clients request simulated objects from these servers. The HTTP intermediaries are placed in between. Servers and clients are glued together by using configuration files that are shared between the two.

The choice for synthetic workloads is made because, for this tool's purpose, real access traces would need to be modified heavily. For example, the benchmark allows changes to parameters such as the request rate, cache hit ratios, and file size and popularity distribution. The creators argue that using different (real world) traces to get different types of workloads will change many parameters at once instead of just one of the parameters. This makes analyzing performance comparisons more difficult, since different workloads are not as closely related to each other as is possible with synthetically created workloads. It is also argued that many tests do not correspond to any real trace, so using real traces would make no sense. Furthermore, using real traces for this benchmark would mean that the benchmark needs to have detailed knowledge of millions of objects. Instead, the authors have chosen to embed information about the objects requested from the server in the URLs of those objects. For example, an object type id embedded in the URL identifies properties of the requested object, like its file size and file type.

To vary the workload, the authors created a model that specifies a mean inter-arrival time for a Poisson stream of requests. The robots, which are similar to the previously mentioned emulated browsers, multiply this mean by the current load factor to get the correct inter-arrival time at any moment.
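The following Python sketch illustrates this inter-arrival mechanism under stated assumptions: the gaps of a Poisson stream are exponentially distributed, and the configured mean gap is multiplied by the current load factor as described above, so a factor below 1.0 shortens the gaps and raises the request rate. Web Polygraph's real implementation and parameter names may differ.

    import random

    def poisson_gaps(mean_gap_s, load_factor, n):
        # Inter-arrival gaps of a Poisson request stream are exponentially
        # distributed. The configured mean gap is scaled by the current load
        # factor; a factor below 1.0 therefore produces shorter gaps and a
        # higher request rate.
        effective_mean = mean_gap_s * load_factor
        return [random.expovariate(1.0 / effective_mean) for _ in range(n)]

    # Example with illustrative numbers: a 0.1 s mean gap (about 10 requests per
    # second) combined with a load factor of 0.5 roughly doubles the request rate.
    gaps = poisson_gaps(mean_gap_s=0.1, load_factor=0.5, n=1000)
    print(sum(gaps) / len(gaps))  # close to 0.05 s on average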
Unfortunately, this benchmark tool is of little use when it comes to testing systems that host web applications. Because it is aimed at benchmarking caching servers, the server side needs to host a very specific web application and web server that generates data files of different size and type based on the file name. These data files contain random data. The benchmarking application requests files of certain sizes in a certain distribution by adjusting the requested file names. Unfortunately, the web application is not representative of a typical web application. E.g. it does not serve web pages that link to each other. In addition, it does not depend on a database, making it even less realistic since most advanced web applications are very data intensive.
3 System Model

WikiBench consists of a collection of tools. The main tools are:

- TraceBench: a tool to process trace files for use with WikiBench
- WikiBench: the benchmark application itself, which can be divided into a controller and worker nodes
- Post-processing tools: a set of scripts to process log files produced by WikiBench and create graphics from the results

WikiBench is in the first place created as a research tool. Our goal is to create a realistic benchmark with adaptable traffic properties. We therefore benchmark a real world software application that is used extensively. This software application is MediaWiki, the software that is used by Wikipedia. Wikipedia is a collaborative, multi-language online encyclopedia. Wikipedia is based on wiki technology for storing and giving structure to information. The MediaWiki application represents the realities we are faced with today when designing software for a large scale website with many users. This in turn means it will put realistic, high demands on a hosting system that is used to host this web application. An added advantage of using Wikipedia is that the millions of Wikipedia pages are freely available in the form of snapshots [4]. So using MediaWiki in conjunction with Wikipedia snapshots allows us to benchmark a real world application with real world data.

As was noted by other authors, like those of Web Polygraph [10], creating useful workloads with a benchmarking tool is a research field in itself. We are very grateful to have real access traces from Wikipedia. These traces, obtained directly from the Wikimedia Foundation, are completely anonymized. For each request, we only have a unix timestamp, the complete URL that was accessed and a 'save' flag that is set to true only if the request resulted in a page change or page creation. No more personal information than is available on the publicly accessible Wikipedia website can be deduced from these traces. E.g. we do not have access to IP addresses, cookies, user account information or the contents of a POST request. On top of that, each request only has a 10% chance of being included in the traces we get. Previous research [15] has shown that these traces contain a representative sample of the workload Wikipedia gets. We therefore used these traces to create workloads for WikiBench instead of creating purely synthetic workloads like other benchmarking tools have done. This adds realism to the benchmark, since the traces we use contain all the characteristics that the real Wikipedia site has to deal with.
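As an illustration, the sketch below parses one such trace record in Python. It assumes a whitespace-separated layout of timestamp, URL and save flag; the exact field order and flag encoding of the real Wikimedia traces may differ, and the example line is invented rather than taken from the traces.

    from collections import namedtuple

    TraceRequest = namedtuple("TraceRequest", "timestamp url is_save")

    def parse_trace_line(line):
        # Assumed layout: <unix timestamp> <complete URL> <save flag>, separated
        # by whitespace. The save flag is taken to be the literal string "save"
        # when the request changed or created a page; anything else is treated
        # as a plain page read. The real trace layout may differ.
        timestamp_str, url, flag = line.split()[:3]
        return TraceRequest(float(timestamp_str), url, flag.lower() == "save")

    # Purely illustrative line, not taken from the real traces:
    req = parse_trace_line("1190146243.324 http://en.wikipedia.org/wiki/Main_Page -")
    print(req.is_save)  # False: this sampled request was a page read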
In this thesis we define the system under test (SUT) as the hosting system that hosts the Wikipedia data. We approach this SUT as a black box, with a URL that serves as our entry point. It is out of the scope of this thesis to define detailed requirements on a hosting system. In principle, the hosting system can apply any technique that one can think of to improve overall system performance. The only "common-sense" requirement we put on the system is that it is accessible through HTTP, exactly like the MediaWiki software, and that it gives the same responses in identical situations. If the SUT were run side by side with a vanilla MediaWiki installation, we would expect the exact same replies when doing identical GET and POST requests on both servers. There is one exception to this rule: we removed a check for duplicate edits and CSRF attacks ([5], [17]) from MediaWiki. We worked around this check by commenting out a single line of MediaWiki code. Without this check, we can post anything we want to a wiki page with a single HTTP POST request. If we were to leave this security check in the code, we would need to first request the edit page, parse that page to obtain a number of hidden form elements, and then perform the POST request. The hidden field, a secret token that is user specific, is used to prevent CSRF attacks, in which a logged on user might POST data to Wikipedia without knowing it while he is visiting a malicious website. Because a malicious site cannot guess or calculate this secret token, it can never perform a valid POST request without the user noticing it. Leaving the check in place would also increase the chance of an edit conflict. While requesting and parsing the edit page, a time window is introduced in which another thread might request that same page. Because MediaWiki uses timestamps to check for edit conflicts, we would then have to solve an edit conflict between two threads, introducing even more complexity.

3.1 Requirements

Wikipedia has a number of characteristics that make it ideal to be used in a distributed web application benchmark. First of all, there is lots of data. Just the English wiki, without user and talk pages, contains more than 7 million articles in the snapshot taken on October 7, 2008. This can be hosted on one single server, but that server will only be able to sustain fractions of Wikipedia's actual traffic. Wikipedia receives high numbers of requests per second. As of this writing, peaks of 50,000 to 60,000 requests per second are no exception. So, even with our 10% sample of the access traces, we can generate high loads of up to about 5,000 requests per second. The requirements we put on WikiBench obviously need to match these numbers. This subsection describes the main goals and requirements we have put on