Benchmark Estimation for Markov Chain Monte Carlo Samples

Phoem

Le téléchargement nécessite un accès à la bibliothèque YouScribe
Tout savoir sur nos offres

28 pages

English

Le téléchargement nécessite un accès à la bibliothèque YouScribe
Tout savoir sur nos offres

A propos
Informations
Extrait

Description

Benchmark Estimation for Markov Chain Monte Carlo Samples
Subharup Guha, Steven N. MacEachern and Mario Peruggia
guha.3@osu.edu
May 2002; revised November 2002
Abstract
While studying various features of the posterior distribution of a vector-valued parameter using
an MCMC sample, a subsample is often all that is available for analysis. The goal of benchmark
estimation is to use the best available information, i.e. the full MCMC sample, to improve future
estimates made on the basis of the subsample. We discuss a simple approach to do this and provide
a theoretical basis for the method. The methodology and beneﬂts of benchmark estimation are
illustrated using a well-known example from the literature. We obtain as much as an 80% reduction
in MSE with the technique based on a 1-in-10 subsample and show that greater beneﬂts accrue with
the thinner subsamples that are often used in practice.
1 Introduction
While using an MCMC sample to investigate the posterior distribution of a vector-valued parameter ,
many features of interest have the representation E[g( )] for some function g( ). A subsample of
the MCMC output is often all that is retained for further investigation of the posterior distribution.
Subsamplingisoftennecessaryincomputationallyintensiveorreal-time,interactiveinvestigationswhere
speed is essential. Examples include expensive plot processing and examination of changes in the prior
(sensitivity analysis), likelihood (robustness) or data (case in uence). ...

Sujets

Linternaute.com

Kilo

Réduction d'endomorphisme

Montagnes russes

Théorème de Rice

Involute

Informations

Publié par	Phoem
Nombre de lectures	42
Langue	English

Extrait

BenchmarkEstimationforMarkovChainMonteCarloSamplesSubharupGuha,StevenN.MacEachernandMarioPeruggiaguha.3@osu.eduMay2002;revisedNovember2002AbstractWhilestudyingvariousfeaturesoftheposteriordistributionofavector-valuedparameterusinganMCMCsample,asubsampleisoftenallthatisavailableforanalysis.Thegoalofbenchmarkestimationistousethebestavailableinformation,i.e.thefullMCMCsample,toimprovefutureestimatesmadeonthebasisofthesubsample.Wediscussasimpleapproachtodothisandprovideatheoreticalbasisforthemethod.Themethodologyandbeneﬁtsofbenchmarkestimationareillustratedusingawell-knownexamplefromtheliterature.Weobtainasmuchasan80%reductioninMSEwiththetechniquebasedona1-in-10subsampleandshowthatgreaterbeneﬁtsaccruewiththethinnersubsamplesthatareoftenusedinpractice.1IntroductionWhileusinganMCMCsampletoinvestigatetheposteriordistributionofavector-valuedparameterθ,manyfeaturesofinteresthavetherepresentationE[g(θ)]forsomefunctiong(θ).AsubsampleoftheMCMCoutputisoftenallthatisretainedforfurtherinvestigationoftheposteriordistribution.Subsamplingisoftennecessaryincomputationallyintensiveorreal-time,interactiveinvestigationswherespeedisessential.Examplesincludeexpensiveplotprocessingandexaminationofchangesintheprior(sensitivityanalysis),likelihood(robustness)ordata(caseinﬂuence).Typically,suchstudieswouldincludehundredsorthousandsofchangestothemodel.Anotherreasonforsubsamplingisexpensive1

storagespace.Practicalconstraints,likelimiteddiskspaceavailabletousersofsharedcomputingresources,oftenmakeitinfeasibletostoretheentiresampleofMCMCdrawswhentheparameterhasalargenumberofcomponents.Asubsampleisthenretainedforfutureinvestigationoftheposteriordistribution.Geyer(1992)andMacEachernandBerliner(1994)showthatsubsamplingtheoutputoftheMarkovchaininasystematicfashioncanonlyleadtopoorerestimationofE[g(θ)].ThegoalofbenchmarkestimationistoproduceanumberofestimatesbasedontheentireMCMCsample,andtothenusethesetoimproveotherestimatesmadeonthebasisofthesubsample.Thebenchmarkestimatesmustbequickandeasytocompute.Theymustalsobecompatiblewithquickcomputationsforfurther,more(computationally)expensiveanalysesbasedontheeventualsubsample.Severalmotivatingperspectivesareusefultounderstandandinvestigatevariousaspectsofbench-markestimation.Thepointofviewofcalibrationestimation,developedinthesamplingliteraturetoimprovesurveyestimates(DevilleandSa¨rndal,1992;Vanderhoeft,2001),helpstobringalltheseperspectivestogetherintoauniﬁedframework.Incalibrationestimation,aprobabilitysamplefromaﬁnitepopulationisusedtocomputeestimatesofpopulationquantitiesofinterest.The(regressiontype)estimatorsarebuiltasweightedaveragesoftheobservationsinthesample,withtheweightsde-terminedsoastosatisfya(vector-valued)calibrationequationwhichforcestheresultingestimatorstoproduceexactestimatesofknownpopulationfeatures.Usually,theconstraintsimposedbythecalibra-tionequationdonotdetermineauniquesetofweights.Thus,amongthesetsofweightssatisfyingthecalibrationequation,onechoosesthesetthatyieldsweightsthatareascloseaspossible(withrespecttosomedistancemetric)toaﬁxedsetofprespeciﬁed(typicallyuniform)weights.TocastMCMCbenchmarkestimationintothetheframeworkofcalibrationestimation,weregardtheMCMCoutputasaﬁnitepopulationanda1-in-ksystematicsubsampleasaprobabilitysampledrawnfromtheﬁnitepopulation.Thissystematicsamplingdesigngiveseachunitinthepopulationaprobability1/kofbeingselected,thoughmanyjointinclusionprobabilitiesare0.Inthissetting,the2

(vector-valued)benchmarkE[h(θ)],forwhichthesubsampleestimateisforcedtomatchthefullsampleestimate,correspondstotheauxiliaryinformationavailablethroughthecalibrationequation.Oncethecalibrationweightshavebeencalculated,theycanthenbeusedtocomputethecalibrationsubsampleestimateofanyfeatureE[g(θ)].AsthefullMCMCsamplesizeincreases,theasymptoticperformanceofthesebenchmarkestimatorsmatchesthatofthecorrespondingcalibrationestimators.Thebenchmarkestimatorsthatweintroduceinthispapercanbeshowntobecalibrationestimatorscorrespondingtoappropriatelychosencalibrationequationsandmetrics.Weinvestigatetwomethodsofcreatingweights:post-stratiﬁcationandmaximumentropy.Intheirsimplestform,post-stratiﬁcationweightsarederivedbypartitioningtheparameterspaceintoaﬁnitenumberofregionsandbyforcingtheweightedsubsamplefrequenciesofeachregiontomatchthecorrespondingrawfrequenciesfortheentireMCMCsample.Theweightsaretakentobeconstantoverthevariouselementsofthepartitionandtosumtoone.Animprovedversionofpost-stratiﬁcation,(and,infact,theapproachthatinourexperiencehasgeneratedthemostsuccessfulestimators)beginswitharepresentationofanarbitraryfunctiong(θ)as∞acountablelinearcombinationofbasisfunctionshj(θ):g(θ)=j=1cjhj(θ).Theestimand,E[g(θ)],isPexpressedasthesamelinearcombinationofintegralsofthebasisfunctions,j∞=1cjE[hj(θ)].SplittingPtheinﬁniteseriesrepresentationofg(θ)intotwoparts,wehaveaﬁniteserieswhichmayprovideagoodapproximationtog(θ)andaninﬁniteremainderseriesthatﬁllsouttherepresentationofg(θ).Focusingontheﬁniteseries,wedeterminetheweightsbyforcingestimatesofE[hj(θ)]basedonthesubsampletomatchthosebasedonthefullsample.Inaddition,werequiretheweightstobeconstantovertheelementsofasuitablychosenpartitionoftheparameterspaceandtosumtoone.ThisproducesabetterestimateofE[jm=1cjhj(θ)]thanonebasedonthesubsamplealone.TheimprovementcarriesPovertoestimationofE[g(θ)]whenthetailoftheseriesisofminorimportance.Werefertotheﬁnitesetofbasisfunctionsasthe(vector-valued)benchmarkfunction.Inboththebasicandimprovedpost-stratiﬁcationapproaches,wespecifyenoughconditionsthat3

(forvirtuallyallMCMCsamplesofreasonablesize)thereisauniquesetofweightsthatwouldsatisfythem.Thus,fromthepointofviewofcalibrationestimation,thechoiceofthedistancemetricbecomesimmaterial,inthesensethatanymetricwouldyieldidenticalweights.Inthisrespect,ourpost-stratiﬁcationweightsarisefromadegenerateinstanceofaproblemofcalibrationestimation.Inthecaseofthemaximumentropyweights,however,wedonotspecifyenoughbenchmarkconditionstomaketheweightsunique.Rather,amongthesetsofweightssatisfyinganunder-determinednumberofbenchmarkconditions,weselectthesethavingmaximumentropyandthis,fromthepointofviewofcalibrationestimation,istantamounttochoosingaspeciﬁcdistancemetric.Benchmarkestimationyieldsimprovedweightedestimators(havingsmallervariances)basedonMCMCsubsamples.Weinvestigatetheperformanceofestimatorsbasedonweightsdeterminedaccord-ingtopost-stratiﬁcationandmaximumentropymethods.Substantialreductionsinthevariabilityoftheestimatesoccurwhenexaminingexpectationsoffunctionsg(θ)thataresimilartoalinearcombi-nationofbenchmarkcomponents.Theoreticalresultssuggestthegainsthatweseeinpractice.ThemethodologyisillustratedonanexamplediscussedbyGeorge,MakovandSmith(1993).WecompareestimationofE[g(θ)]foravarietyoffunctionsg(θ),showingthatthereareoftensubstantialbeneﬁtstotheuseofbenchmarkestimates.TheextentoftheimprovementinestimationofE[g(θ)]forfunctionsthatarenoticeablydiﬀerentfromthebenchmarksisstriking,evenforvaluesofmassmallas3or4.2Asimpleapproachtobenchmarkestimation2.1AnimprovedsubsampleestimatorLetθ∈Θbeavector-valuedparameter.ImaginethatanMCMCsampleisdrawnfromtheposteriordistributionofθ.Callthesequenceofdrawsθ(1),θ(2),...,θ(N).Thedrawsareusedtoestimatesomefeatureoftheposteriordistribution.OftenthesefeaturesofinterestcanberepresentedasE[g(θ)]for4

g(θ(ki)),2()some(possiblyvector-valued)functiong(θ).ThemoststraightforwardestimatorforE[g(θ)]isNEˆ[g(θ)]f=1g(θ(i)),(1)NX1=iwheretherighthandsubscriptdenotesthefullsampleestimator.Ifoneselectsasystematic1-in-ksubsampleofthedata,thenaturalestimatorisn1Eˆ[g(θ)]s=nX1=iwhereN=knandsdenotesthesubsampleestimator.Asmentionedearlier,thisformofsubsamplingalwaysleadstopoorerestimation;theunweightedsubsampleestimator(2)hasalargervariancethanthefullsampleestimator(1).WewishtousetheinformationavailablefromthefullsampletoimprovefutureestimationbasedonthesubsampleforanyfeatureE[g(θ)].Foranappropriatelychosen(andpossiblyvector-valued)functionh(θ),werefertothefeatureE[h(θ)]asthebenchmark.WenowcreateaweightedversionofthesubsampleestimatorofE[g(θ)]asfollows:nEˆ[g(θ)]w=wig(θ(ki)),(3)X1=iwherein=1wi=1.TheweightswiarechosensothattheyforcetheweightedsubsamplebenchmarkPestimatetoequalthefullsampleestimate:Eˆ[h(θ)]w=Eˆ[h(θ)]f.)4(ThusEˆ[h(θ)]wandEˆ[h(θ)]fhavethesamedistributionsprovidedtheweightscanbeconstructed,andallfeaturesoftheirdistributionsconditionalonthiseventarethesame.Foravector-valuedbenchmarkfunction,anylinearcombinationofitscoordinatesresultsinthesameestimateforboththesubsampleandthefullsample,andtheestimatorshavethesamedistribution.Inparticular,thetwoestimatorshavethesamevariance,andwehavepossiblygreatlyincreasedprecisionforoursubsampleestimatorofE[g(θ)].5

Theconnectionbetweenaconditionallyconjugatestructureandlinearposteriorexpectationinex-ponentialfamiliesimpliesthat,formanypopularmodels,quantitiessuchastheconditionalposteriormeanforacaseortheconditionalposteriorpredictivemeanwillbealinearfunctionofhyperparam-eters.Thestructureofthehierarchicalmodelenablesustousebenchmarkfunctionsbasedonthehyperparameterstocreatemoreaccurateestimatesofthesequantities.ThereductioninvariabilitywhenmovingfromEˆ[h(θ)]stoEˆ[h(θ)]walsoappearswhenexaminingexpectationsoffunctionsg(θ)thataresimilartoh(θ).Functionssuchasapredictivevariancewhichdependonﬁrstandsecondmomentswilltypicallybecloselyrelatedtobenchmarkfunctionsbasedonthehyperparametersandsotheywillbemoreaccuratelyestimatedwithourtechnique.Theweightedsubsample,(wi,θ(ki))fori=1,2,...,n,isnowusedinplaceoftheunweightedsubsample.Theweightsactexactlyastheywouldifarisingfromanimportancesample,andsoweobtainweightedsubsampleestimatesEˆ[g(θ)]wforvariousfeaturesofinterestE[g(θ)]oftheposterior.Techniquesandsoftwaredevelopedforimportancesamplescanbeusedwithoutmodiﬁcationfortheseweightedsamples.2.2SomemethodsofobtainingweightsnTheconstraintsthati=1wi=1andthatEˆ[h(θ)]w=Eˆ[h(θ)]fwillnottypicallydeterminethewi.PWithasinglerealbenchmarkfunction,wewouldhaveonlytwolinearconstraintsonthewi.Wesupplementtheconstraintswithaprinciplethatwillyieldauniquesetofweights.Thetwoprinciplesweinvestigatearemotivatedbytheliteraturesonsurveysamplingandinformationtheory.Weightsbypost-stratiﬁcation.Post-stratiﬁcationisastandardtechniqueinsurveysampling,designedtoensurethatasamplematchescertaincharacteristicsofapopulation.Thepopulationcharacteristicsarematchedbycomputingaweightforeachunitinthesample.Largesampleresultsshowthatapost-stratiﬁedsamplebehavesalmostexactlylikeaproportionallyallocatedstratiﬁedsample.Thistypeofstratiﬁcationreducesthevarianceofestimatesascomparedtoasimplerandom6

sample.Inthissetting,thefullsampleplaystheroleofthepopulationwhilethesubsampleplaystheroleofthesample.Thus,theessenceofthetechniqueistopartitiontheparameterspaceinto(say)mstrata,andtoassignthesameweighttoeachθ(ki)inastratum.Formally,supposethat{Θj}jm=1isaﬁnitepartitionoftheparameterspaceΘ.LetIj(θ)denotetheindicatorofsetΘj,forj=1,...,m.Thenaturalapplicationofthepost-stratiﬁcationmethodtakesasthebenchmarkfunctionthevectorofthesemindicatorfunctions.Thatis,h(θ)=(I1(θ),I2(θ),...,Im(θ))0.Weassignthesameweighttoallsubsamplepointsbelongingtoagivenstratum.Speciﬁcally,forallisuchthatθ(ki)∈Θj,wesetwi=vj,where,accordingto(4),thevaluesvjaredeterminedbyNn1vjIj(θ(ki))=Ij(θ(i)),wherej=1,...,m.NXXi=1i=1Thepost-stratiﬁcationweightsarethenobtainedas:N−1iN=1Ij(θ(i))Pj1=ivj=PnI(θ(ki)),(5)providedeachofthestratacontainsatleastonesubsamplepoint.Asinsurveysampling,withfairlywellchosenstrata,thechancethatanyofthestrataareemptyofsubsamplepointsisnegligible.Theintuitivedescriptionofthepost-stratiﬁcationweightvjisastheratiooftheproportionoffullsamplepointsinΘjtothenumberofsubsamplepoints.Werefertothissubsampleestimatorasthebasicpost-stratiﬁcationestimator,Eˆ[g(θ)]w,ps.Theperspectiveofabasisexpansionofg(θ)providesamoresophisticateduseofpost-stratiﬁcation.Insteadofusingabasisformedfromindicatorfunctions(essentiallyaHaarbasis),alternativebasesconsistoffunctionsotherthanindicators.Anattractivebasis,duetoitssuccessthroughoutstatistics,isthepolynomialbasisthatgeneratesTaylorseries.Assigningequalweighttosubsamplepointswithineachgivenpost-stratumyieldsn−mlinearconstraintsontheweights,andforcingtheweightstosumto1providesoneadditionalconstraint.Supplementingthesewithafurtherm−1linearconstraints7

ontheweights(andalsowithmildconditionsontheposteriordistributionandsimulationmethodtoguaranteeuniqueness)deﬁnestheweights.Theweightsarequicklyobtainedasasolutiontoasystemofmlinearequations.Thisversionofpost-stratiﬁcationhasproventobeextremelyeﬀectiveinpractice.Asanexampleofthistechnique,supposethatθisap-dimensionalvectorvaluedparameterandthattheparameterspaceΘispartitionedintom=3strata,namelyΘ1,Θ2andΘ3.Post-stratiﬁcationassignsthesameweightvjtoallsubsamplepointsbelongingtostratumΘj,wherej=1,2,3.Conditionalontheeventthatnostratumisemptyofsubsamplepoints,thiscorrespondston−3linearlyindependentconstraintsontheweights.Anadditionalconstraintontheweightsisthattheysumto1.Thustwootherindependentlinearconstraintswillensurethattheweightsareunique.Forachoiceoftworealfunctions,sayh1(θ)andh2(θ),ascomponentsofthebenchmarkfunction,thebenchmarkestimatesaregivenbyEˆ[h1(θ)]f=(1/N)iN=1h1(θ(ki)),andEˆ[h2(θ)]f=(1/N)iN=1h2(θ(ki)).ForPPj=1,2,3andl=1,2,letnj=in=1Ij(θ(ki))denotethenumberofsubsamplepointsfallinginstratumPΘj,andletbljdenotethesumofthefunctionhl(θ)evaluatedatthesubsamplepointsbelongingtostratumΘj.Thusnblj=hl(θ(ki))Ij(θ(ki)).X1=iThen,solvingthefollowingsystemoflinearequationsuniquelydeterminestheweights(v1,v2,v3)forthethreestrata,providedthesquarematrixbelowisinvertible:n1n2n3v11b11b12b13v2=Eˆ[h1(θ)]f.b21b22b23v3Eˆ[h2(θ)]fMaximumentropyweights.Informationtheorydescribes,invariousfashions,theamountofinformationindataaboutaparameterordistribution.InaBayesiancontext,itisoftenusedtodescribesubjectiveinformation(playingtheroleofdata)inordertoelicitapriordistribution.Thisisaccomplishedbyspecifyinganumberoffeaturesofthedistribution,typicallyexpectations,asthe“information”abouttheprior.Theprioristhenchosentoreﬂectthisinformationbutnomore.With8

entropydeﬁnedasthenegativeofinformation,thepriorwhichreﬂectsexactlythedesiredinformationisthatwhichmaximizesentropyamongthosepriorsmatchingtheconstraints.Inoursetting,weborrowthistechnique,matchingexactlytheinformationinthefullsamplebench-markestimates,butnomore.Letw=(w1,w2,...,wn)bethen-tupleofweightsgivenin(3).LetusdenotebyΩthe(possiblyempty)setofallweightn-tuplesthatsatisfy(4).ThusΩistheset{w|wi≥0∀i,in=1wi=1,in=1wih(θ(ki))=Eˆ[h(θ)]f}.PPDeﬁnition2.1Theentropyofann-tuplewbelongingtosetΩisdeﬁnedasnEn(w)=−wilnwi,X1=isubjecttotheconventionthat0ln(0)equals0.WeobservethatforallwbelongingtoΩ,En(w)≤Enn,n,...,n=ln(n).SinceΩisclosed,¡¡111¢¢thereexistsanelementw∗ofΩsuchthatEn(w∗)=supw∈ΩEn(w).Theseweightsw∗arecalledmaximumentropyweights,andtheyexistwheneverΩisnon-empty.Findingmaximumentropyweightsw∗isthusequivalenttomaximizingEn(w)subjecttothecon-straintswi≥0fori=1,...,n,in=1wi=1,andin=1wih(θ(ki))=Eˆ[h(θ)]f.ForarealbenchmarkPPfunctionh(θ),itcanbeshownthatthemaximumentropyweightsw∗areuniquewhenevertheyexist.Formostsubsamplesofreasonablesizetheyaregivenbywi∗=eλ1+λ2h(θ(ki)),i=1,2,...n;)6(whereλ2∈Rsatisﬁestheequationh(θ(ki))−Eˆ[h(θ)]fexpλ2h(θ(ki))−Eˆ[h(θ)]f=0,(7)Xn³´³³´´1=i´³andλ1=−lnPin=1eλ2h(θ(ki)).Thefewsubsampleswheretheweightsfailtoexistwillhaveeitherh(θ(ki))<Eˆ[h(θ)]fforalli,orh(θ(ki))>Eˆ[h(θ)]fforalli.Forallothersubsamples,equation(7)hasauniquerootbecausethelefthandsideoftheequationincreasesmonotonicallyfrom−∞to+∞as9

λ2increases.Therootcanbeobtainedbynumericalmethods,andhencewi∗canbecalculatedveryquicklyusing(6).Whenh(θ)isanindicatorfunctionofasubsetoftheparameterspaceΘ,theanswerobtainedusing(6)matchesthepost-stratiﬁcationweightsgivenin(5).Thus,thetwoapproacheswillsometimesyieldthesameresult.3TheoreticalResultsThemotivationforsubsamplingtheoutputofaMarkovchaincarriesovertothestudyofasymptoticpropertiesoftheestimators.Aﬁrstasymptoticismotivatedbythecommonpracticeofusingprelim-inaryrunsoftheMarkovchaintoassessthedependenceandconvergencepropertiesofthechain,andthenselectingasubsamplingratefortherunofthechainusedforestimation.Thispracticeleadsustoconsidertheasymptoticwherekisheldﬁxedandngrows(CaseA).Thestrongestmotivationforsub-samplingisthatfurtheruseofthesubsampleinvolvesexpensive(slow)processing.Whenthisfurtherprocessingisdoneinrealtime,aswithinvestigationofchangesinthepriordistributionorlikelihood,itisessentialtolimitthenumberofpointsthatarerepeatedlyprocessed.Pursuingthismotivationforsubsampling,anaturalasymptoticholdsthenumberofsubsampledpoints,n,ﬁxedwhilelettingtheintervalbetweensubsampledpoints,k,tendtoinﬁnity(CaseB).Inthelimit,thesesubsampledpointswilllooklikearandomsamplefromπ.Wenotethatasymptoticsbetweenthesetwo,wherebothnandkgrow(CaseC),arealsonaturalcandidatesfortheoreticalexploration,andareusefulforﬁllingouttherangeofasymptoticexpressionsusefulforassessingtheaccuracyofthebenchmarkestimators.Tierney(1994)containsacollectionofusefulresultsfortheasymptoticsofestimatorsbasedonoutputfromaMarkovchain.Thetwoessentialtypesofresultareergodictheoremswhichguaranteestrongconvergenceofanempiricalaverage(orfull-sampleestimator)toacorrespondingexpectationunderthelimitingdistribution,andcentrallimittheoremswhichdescribeweakconvergenceofanappropriatelycenteredandscaledfull-sampleestimatortoanormaldistribution.Werelyheavilyon01

hisresultstoshowasymptoticnormalityofoursubsampledestimators.CaseA.Weﬁrstconsidertheasymptoticwherekisheldﬁxedandntendsto∞.TheergodictheoreminTierney’spaperappliedtothesub-sampledchain(whichisitselfMarkovian)allowsustoconcludethat,asntendsto∞,Eˆ[g(θ)]stendstoE[g(θ)]almostsurely.Thenexttheoremestablishestheasymptoticnormalityofthebasicpost-stratiﬁcationestimatorforexpectationsofboundedfunctionsg(θ)andgeometricallyergodicMarkovchains.Theorem3.1SupposethattheMarkovchainisgeometricallyergodicwithinvariantdistributionπ.Ifthefunctiong(θ)isboundedandnotalinearcombinationofthestrataindicatorsIj(θ),j=1,...,m,thenthereexistsarealnumberσ(g)suchthat,asn→∞,thedistributionof√nEˆ[g(θ)]w,ps−E[g(θ)]´³convergesweaklytoanormaldistributionwithmean0andvarianceσ2(g).Proof.TheresultfollowsfromaveriﬁcationoftheconditionsofTheorem4ofTierney’spaperandbyanapplicationofthedeltamethod.SeeAppendixAforamoredetailedproof.UnderthestrongerassumptionofuniformergodicityoftheMarkovchain,anidenticalresultholdsforallfunctionsg(θ)withﬁniteposteriorvariance.AproofofthisresultreliesonTheorem5ofTierney’spaper,butisotherwisethesameastheoneabove.Theresultsextendbeyondthebasicpost-stratiﬁcationestimator,applyingalsotothemorecom-plexpost-stratiﬁcationestimators.Theproofofthefollowingcorollaryformodiﬁedpost-stratiﬁcationestimatorsisalmostidenticaltothatoftheprevioustheorem.Itdiﬀersonlyinminordetailsspeciﬁctomodiﬁedpost-stratiﬁcationestimators.Non-singularityofthematrixBdeﬁnedbelowisrequiredforlocalcontinuityoftheestimator.Corollary3.2LetEˆ[g(θ)]∗w,psdenotethemodiﬁedversionofthepost-stratiﬁcationestimatorbasedon11