... Stefan. Digital video quality: vision models and metrics / Stefan Winkler. p. cm. Includes bibliographical references and index. ISBN 0-470-02404-6 1. Digital video. 2. Image processing Digital ... is necessary to build computational models of the HVS and integrate them in tools for perceptual quality assessment. Digital Video Quality - Vision Models and Metrics Stefan Winkler © 2005 John ... 1/f. 30 VISION Digital Video Quality Vision Models and...
... to hybrid standards such as D2-MAC (analog video, digital sound) and delayed the introduction of 100% digital video. However, the very rapid progress made in compression techniques and IC technology ... algorithms and modulation schemes, which are explained in Chapters 6 and 7. We will now examine the principles and various steps of video and audio compression which allow these bit-rates (and in ... standards for broadcasting digital pictures to the consumer, and the solutions chosen for the European DVB system (Digital Video Broadcasting) based on the international MPEG-2 compression standard. The...
... LLC Sondhi, M.M. & Schroeter, J. “Speech Production Models and Their Digital Implementations” Digital Signal Processing Handbook, Ed. Vijay K. Madisetti and Douglas B. Williams, Boca Raton: CRC Press ... Nasal Coupling
Nasal sounds are produced by opening the velum and thereby coupling the nasal cavity to the vocal tract. In nasal consonants, the vocal tract itself is closed at some point between the velum and the lips, and all the airflow is diverted into the nostrils. In nasal vowels the vocal tract remains open. (Nasal vowels are common in French and several other languages. They are not nominally phonemes of English. However, some nasalization of vowels commonly occurs in English speech.)
In terms of chain matrices, the nasal coupling can be handled without too much additional effort. As far as its acoustical properties are concerned, the nasal cavity can be treated exactly like the vocal tract, with the added simplification that its shape may be regarded as fixed. The common assumption is that the nostrils are symmetric, in which case the cross-sectional areas of the two nostrils can be added and the nose replaced by a single, fixed, variable-area tube.
The description of the computations is easier to follow with the aid of the block diagram shown in Fig. 44.5. From a knowledge of the area functions and losses for the vocal and nasal tracts, three chain matrices Kgv, Kvt, and Kvn are first computed. These represent, respectively, the matrices from glottis to velum, velum to tract closure (or velum to lips, in case of a nasal vowel), and velum to nostrils. From Kvn, with some assumed impedance termination at the nostrils, the input impedance of the nostrils at the velum may be computed as indicated in Eq. (44.16b). Similarly, Kvt gives the input impedance, at the velum, of the vocal tract looking toward the lips. At the velum, these two impedances are combined in parallel to give a total impedance, say Zv. With this as termination, the velocity-to-velocity transfer function, Tgv, from glottis to velum can be computed from Kgv as shown in Eq. (44.16b). For a given volume velocity at the glottis, Ug, the volume velocity at the velum is Uv = Tgv Ug, and the pressure at the velum is Pv = Zv Uv. Once Pv and Uv are known, the volume velocity and/or pressure at the nostrils and lips can be computed by inverting the matrices Kvn and Kvt.
FIGURE 44.5: Chain matrices for synthesizing nasal sounds.
© 1999 by CRC Press LLC
44.4 Sources of Excitation
As mentioned earlier, speech sounds may be classified by type of excitation: periodic, turbulent, or transient. All of these types of excitation are created by converting the potential energy stored in the lungs due to excess pressure into sound energy in the audible frequency range of 20 Hz to 20 kHz. The lungs of a young adult male may have a maximum usable volume (“vital capacity”) of about 5 l. While reading aloud, the pressure in the lungs is typically in the range of 6 to 15 cm of water (roughly 600 to 1500 Pa, since 1 cm of water is about 98 Pa). Vocal cord vibrations can be sustained with a pressure as low as .2 cm of water. At the other extreme, a pressure as high as 195 cm of water has been recorded for a trumpet player. Typical average airflow for normal speech is about 0.1 l/s. It may peak as high as 5 l/s during rapid inhales in singing.
Periodic excitation originates mainly at the vibrating vocal folds, turbulent excitation originates primarily downstream of the narrowest constriction in the vocal tract, and transient excitations occur whenever a complete closure of the vocal pathway is suddenly released. In the following, we will explore these three types of excitation in some detail. The interested reader is referred to [18] for more information.
44.4.1 ...
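The nasal-coupling computation described above (three chain matrices, a parallel combination at the velum, then propagation back out to the nostrils) can be sketched at a single frequency as follows. This is only an illustration under stated assumptions. Eq. (44.16) is not reproduced in this excerpt, so the standard two-port relations are used with the convention [P_in, U_in] = K [P_out, U_out], which may differ from the chapter's. The tubes are lossless and uniform. All dimensions, the nostril termination impedance, and every function name are hypothetical.

```python
import numpy as np

RHO = 1.14e-3  # assumed air density, g/cm^3
C = 3.5e4      # assumed speed of sound, cm/s

def tube_chain_matrix(area_cm2, length_cm, freq_hz):
    """Chain matrix of a lossless uniform tube.
    Assumed convention: [P_in, U_in] = K @ [P_out, U_out]."""
    kl = 2.0 * np.pi * freq_hz / C * length_cm
    z0 = RHO * C / area_cm2  # characteristic impedance of the tube
    return np.array([[np.cos(kl), 1j * z0 * np.sin(kl)],
                     [1j * np.sin(kl) / z0, np.cos(kl)]])

def input_impedance(K, z_load=None):
    """Input impedance of a section terminated in z_load.
    z_load=None models a rigid closure (zero output flow)."""
    if z_load is None:
        return K[0, 0] / K[1, 0]
    return (K[0, 0] * z_load + K[0, 1]) / (K[1, 0] * z_load + K[1, 1])

def velocity_transfer(K, z_load):
    """Volume-velocity transfer U_out / U_in for a terminated section."""
    return 1.0 / (K[1, 0] * z_load + K[1, 1])

def nasal_consonant(freq_hz, u_glottis=1.0, z_nostril=5.0 + 2.0j):
    # Hypothetical single-tube stand-ins for the three tract portions.
    K_gv = tube_chain_matrix(4.0, 8.0, freq_hz)    # glottis -> velum
    K_vt = tube_chain_matrix(3.0, 5.0, freq_hz)    # velum -> oral closure
    K_vn = tube_chain_matrix(2.0, 11.0, freq_hz)   # velum -> nostrils

    z_n = input_impedance(K_vn, z_nostril)  # nasal branch seen from velum
    z_t = input_impedance(K_vt, None)       # closed oral branch seen from velum
    z_v = z_n * z_t / (z_n + z_t)           # parallel combination, Zv

    u_v = velocity_transfer(K_gv, z_v) * u_glottis  # Uv = Tgv * Ug
    p_v = z_v * u_v                                 # Pv = Zv * Uv

    # Invert Kvn: the flow entering the nasal branch is Pv / z_n.
    p_nos, u_nos = np.linalg.solve(K_vn, np.array([p_v, p_v / z_n]))
    return {"z_n": z_n, "z_t": z_t, "z_v": z_v, "u_v": u_v,
            "p_v": p_v, "p_nos": p_nos, "u_nos": u_nos}

result = nasal_consonant(500.0)
print("nostril volume velocity magnitude:", abs(result["u_nos"]))
```

For a nasal vowel, the same sketch would terminate the oral branch in a lip radiation impedance instead of a closure, and the lip output would be obtained by inverting Kvt in the same way.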
Synthesis
The vocal tract is approximated by a concatenation of about 20 uniform sections. The cross-sectional areas of these sections are either specified directly, or computed from a specification of articulatory parameters as shown in Fig. 44.3. The chain matrix for each section is computed at an adequate sampling rate in the frequency domain to avoid time-aliasing of the corresponding time functions. (Computation of the chain matrices requires a specification of the losses also. Several models exist which assign the losses in terms of the cross-sectional area [11, 16].) The chain matrices for the individual sections are combined to derive the matrices for various portions of the tract, as appropriate for the particular speech sound being synthesized.
For voiced sounds, the matrices for the sections from the glottis to the lips are sequentially multiplied to give the matrix from the glottis to the lips. From the k11, k12, k21, k22 components of this matrix, the transfer function Uout/Uin and the input impedance are obtained as in Eqs. (44.16a) and (44.16b). Knowing the radiation impedance ZR at the lips, we can compute the transfer function for output pressure, H = (Uout/Uin) ZR. The inverse FFT of the transfer function H and the input impedance Zin give the corresponding time functions h(n) and zin(n), respectively. These functions are computed every 20 ms, and the intermediate values are obtained by linear interpolation.
For the current time sampling instant n, the current pressure p1(n) at the input to the vocal tract is then computed by convolving zin with the past values of the glottal volume velocity ug. With p1 known, the pressure difference Ps − p1 on the left hand side of Eq. (44.22) is known. Equation (44.18) is discretized by using a backward difference for the time derivative. Thus, a new value of the glottal volume velocity is derived. This, together with the current values of the displacements of the vocal folds, gives us new values for the driving forces F1 and F2 for the coupled oscillator Eqs. (44.24a) and (44.24b). The coupled oscillator equations are also discretized by backward differences for time derivatives. Thus, the new values of the driving forces give new values for the displacements of the vocal folds. The new value of volume velocity also gives a new value for p1, and the computational cycle repeats, to give successive samples of p1, ug, and the vocal fold displacements. The glottal volume velocity obtained in this way is convolved with the impulse response h(n) to produce voiced speech.
If the speech sound calls for frication, the chain matrix of the tract is derived as the product of two matrices, one from the glottis to the narrowest constriction and one from the constriction to the lips, as discussed in the section on turbulent excitation. This enables us to compute the volume velocity at the constriction, and thus introduce a noise source on the basis of the Reynolds number.
Finally, to produce nasal sounds, the chain matrix for the nasal tract is also computed, and the output at the nostrils is computed as discussed in the section on chain matrices. If the lips are open, the output from the lips is also computed and added to the output from the nostrils to give the total speech signal. Details of the synthesis procedure may be found in [24].
References
[1] Edwards, H.T., Applied Phonetics: The Sounds of American English, Singular Publishing Group, San Diego, 1992, Chap. 3.
[2] Olive, J.P., Greenwood, A., and Coleman, J., Acoustics of American English Speech, Springer Verlag, New York, 1993.
[3] Fant, G., Acoustic Theory of Speech Production, Mouton Book Co., Gravenhage, 1960, Chap. 2.1, 93-95.
[4] Baer, T., Gore, J.C., Gracco, L.C., and Nye, P.W., Analysis of vocal tract shape and dimensions using magnetic resonance imaging: Vowels, J. Acoust. Soc. Am., 90(2), 799-828, Aug 1991.
44 Speech Production Models and Their Digital...
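The frequency-domain part of the voiced-sound procedure above (multiply the section chain matrices from glottis to lips, form Uout/Uin and Zin, apply the radiation impedance, then take inverse FFTs) can be sketched as below. It is a minimal illustration and not the authors' implementation. It assumes lossless sections, the two-port convention [P_in, U_in] = K [P_out, U_out], and a low-frequency piston-in-baffle approximation for ZR. The constants, the sampling rate, the FFT length, and all function names are hypothetical. Because wall losses are omitted, the resonances are sharper than in a realistic tract, so h(n) decays slowly and is more prone to the time-aliasing the text warns about.

```python
import numpy as np

RHO = 1.14e-3  # assumed air density, g/cm^3
C = 3.5e4      # assumed speed of sound, cm/s

def section_matrices(area, length, freqs):
    """Lossless chain matrices for one uniform section, shape (F, 2, 2)."""
    kl = 2.0 * np.pi * freqs / C * length
    z0 = RHO * C / area
    K = np.empty((freqs.size, 2, 2), dtype=complex)
    K[:, 0, 0] = np.cos(kl)
    K[:, 0, 1] = 1j * z0 * np.sin(kl)
    K[:, 1, 0] = 1j * np.sin(kl) / z0
    K[:, 1, 1] = np.cos(kl)
    return K

def radiation_impedance(lip_area, freqs):
    """Assumed low-frequency piston-in-baffle approximation for ZR."""
    radius = np.sqrt(lip_area / np.pi)
    ka = 2.0 * np.pi * freqs / C * radius
    return (RHO * C / lip_area) * (ka**2 / 2.0 + 1j * 8.0 * ka / (3.0 * np.pi))

def tract_response(areas, section_len, fs=20000.0, n_fft=1024):
    """Return (freqs, Uout/Uin, h(n), zin(n)) for areas listed glottis to lips."""
    freqs = np.fft.rfftfreq(n_fft, d=1.0 / fs)
    K = np.tile(np.eye(2, dtype=complex), (freqs.size, 1, 1))
    for area in areas:  # sequential multiplication, glottis first
        K = K @ section_matrices(area, section_len, freqs)

    z_r = radiation_impedance(areas[-1], freqs)
    denom = K[:, 1, 0] * z_r + K[:, 1, 1]
    T = 1.0 / denom                                   # Uout / Uin
    z_in = (K[:, 0, 0] * z_r + K[:, 0, 1]) / denom    # input impedance
    H = T * z_r                                       # output-pressure transfer
    h = np.fft.irfft(H, n=n_fft)                      # impulse response h(n)
    z_in_t = np.fft.irfft(z_in, n=n_fft)              # time function zin(n)
    return freqs, T, h, z_in_t

# Hypothetical neutral tract: 20 sections, 17.5 cm total, 5 cm^2 area.
freqs, T, h, z_in_t = tract_response([5.0] * 20, 17.5 / 20)
band = freqs < 1000.0
print("lowest resonance near", freqs[band][np.argmax(np.abs(T[band]))], "Hz")
```

In a full synthesizer along the lines described, h(n) and zin(n) would be recomputed every 20 ms from updated areas, and the glottal volume velocity produced by the vocal-fold model would be convolved with h(n), for example with `np.convolve`.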
... and Implications for Quality, David J. Ballard, Robert S. Hopkins III, and David Nicewander 4 Quality Improvement Systems, Theories, and Tools, Mike Stoecklein Part II Organization and ... Strategy, and Tools provides a guide for quality improvement and a facilitator for dialog about quality. The chapters define quality in depth and put it into context for healthcare organizations and ... Improvement, the National Quality Forum, and the Agency for Healthcare Research and Quality has now clearly established the magnitude of the nation’s problems in healthcare quality and what needs to...
... with ROUGE/BE and Pyramid, DUC and TAC also asked human judges to score every candidate summary with regard to its content, readability, and overall responsiveness. DUC and TAC defined linguistic quality ... be between 0 and 1. Mathematically, TESLA takes as input the following: 1. The BNG of the model summary, X, and the BNG of the candidate summary, Y. The ith entry in X is xi and has weight ... Computational Linguistics Combining Coherence Models and Machine Translation Evaluation Metrics for Summarization Evaluation Ziheng Lin†, Chang Liu‡, Hwee Tou Ng‡ and Min-Yen Kan‡ †SAP Research, SAP...
... camera area the video and audio signals are switched to the monitor and the guard sees and hears the activity in the scene and initiates a response. 1.4.2 Overt vs. Covert Video Most video installations ... computing power, solid-state and magnetic memory, digital processing, and wired and wireless video signal transmission (analog, digital over the Internet, etc.), the basic video system still requires ... analog/digital video mechanisms, and brackets are smaller in size and weight resulting in lower costs and providing more aesthetic installations. The small cameras and lenses satisfy covert video...
... parts to a Mini DV track: video, audio, subcode and ITI (Insert and Tracking Information). The video and audio are self-explanatory. The subcode holds timecode, date and time and track numbers. The ... second of DV video. 14 The Videomaker Guide to Digital Video and DVD Production Figure 3.1 4:1:1 Sampling: DV encoding hardware samples each frame of video for luminance (brightness) and chrominance (color) ... Web videographer. Hitachi will soon introduce a DVD-RAM camcorder. This uses a removable DVD disc that stores ultra-high quality MPEG-2 video. Expect long recording times, high video quality and...
... local contrast enhancement algorithm for digital video cameras. EURASIP Journal on Image and Video Processing 2011, 2011:6. ... improve visual quality has gained increasing attention and has become an active area in image and video processing research [1,2]. This article addresses two common defects: LDR and poor contrast. Several ... contrast preservation) and Sigma 16, the proposed method with a = -1 (local contrast enhancement) and (g) Sigma 4, (h) Sigma 8, (i) Sigma 16. Tsai and Chou EURASIP Journal on Image and Video Processing...
... ranges from 40 to 80 for the mean of regional standard deviation and from 100 to 200 for the image mean. Tsai and Chou EURASIP Journal on Image and Video Processing 2011, 2011:6 http://jivp.eurasipjournals.com/content/2011/1/6 Page ... two parameters mmin and mmax set as (a) (mmin, mmax) = (100/255, 150/255), and (b) (mmin, mmax) = (10/255, 250/255). ... local contrast enhancement algorithm for digital video cameras. EURASIP Journal on Image and Video Processing 2011, 2011:6.