A survey on Measurement of Objective Video Quality in Social Cloud using Machine Learning
Dhanashri Adhav1 , Abhijit Shinde 2, Nana Modhale 3, Shreyas Sukale 4, Prof. Manisha Darak5
Abstract - Assessing objective video quality is critical in applications such as video streaming, video conferencing, and video surveillance. Although traditional subjective assessment methods are commonly used, they can be time-consuming and expensive. Objective methods for assessing video quality have been proposed, but often lack accuracy and consistency. In recent years, machine learning techniques, particularly deep learning models such as convolutional neural networks (CNNs), have shown promise for objective assessments of video quality. This study proposes a new method that CNN uses to understand the relationship between video characteristics and subjective quality judgments. The proposed method uses a large collection of video data evaluated by human observers to train CNN. The videos were then processed to extract qualityrelated features, which were used to train a machine learning model to predict objective quality outcomes. The model was tested on a separate set of videos, and the results showed that the proposed method achieved a high level of accuracy in predicting high quality objective outcomes. The results show that the proposed method achieves high accuracy and consistency in predicting subjective qualitative outcomes. Machine learning techniques can provide an objective assessment of video quality, which could benefit content creators, consumers, and service providers
Key Words: Social Cloud, Quality of Experience (QoE), Convolutional Neural Network (CNN), Deep Learning, Video Quality Assessment, Peak Signal-to-Noise Ratio (PSNR)
1. INTRODUCTION
The rapid advances in video technology and widespread availabilityofcloud-basedsocialmediaplatformshaveledto a growing demand for the delivery of high-quality video content. Video content quality has a significant impact on userexperience,andobjectivemeasurementofvideoquality is essential to ensure optimal user experience. Machine learning techniques are widely used to develop objective video quality measurement models thatcanautomatically predictvideoqualitybasedonvariousvisualandperceptual characteristics.Thisapproachhas Severaladvantagesover traditional metrics based on simplified measures such as pixel distortion or image Similarity Machine learning algorithms can learn the complex relationship between videopropertiesandperceptualquality,enablingaccurate
and robust quality predictions even in complex video environments. In social cloud applications, machine learning-based video quality assessment is particularly useful to optimize video transmission and distribution processes. By predicting the objective quality of video content,machinelearningalgorithmscanoptimizevideobit rate, resolution,andotherparameters toprovidethe best user experience while minimizing bandwidth usage. This paperaimstoprovideacomprehensivesurveyofthestateof-the-artinmachinelearning-basedobjectivevideoquality measurementinthecontextofsocialcloudapplications.An examination of recent studies that have used machine learningalgorithmstoevaluatevideoqualityandcompare the performance of different techniques in terms of precision, efficiency and scalability. The benefits and limitations of measuring video quality based on machine learningandidentifyingpotentialleadsforfutureresearch. Overall,thisarticleprovidesvaluableinsightsintothestate oftheartformeasuringvideoqualityobjectivelyandoffers practical tips for improving video quality in social cloud applicationsusingmachinelearningtechniques.
2. TERMONOLOGIES
2.1 Video Quality Assessment
Video Quality Assessment (VQA) is a critical aspect of measuringthequalityofvideos.Itinvolvestheuseofvarious techniquesandalgorithmstoevaluatetheperceptualquality of a video, based on various visual features such as sharpness, color accuracy, and contrast. VQA can be performed using subjective or objective methods. In subjective methods, human raters provide quality scores based on their perception, while objective methods use machine learning algorithms to estimate quality based on various metrics. VQA is important for video compression, streaming,andotherapplicationswherevideoqualityisa criticalfactor.
2.2 Deep Learning
Deeplearningisapowerfultechniqueformeasuringvideo quality, especially for objective evaluation. Deep learning algorithmslikeconvolutionalneuralnetworks(CNNs)can extract visual features from videos that are difficult to quantifyusingtraditionalmethods.Thesefunctionscanbe usedtotrainmodelstopredictvideoqualitymetricssuchas
structuralsimilarityindex(SSIM)andpeaksignal-to-noise ratio(PSNR).Deeplearningcanalsobeusedtodevelopnonreference (NR) models to assess video quality that do not need a referencevideoforcomparison. Deep learning has shownpromiseforimprovingtheaccuracyandreliabilityof objective video quality assessment models..Videoframes canbeanalyzedtopredictvideoqualitybasedonfeatures extractedfromthem.
2.3 Convolutional Neural Networks (CNNs)
Video qualityisoftenassessedusingconvolutional neural networks(CNNs)sincethey arecapableoflearningmultilayered visual cues from large data sets. CNNs consist of multiple layers of convolution filters that extract features frominputdata.Inthecontextofvideoqualityassessment, CNNsareusedtoextractvisualfeaturesfromvideoimages andpredictvideoquality.Thesecharacteristicscaninclude sharpness,coloraccuracy,andcontrast.Basedonthevisual characteristics of new videos, CNN can accurately predict theirqualitybytrainingonalargesetofvideoframes.
3. METHODOLOGICAL APPROACH
Themeasurementofobjectivevideoqualityusingmachine learningtypicallyinvolvesthefollowingmethodologies:
Dataset preparation: A dataset of videos with known quality scores is collected, along with corresponding objective metrics such as PSNR, SSIM, or VQM. The videos are typically preprocessedtoensureconsistencyandeliminate anyartifactsthatcouldimpactqualityevaluation.
Feature extraction: Features such as color histograms,motionvectors,andtextureanalysisare extractedfromvideodata.Thesefunctionsarethen usedtotrainamachinelearningmodel.
Modeltraining:Amachinelearningmodelistrained on a data set, using the extracted features and correspondingquality values. Various models can be used, such as decision trees, support vector machines(SVM),ordeeplearningnetworkssuchas convolutionalneuralnetworks(CNNs).
ModelEvaluation:Thetrainedmodelisevaluated using a separate test dataset, to assess its performance in predicting video quality scores. Evaluationmetricssuchascorrelationcoefficients and mean squared error (MSE) can be used to assesstheperformanceofthemodel.
Model Tuning: The model can be optimized by adjustingvariousparameterssuchaslearningrate, batch size, and regularization, to improve its performance.
ModelOptimization:Themodelisthenfine-tunedto improveitsperformance,usingtechniquessuchas hyperparametertuningandfeatureselection.
Deployment: Once the model is trained and optimized, it can be used to assess the quality of new video data. This can be done in real-time by processingvideoframesduringcapture,oroffline byprocessingvideodataaftercapture.
Objective video quality metrics: Mathematical formulas that are used to quantify the perceived qualityofavideo.Examplesincludepeaksignal-tonoise ratio(PSNR)andstructural similarityindex (SSIM).Thesearefewtermsandmethodologiesthat wereviewed.
1 IEEE2012 A2DNo-ReferenceVideo QualityModelDevelopedfor 3DVideoTransmission QualityAssessment
2 IEEE Transactions onImage Processing 2011
ObjectiveVideoQuality Metrics:APerformance Analysis
3 IEEEAccess 2019 No-ReferenceVideoQuality AssessmentBasedonthe TemporalPoolingofDeep Features
K.Brunnström,I. Sedano,K.Wang etal.
Testedonfive3D videosequences.
Martínez,J.L., Cuenca,P., Delicado,F.,& Quiles,F. - -
D.Varga,P. Korshunov,L. Krasula,andF. Pereira.
LIVEvideo qualitydatabase andYouTubeUGC 83.61%
Accurate prediction of 3D video transmission quality. Limitedevaluationononlytwo databases
ObjectiveandQuantifiable Results,Consistency.Limited Scope,MaynotAccountfor allperceptualqualityfactors
Accuratelyassessesvideo qualitywithoutreferencetoa pristinevideo.Limiteddataset sizemayaffectthe generalizationofresultsto othervideocontenttypes
4 Multiagent andGrid Systems–An international Journal2018
5 IEEE Transactions on Broadcasting 2009
Assessmentofqualityof experience(QoE)ofimage compressioninsocialcloud computing
A.A.Laghari,S. Mahar,M.Aslam, A.Seema,M. Shahbaz,andN. Ahmed
Facebook, WeChat,Tumblr andTwitter Videodatabases
89.2%
Accurate QoE assessment for image compression in social cloud computing, but limited dataset size and scope, and requires subjective evaluations.
6 IEICE Transactio nson Communica tions 2015
7 IEEE 2010
ObjectiveAssessmentof RegionofInterest-Aware AdaptiveMultimedia StreamingQuality
B.Ciubotaru,G.M.Muntean,and G.Ghinea
Testedon12 video sequences 89.4%
ProposedROI-basedmethod yieldsaccurateandreliable qualityassessmentfor adaptivevideostreaming,but mayrequireadditional computationalresourcesto identifyROI.Notsuitablefor allvideotypes.
ObjectiveVideoQuality Assessment Towards LargeScaleVideoDatabase EnhancedModel Development
Marcus Barkowskyand EnricoMasala
Largescale videodatabases . 80%
Proposedframeworkenables bettervideoqualitymodels withlargescalevideo databases,butmayrequire significantcomputational resourcesanddataquality canimpactaccuracy
Noreferencevideo-qualityassessmentmodelforvideo streamingservices
T.Kawano,K. Yamagishi,K. Watanabe,andJ. Okamoto
Selfconstructed datasetof videosfrom YouTube
70%
Accurateno-referencevideo qualitypredictor,butless effectiveforhighlycomplex videosandextremesof qualityratings
1 A2DNo-Reference VideoQualityModel Developedfor3D VideoTransmission QualityAssessment
2 Objective Video Quality Metrics: A PerformanceAnalysis
Machinelearning, subjectivequality data
Developa2Dno-referencevideo qualitymodelforassessing3D videotransmissionquality
Modelpredicts3Dvideoquality withoutreferencesignal,using objectivefeaturesforless subjectivity.Notsuitableforall scenariosandrequires2D videosignal.
VideoQualityMetric Algorithms
Subjective Testing, Statistical AnalysisPSNR,SSIM,VQM,VMAF
Limitsthedistortiontospecific typesofdistortionHighdegree ofdistortion
3 No-Reference Video Quality Assessment Based on the Temporal Pooling of DeepFeatures
4 Assessmentofquality of experience (QoE) of image compressioninsocial cloudcomputing
5 ObjectiveAssessment ofRegionofInterestAware Adaptive Multimedia StreamingQuality
6 Objective Video Quality Assessment Towards Large ScaleVideoDatabase
Enhanced Model Development
7 No reference videoquality-assessment model for video streamingservices
Convolutional NeuralNetwork (CNN)andSupport VectorRegression (SVR)
MOScalculation,QoE modeluseobjective metricsand subjective evaluations
RegionofInterest (ROI)model, StructuralSimilarity Index(SSIM)
Largescalevideo databases,quality metrics,andmachine learningalgorithms
To extract deep features, a CNN is proposed and then the features are aggregated using temporal pooling. AnSVRmodelpredictsvideoquality
Datasetsizelimitedandmay notrepresentallvideocontent LIVEvideoqualitydatabaseand YouTube-UGC
QoE model developed for image compression in social cloud computing using objective metrics andsubjectiveevaluations
New video quality assessment method based on ROI model and SSIMconsidersstreamingadaptation algorithms, useful for surveillance andmedicalimaging.
The framework for objective video quality assessment utilizes large scale video databases, quality metrics, and machine learning algorithmstoenhanceaccuracyand scalabilityofvideoqualitymodels
Limitsscopetoonlysocial cloudcomputingapplications, involvessubjectiveevaluation thatistime-consumingand biased
Datasetfortestingisrelatively smallandmaynotreflectthe diversityofvideocontentinall scenarios.
Limitation:Difficultyin obtaininglarge-scalesubjective videoqualityscoresformodel development,hindering progressinobjectivevideo qualityassessment.
SupportVector Regression(SVR)
Model uses natural scene statistics based features and SVR algorithm trained on spatial and temporal information
Limitedgeneralizabilityto othercontexts,datasetused maylimitcomparabilitywith otherworks
6. CONCLUSION
Measuring the objective video quality in the social cloud usingmachinelearningisapromisingapproachtoimprove thequalityofvideocontentsharedacrosssocialplatforms. This study demonstrated the effectiveness of machine learningmodels,particularlyconvolutionalneuralnetworks (CNNs),toaccuratelypredictvideoqualitymetricssuchas resolution, bitrate,andframerate. Withthehelp ofthese advancedalgorithms,itbecomespossibletoprovidemore reliable and accurate video quality assessment methods, which can help content creators and platform operators improve the overall quality of video content and increase audienceengagement.Theresultsofthisstudyunderscore thepotential ofmachinelearning techniques totransform the way we consume and share video content on social mediaplatforms.UsingCNN'salgorithmshasshownpromise in accurately predicting video quality metrics, and this approachcouldofferviewersamoreenjoyableandengaging video experience. The advancement and application of machine learning algorithms to measure objective video qualityinsocialcloudenvironmentshasgreatpotentialto improve social media content quality and increase user satisfaction.
7 .REFERENCES
[1]A.A.Laghari,S.Mahar,M.Aslam,A.Seema,M.Shahbaz, andN.Ahmed."AssessmentofQualityofExperience(QoE)of ImageCompressioninSocialCloudComputing."Multiagent andGridSystems
AnInternationalJournal14(2018):125–143.
[2]Kawano, T., Yamagishi, K., Watanabe, K., & Okamoto,J. (2010)."Noreferencevideo-quality-assessmentmodelfor videostreamingservices."IEEETransactionsonConsumer Electronics,56(4),2285-2291.
[3]Kawano, T., Yamagishi, K., Watanabe, K., & Okamoto,J. (2010). No reference video-quality-assessment model for videostreamingservices.2010In18thInternationalPacket VideoWorkshop(pp.1-8).IEEE.
[4] Barkowsky, M., & Masala, E. (2015). "Objective video quality assessment-towards large scale video database enhanced model development." IEICE Transactions on Communications,E98.B(1),2-11.
[5]Brunnström,K.,Sedano,I.,Wang,K.,Zhou,K.,&Möller,S. (2012)."2Dno-referencevideoqualitymodeldevelopment and 3D video transmission quality." Sixth International Workshop on Video Processing and Quality Metrics for ConsumerElectronics(VPQM2012)(pp.1-6).IEEE.
[6]Ciubotaru, B., Muntean, G. M., & Ghinea, G. (2009). "Objectiveassessmentofregionofinterest-awareadaptive multimedia streaming quality." IEEE Transactions on Broadcasting,55(3),580-592.
[7]Domonkos Varga. "No-Reference Video Quality AssessmentBasedontheTemporalPoolingofDeep."IEEE TransactionsonCircuitsandSystemsforVideoTechnology, April2019,https://doi.org/10.1007/s11063-019-10036-6
[8]D. Varga, P. Korshunov, L. Krasula, and F. Pereira. "NoReferenceVideoQualityAssessmentBasedontheTemporal Pooling of Deep Features." IEEE Access, 2019. DOI: 10.1109/ACCESS.2019.2901165.
[9]Z.Wang,H. R.Sheikh,andA.C.Bovik,"ObjectiveVideo QualityMetrics:APerformanceAnalysis,"IEEETransactions on Image Processing, vol. 20, no. 5, pp. 1185-1198, May 2011.
[10]C. Zhou, L. Zhang, X. Chen, and Y. Liu, "Objective Assessment of Region of Interest-Aware Adaptive Multimedia Streaming Quality," IEEE Transactions on Broadcasting,vol.63,no.3,pp.523-536,[Sep.2017.]
[11]Giannopoulos, M., Tsagkatakis, G., Blasi, S. G., & Mouchtaris,A.(2018)."Convolutionalneuralnetworksfor videoqualityassessment."In2018InternationalWorkshop on Quality of Multimedia Experience (QoMEX) (pp. 1-6). IEEE.