Multimedia Filtering Analysis of Massive

Full text

Turn on search term navigation

This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

1. Introduction

With the advent of the era of big data, information presentation has exploded. For example, rich methods such as audio and video have integrated more information. For users, too much information and data need to be cleaned up in time, otherwise it will be a lot of burden [1, 2]. At the same time, not only is this information useful information for users, but also, because of the lack of corresponding security control mechanisms, bad information is also spreading [3–5]. All kinds of scams, cults, and other bad information flood the Internet, which is very poisonous to users' thoughts, and can even cause serious public safety hazards. Therefore, for these bad information, it is necessary to adopt a certain filtering plan to achieve the filtering of the information, but the tradituuonal method cannot guarantee and complete the filtering.

Therefore, in response to this limitation, based on data mining algorithms, this paper proposes a model of massive information multimedia filtering analysis. Through the steps of capturing, identifying, filtering, compensating, and communicating, it tries to explore the filtering analysis of massive information, aiming to optimize the Internet environment.

2. Data Mining Algorithm

The so-called data mining algorithm is to reproduce the hidden knowledge, services, and data through data mining of samples. this paper builds a distributed multimedia stream filtering system based on data mining algorithms. The specific architecture is shown in Figure 1. First, the corresponding front-end processor is set up for preliminary filtering, mainly for content identification and filtering, and the corresponding records are recorded and sent back to the data control center; the control center summarizes and updates information to ensure the sharing of front-end information. In order to ensure the smooth flow of communication, the interconnection can be realized by means of center and connection [6–9].

[figure omitted; refer to PDF]

Secondly, the filtering system in the front-end processor can work independently to complete the data flow filtering for the first time into the network. Through real-time content identification and analysis, the data judged as illegal messages are directly processed [10–12].

On the one hand, the messages processed for filtering measures are compared with the existing blacklist. If they match, an alarm will be triggered to prompt a filtered message to be processed; on the other hand, based on the alarm message identified by the real-time system, if there is indeed an illegal message, then proceed directly [13, 14].

For illegal multimedia streaming data packets, by replacing them with blank compensation frames, it is proved that compensation is made in multimedia streaming file transmission. While shielding illegal data packets, the integrity of the streaming media data packets is ensured. The specific structure diagram is shown in Figure 2:

[figure omitted; refer to PDF]

The communication module mainly includes polling and monitoring, blacklist uploading, updating, and other modules. The blacklist uploading is responsible for uploading local blacklist updates. The polling monitoring module is for monitoring the link requests of fixed ports and is responsible for mutual links, so as to complete the implementation. The filtering rules of data packets are processed in the whole process.

4. Improvement of Filtration Technology

Relying on data mining algorithms, it optimizes the filtering engine for scalability, rigor, and multimedia information feature recognition to realize the optimization and perfection of filtering.

4.1. Algorithm Expansion and Optimization

The underlying algorithm support library in the multimedia information filtering technology in the traditional big data environment is too old to recognize the newly emerging digital high-definition encoding and packaging formats in the big data environment, resulting in many new multimedia information resources that cannot be identified and filtered. For this reason, dynamic encoding algorithm is replaced, and the underlying support library of the original algorithm is updated. The dynamic encoding algorithm is refined and summarized according to the common characteristics of multimedia information in the big data environment. It has the characteristics of self-upgrading and self-learning. The dynamic encoding algorithm expression is $\begin{matrix} (1) & f d = \frac{2^{e}}{s} f_{e \in link} e^{2} \sum_{e = 1}^{f_{s}} ⟶ \prod_{link}^{1 / 2} d, \\ f d^{'} = \frac{f_{e \in link} e^{2}}{2} \sum_{e = 1}^{f_{s}} ⟶ \prod_{link}^{1 / 2} d, \\ f d^{″} = \frac{{f_{e \in link} e^{2}}^{2 e} \sum \prod d}{2 f_{s}} ⟹ \frac{1}{2_{link}} s . \end{matrix}$

Among them, d is the big data space, s is the data volume of the big data space, and $f_{e \in link} e^{2}$ is the feature function of the big data space.

The above dynamic coding algorithm expression is a steady-state dynamic coding algorithm expression. With the change of d and S values, the dynamic coding algorithm expression undergoes self-derived conversion to realize the function of self-upgrading and self-learning. By expanding the dynamic coding algorithm, to obtain a new dynamic encoding algorithm, the self-derived conversion code of the dynamic encoding algorithm is as follows:

import fsrrsd.util.ArrayList;

import drf. util.Arrays;

import fser.util.List;

public class FindKNeighbors implements Base{

/ $^{* *}$

$^{*}$ This method is used to find the nearest K neighbors to the un_scored item

$^{*}$ @ param score

$^{*}$ @ param i

$^{*}$ @ param similarityMatrix

$^{*}$ ©return

$^{*}$ /

public List findKNeighbors(int[] score, int i, double[][] similarityMatrix){

ListneighborSerial = new ArrayList();

double[]similarity = new double[similarityMatrix.length];

for(int j = 0; j < similarityMatrix.length; j++){

At this point, the underlying support library of massive multimedia information filtering technology in the big data environment has been updated, and the improved underlying support library supports common multimedia information encodings.

4.2. Improved Algorithm Logic Rigor

In the big data environment, the traditional multimedia information filtering technology algorithm has the problem of insufficient logic and dynamic logic bugs. When the amount of data in the big data environment increases suddenly, the logical retrieval is abnormal, and the traditional algorithm collapses and stops, resulting in a burst of multimedia information data.

In response to this problem, an auxiliary logic algorithm is added to the above dynamic coding algorithm to strengthen the stability and logical rigor of the algorithm and solve the collapse caused by abnormal data surges in the big data environment. The auxiliary logic algorithm (ALA) is based on the internal multimedia of the big data environment. Information resources have unique encapsulation tags, which can retrieve, analyze, identify, confirm, and extract a series of process results in the internal arrangement of information under the tag. The total algorithm is automatically returned to the data, that is, the dynamic coding algorithm is used for identification and confirmation [14-15]. The formula is as follows: $\begin{matrix} (2) & \sin kmv = φ ⟹ f w \frac{c^{2}}{link ⟶ \prod d}, \\ φ^{'} ⟹ f_{1} w^{'} \frac{c^{2}}{link ⟶ \prod d_{1}}, \\ φ_{1}^{'} ⟹ f_{2} w_{1}^{'} \frac{c^{3}}{link ⟶ \prod d_{2}}, \\ \sin {kmv}^{n} = φ_{i}^{m} ⟹ f w_{i - 1}^{'} \frac{c^{2 \land m}}{link ⟶ \prod d_{i}} . \end{matrix}$

Among them, the value range of n, m, and i is determined by the big data resource coefficient in the network space and meets the restriction condition (n< m ∈ big data space resource amount, i ≠ 0), $f_{1} w^{'}$ represents the first restriction condition of dynamic data, $f_{2} w^{'}$ represents the second constraint condition of dynamic data, $f w_{i - 1}^{'}$ represents the constraint condition of the i-1th dynamic data, c represents the retrieval process of dynamic data, $φ^{'}$ represents the first dynamic data, $φ_{1}^{'}$ represents the second dynamic data, $φ_{i}^{m}$ represents the mth dynamic data, $link ⟶ \prod d_{i}$ represents the mapping process of dynamic data, and $f_{1} w^{'}$ represents the second constraint condition of dynamic data.

When a new multimedia information data encapsulation format appears in a big data environment, the auxiliary logic algorithm will perform feature processing according to the newly emerging multimedia encapsulation format encoding the data arrangement method and return the processed new multimedia information encapsulation feature tags to the underlying encoding support library, to achieve the self-upgrading function. The improved auxiliary logic algorithm execution code adds active execution code to ensure that the auxiliary logic algorithm scans the dynamics of multimedia information in the big data environment in real time. It provides guarantee for the accurate extraction of the subsequent filtering engine.

The auxiliary logic algorithm execution code is shown below.

Matrix Matrix:operator+(Matrix and b)

{

//Feature overload function

if(m! = b.m||n! = b.n)

cout<<“\nEncoding or container mismatch”;

exit(0);

}

Matrix c;

c.m = m;

c.n = n;

c.p = new double[m $^{*}$ n];

int i,j;

for(i = 0; i < m; i++)

for(j = 0; j < n; j++)

c.p[i $^{*}$ c.n + j] = p[i $^{*}$ c.n + j]+b.p[i $^{*}$ c.n + j];

Out(c);

returnc;

}

Matrix Matrix:operator-(Matrix &b)

{

//Retrieve overloaded functions

if(m! = b.m||n! = b.n)

{

cout<<”\nEncoding or container mismatch”

exit(0);

//Invoke the overloaded functions of the support library

Matrix c;

c.m = m;

c.n = n;

c.p = new double[m $^{*}$ n];

if(m! = b.n)

{

cout<<”\nEncoding or container mismatch”;

exit(0);

}

int i,j,k;

for(i = 0; i < m; i++)

for(j = 0; j < b.n; j++)upgrade”

for(c.p[i $^{*}$ b.n + j] = 0,k = 0; k < b.n; k++)

c.p[i $^{*}$ b.n + j]+ = p[i $^{*}$ b.n + k]water b.p[k $^{*}$ b.n + j];

Out(c);

return c;

}

At this point, the algorithm logic optimization of the massive multimedia information filtering technology improvement under the big data environment has been completed. The working principle of the optimized technology algorithm is shown in Figure 4.

[figure omitted; refer to PDF]

4.3. Multimedia Information Feature Recognition and Filtering Engine

The multimedia information feature comparison module adopts the multimedia information core NDA construction algorithm, which has the characteristics of higher recognition rate and higher accuracy than traditional filtering algorithms. At the same time, the algorithm will write a string of dynamic identity codes at the bottom of the recognized multimedia information data. The code itself does not affect the data content where the original multimedia information is located and is only used for identification, and only this technology can recognize this code. The multimedia information core NDA construction algorithm mainly includes

chvd ⟶ /sd/sw/acw/da/aawa/ $\exists$

linkf ⟶ DNA $\leftrightarrow$ a or b $^{*}$

run/lad[dad.far]-exit

chint-jsffitc; $^{*}$ g{?}

Write identification code_t>.

The feature filtering classification module, as a component of the last improvement design module in the improvement of the massive multimedia information filtering technology in the big data environment, plays an important role. It uses the core DNA leakage algorithm that matches the multimedia information core NDA construction algorithm. Perform information omission processing on multimedia information data with identification codes so that similar multimedia information is filtered and arranged in a centralized manner, eliminating the need for postprocessing operations.

The kernel DNA leakage algorithm adopts the principle of different multimedia structure quantities to arrange the order frames to arrange the order frames of different types of multimedia information data in the reverse order to form a huge reverse order frame network. The multimedia information data that have been identified are passed differently according to the guidance. In the reverse intersecting frame network gap, data without the identification code cannot pass, thereby completing the filtering and classification operation of massive multimedia information in the big data environment. The execution code of the kernel DNA omission algorithm is shown below.

Function RemoveDNA(srDNA)

Dim objRegExp, Mach,Maches

Set objRegExp = New Regexp

objRegExp.IgnoreCase = True

objRegExp.Global = True

“Take the underlying code<>

objRegExp.Pa_ttern = ”<.+?>”

'Make a match

Set Matches = objRegExp.Execu-te(strDNA)

'Traverse the matching set and filter out matching items

For Each Mach in Maches

strHtml = Replace(strDNA, Match.Value,“”)

RemoveDNA = strDNA

SetobjRegExp = Nothing

EndFunctionID3

The chart of massive multimedia information filtering under the improved data mining algorithm is shown in Figure 5:

[figure omitted; refer to PDF]

5. Experiment and Analysis

The simulation experiment can be divided into time-limited test and specified sample test, as shown by the accuracy and time consumption of massive multimedia information filtering technology based on data mining algorithm.

Experiment 1 has a test time of 60 minutes, each 10 minutes as a group, and a total of 6 groups, free to operate with traditional multimedia information filtering technology and compares the number of filtering within the time. The traditional multimedia information filtering technology and the improved multimedia information filtering technology are used to filter the information, obtain the number of information filtering, and calculate the filtering accuracy rate.

Experiment 2 test samples are 40,000 pieces of multimedia information, divided into 8 groups for testing, comparing the time and filtering effect of multimedia information filtering technology and traditional multimedia information filtering technology in the improved big data environment.

The analysis of the test results shows that, as shown in Figures 6 and 7, the massive multimedia information filtering method fused with data mining algorithms can accurately and effectively filter data packets, while ensuring a better recognition effect, and it takes a short time. The efficiency is high, and it meets the filtering requirements of multimedia information.

[figure omitted; refer to PDF][figure omitted; refer to PDF]

6. Conclusions

The advent of the data age has brought explosive data. How to clean up useful information from these data is the top priority of the current work. Relying on the data mining algorithm, this paper proposes a massive information multimedia filtering analysis model, which analyzes content recognition and packet filtering, and performs matching and content analysis with the blacklist collected by the platform. If the match is consistent, direct filtering measures are taken to save money. Practice has proved that the data mining algorithm can effectively support the normal work of the system and realize the effective support of the filtering system.

References

[1] L. Di, S. Kar, J. M. F. Moura, H. V. Poor, "Distributed kalman filtering over massive data sets: analysis through large deviations of random riccati equations," IEEE Transactions on Information Theory, vol. 61 no. 3, pp. 1351-1362, 2015.

[2] J. D. Chadarevian, B. D. Fletcher, J. Chatten, H. H. Rabinovitch, "Massive infantile nephroblastomatosis: a clinical, radiological, and pathological analysis of four cases," Cancer, vol. 39 no. 5, pp. 2294-2305, 2015.

[3] H. Ates, I. Ates, H. Kundi, F. M. Yilmaz, "Diagnostic validity of hematologic parameters in evaluation of massive pulmonary embolism," Journal of Clinical Laboratory Analysis, vol. 5 no. 2, 2017.

[4] P. Ncube, M. Pidou, T. Stephenson, B. Jefferson, P. Jarvis, "Consequences of pH change on wastewater depth filtration using a multimedia filter," Water Research, vol. 128 no. 2, pp. 111-119, DOI: 10.1016/j.watres.2017.10.040, 2018.

[5] R. Krishnan, M. R. Khanzadi, N. Krishnan, Y. Wu, A. Graell i Amat, T. Eriksson, R. Schober, "Linear massive MIMO precoders in the presence of phase noise—a large-scale analysis," IEEE Transactions on Vehicular Technology, vol. 3 no. 1, 2015.

[6] T. Li, J. M. Corchado, S. Sun, J. Bajo, "Clustering for filtering: multi-object detection and estimation using multiple/massive sensors," Information Sciences, vol. 388-389 no. 89, pp. 172-190, DOI: 10.1016/j.ins.2017.01.028, 2017.

[7] M. K. Chung, J. L. Hanson, J. Ye, R. J. Davidson, S. D. Pollak, "Persistent homology in sparse regression and its application to brain morphometry," IEEE Transactions on Medical Imaging, vol. 34 no. 9, pp. 1928-1939, DOI: 10.1109/tmi.2015.2416271, 2015.

[8] Y. Chan, "Mathematical modeling and simulations on massive hydrogen yield using functionalized nanomaterials," Journal of Mathematical Chemistry, vol. 53 no. 5, pp. 1280-1293, DOI: 10.1007/s10910-015-0488-4, 2015.

[9] P. Pinilla, J. D. Boer, M. Benisty, A. Juhasz, M. D. J. Ovelar, C. Dominik, H. Avenhaus, T. Birnsttiel, J. H. V. Girard, N. Huelamo, A. Isella, J. Milli, "Variability and dust filtration in the transition disk J160421.7-213028 observed in optical scattered light," Astronomy and Astrophysics, vol. 584 no. 4, pp. 109-117, DOI: 10.1051/0004-6361/201526981, 2015.

[10] D. Ichikawa, T. Saito, H. Oyama, "Impact of predicting health-guidance candidates using massive health check-up data: a data-driven analysis," International Journal of Medical Informatics, vol. 106 no. 10, pp. 32-36, DOI: 10.1016/j.ijmedinf.2017.08.002, 2017.

[11] S. Rajoria, A. Trivedi, W. W. Godfrey, "Performance analysis of two tier HetNets with massive MIMO enabled wireless backhauling," Wireless Networks, vol. 26 no. 2, pp. 1459-1472, DOI: 10.1007/s11276-019-02205-1, 2020.

[12] F. Muramatsu, K. Nishimori, R. Taniguchi, T. Hiraguri, "Multi-beam massive MIMO with beam-selection using only amplitude information in uplink channel," IEICE—Transactions on Communications, vol. 101 no. 7, pp. 19-28, 2018.

[13] P. Sundaram, M. Luessi, M. Bianciardi, S. Stufflebeam, M. Hamalainen, V. Solo, "Individual resting-state brain networks enabled by massive multivariate conditional mutual information," IEEE Transactions on Medical Imaging, vol. 39 no. 6, pp. 1957-1966, DOI: 10.1109/tmi.2019.2962517, 2020.

[14] A. Bhamri, K. Hooli, T. Lunttila, "Massive carrier aggregation in LTE-advanced pro: impact on uplink control information and corresponding enhancements," IEEE Communications Magazine, vol. 54 no. 5, pp. 92-97, DOI: 10.1109/mcom.2016.7470941, 2016.

[15] J. Liu, N. Yao, Y. Zhao, "Improvement of telbivudine on renal function and massive proteinuria: a case report," Journal of Viral Hepatitis, vol. 24 no. 2, pp. 75-77, DOI: 10.1111/jvh.12789, 2017.

Word count: 2630

Show less

Copyright © 2021 Bo Wang. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0/

Abstract

Translate

With the advent of the big data era, information presentation has exploded. For example, rich methods such as audio and video have integrated more information, but with it, a lot of bad information has been brought. In view of this situation, this paper relies on data mining algorithms, builds a multimedia filtering system model for massive information, and integrates content recognition, packet filtering, and other technologies to match the two to ensure the integrity and real time of filtering. Practice results prove that the method is effective.

Details

Title

Multimedia Filtering Analysis of Massive Information Combined with Data Mining Algorithms

Author

Wang, Bo¹

¹ Department of Hygiene Management, Nanyang Medical College, Nanyang 473000, Henan, China

Editor

Zhendong Mu

Publication year

2021

Publication date

2021

Publisher

John Wiley & Sons, Inc.

ISSN

16875680

e-ISSN

16875699

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.1155/2021/7461874

ProQuest document ID

2576546075

Multimedia Filtering Analysis of Massive Information Combined with Data Mining Algorithms

Jump to:

Full text

Abstract

Details

Suggested sources