This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
1. Introduction
With the advent of the era of big data, information presentation has exploded. For example, rich methods such as audio and video have integrated more information. For users, too much information and data need to be cleaned up in time, otherwise it will be a lot of burden [1, 2]. At the same time, not only is this information useful information for users, but also, because of the lack of corresponding security control mechanisms, bad information is also spreading [3–5]. All kinds of scams, cults, and other bad information flood the Internet, which is very poisonous to users' thoughts, and can even cause serious public safety hazards. Therefore, for these bad information, it is necessary to adopt a certain filtering plan to achieve the filtering of the information, but the tradituuonal method cannot guarantee and complete the filtering.
Therefore, in response to this limitation, based on data mining algorithms, this paper proposes a model of massive information multimedia filtering analysis. Through the steps of capturing, identifying, filtering, compensating, and communicating, it tries to explore the filtering analysis of massive information, aiming to optimize the Internet environment.
2. Data Mining Algorithm
The so-called data mining algorithm is to reproduce the hidden knowledge, services, and data through data mining of samples. this paper builds a distributed multimedia stream filtering system based on data mining algorithms. The specific architecture is shown in Figure 1. First, the corresponding front-end processor is set up for preliminary filtering, mainly for content identification and filtering, and the corresponding records are recorded and sent back to the data control center; the control center summarizes and updates information to ensure the sharing of front-end information. In order to ensure the smooth flow of communication, the interconnection can be realized by means of center and connection [6–9].
[figure omitted; refer to PDF]
Secondly, the filtering system in the front-end processor can work independently to complete the data flow filtering for the first time into the network. Through real-time content identification and analysis, the data judged as illegal messages are directly processed [10–12].
On the one hand, the messages processed for filtering measures are compared with the existing blacklist. If they match, an alarm will be triggered to prompt a filtered message to be processed; on the other hand, based on the alarm message identified by the real-time system, if there is indeed an illegal message, then proceed directly [13, 14].
For illegal multimedia streaming data packets, by replacing them with blank compensation frames, it is proved that compensation is made in multimedia streaming file transmission. While shielding illegal data packets, the integrity of the streaming media data packets is ensured. The specific structure diagram is shown in Figure 2:
[figure omitted; refer to PDF]
The communication module mainly includes polling and monitoring, blacklist uploading, updating, and other modules. The blacklist uploading is responsible for uploading local blacklist updates. The polling monitoring module is for monitoring the link requests of fixed ports and is responsible for mutual links, so as to complete the implementation. The filtering rules of data packets are processed in the whole process.
4. Improvement of Filtration Technology
Relying on data mining algorithms, it optimizes the filtering engine for scalability, rigor, and multimedia information feature recognition to realize the optimization and perfection of filtering.
4.1. Algorithm Expansion and Optimization
The underlying algorithm support library in the multimedia information filtering technology in the traditional big data environment is too old to recognize the newly emerging digital high-definition encoding and packaging formats in the big data environment, resulting in many new multimedia information resources that cannot be identified and filtered. For this reason, dynamic encoding algorithm is replaced, and the underlying support library of the original algorithm is updated. The dynamic encoding algorithm is refined and summarized according to the common characteristics of multimedia information in the big data environment. It has the characteristics of self-upgrading and self-learning. The dynamic encoding algorithm expression is
Among them, d is the big data space, s is the data volume of the big data space, and
The above dynamic coding algorithm expression is a steady-state dynamic coding algorithm expression. With the change of d and S values, the dynamic coding algorithm expression undergoes self-derived conversion to realize the function of self-upgrading and self-learning. By expanding the dynamic coding algorithm, to obtain a new dynamic encoding algorithm, the self-derived conversion code of the dynamic encoding algorithm is as follows:
import fsrrsd.util.ArrayList;
import drf. util.Arrays;
import fser.util.List;
public class FindKNeighbors implements Base{
/
public List
List
double[]similarity = new double[similarityMatrix.length];
for(int j = 0; j < similarityMatrix.length; j++){
At this point, the underlying support library of massive multimedia information filtering technology in the big data environment has been updated, and the improved underlying support library supports common multimedia information encodings.
4.2. Improved Algorithm Logic Rigor
In the big data environment, the traditional multimedia information filtering technology algorithm has the problem of insufficient logic and dynamic logic bugs. When the amount of data in the big data environment increases suddenly, the logical retrieval is abnormal, and the traditional algorithm collapses and stops, resulting in a burst of multimedia information data.
In response to this problem, an auxiliary logic algorithm is added to the above dynamic coding algorithm to strengthen the stability and logical rigor of the algorithm and solve the collapse caused by abnormal data surges in the big data environment. The auxiliary logic algorithm (ALA) is based on the internal multimedia of the big data environment. Information resources have unique encapsulation tags, which can retrieve, analyze, identify, confirm, and extract a series of process results in the internal arrangement of information under the tag. The total algorithm is automatically returned to the data, that is, the dynamic coding algorithm is used for identification and confirmation [14-15]. The formula is as follows:
Among them, the value range of n, m, and i is determined by the big data resource coefficient in the network space and meets the restriction condition (n< m ∈ big data space resource amount, i ≠ 0),
When a new multimedia information data encapsulation format appears in a big data environment, the auxiliary logic algorithm will perform feature processing according to the newly emerging multimedia encapsulation format encoding the data arrangement method and return the processed new multimedia information encapsulation feature tags to the underlying encoding support library, to achieve the self-upgrading function. The improved auxiliary logic algorithm execution code adds active execution code to ensure that the auxiliary logic algorithm scans the dynamics of multimedia information in the big data environment in real time. It provides guarantee for the accurate extraction of the subsequent filtering engine.
The auxiliary logic algorithm execution code is shown below.
Matrix Matrix:operator+(Matrix and b)
{
//Feature overload function
if(m! = b.m||n! = b.n)
cout<<“\nEncoding or container mismatch”;
exit(0);
}
Matrix c;
c.m = m;
c.n = n;
c.p = new double[m
int i,j;
for(i = 0; i < m; i++)
for(j = 0; j < n; j++)
c.p[i
Out(c);
returnc;
}
Matrix Matrix:operator-(Matrix &b)
{
//Retrieve overloaded functions
if(m! = b.m||n! = b.n)
{
cout<<”\nEncoding or container mismatch”
exit(0);
//Invoke the overloaded functions of the support library
Matrix c;
c.m = m;
c.n = n;
c.p = new double[m
if(m! = b.n)
{
cout<<”\nEncoding or container mismatch”;
exit(0);
}
int i,j,k;
for(i = 0; i < m; i++)
for(j = 0; j < b.n; j++)upgrade”
for(c.p[i
c.p[i
Out(c);
return c;
}
At this point, the algorithm logic optimization of the massive multimedia information filtering technology improvement under the big data environment has been completed. The working principle of the optimized technology algorithm is shown in Figure 4.
[figure omitted; refer to PDF]4.3. Multimedia Information Feature Recognition and Filtering Engine
The multimedia information feature comparison module adopts the multimedia information core NDA construction algorithm, which has the characteristics of higher recognition rate and higher accuracy than traditional filtering algorithms. At the same time, the algorithm will write a string of dynamic identity codes at the bottom of the recognized multimedia information data. The code itself does not affect the data content where the original multimedia information is located and is only used for identification, and only this technology can recognize this code. The multimedia information core NDA construction algorithm mainly includes
chvd ⟶ /sd/sw/acw/da/aawa/
linkf ⟶ DNA
run/lad[dad.far]-exit
chint-jsffitc;
Write identification code_t>.
The feature filtering classification module, as a component of the last improvement design module in the improvement of the massive multimedia information filtering technology in the big data environment, plays an important role. It uses the core DNA leakage algorithm that matches the multimedia information core NDA construction algorithm. Perform information omission processing on multimedia information data with identification codes so that similar multimedia information is filtered and arranged in a centralized manner, eliminating the need for postprocessing operations.
The kernel DNA leakage algorithm adopts the principle of different multimedia structure quantities to arrange the order frames to arrange the order frames of different types of multimedia information data in the reverse order to form a huge reverse order frame network. The multimedia information data that have been identified are passed differently according to the guidance. In the reverse intersecting frame network gap, data without the identification code cannot pass, thereby completing the filtering and classification operation of massive multimedia information in the big data environment. The execution code of the kernel DNA omission algorithm is shown below.
Function RemoveDNA(srDNA)
Dim objRegExp, Mach,Maches
Set objRegExp = New Regexp
objRegExp.IgnoreCase = True
objRegExp.Global = True
“Take the underlying code<>
objRegExp.Pa_ttern = ”<.+?>”
'Make a match
Set Matches = objRegExp.Execu-te(strDNA)
'Traverse the matching set and filter out matching items
For Each Mach in Maches
strHtml = Replace(strDNA, Match.Value,“”)
Next
RemoveDNA = strDNA
SetobjRegExp = Nothing
EndFunctionID3
The chart of massive multimedia information filtering under the improved data mining algorithm is shown in Figure 5:
[figure omitted; refer to PDF]5. Experiment and Analysis
The simulation experiment can be divided into time-limited test and specified sample test, as shown by the accuracy and time consumption of massive multimedia information filtering technology based on data mining algorithm.
Experiment 1 has a test time of 60 minutes, each 10 minutes as a group, and a total of 6 groups, free to operate with traditional multimedia information filtering technology and compares the number of filtering within the time. The traditional multimedia information filtering technology and the improved multimedia information filtering technology are used to filter the information, obtain the number of information filtering, and calculate the filtering accuracy rate.
Experiment 2 test samples are 40,000 pieces of multimedia information, divided into 8 groups for testing, comparing the time and filtering effect of multimedia information filtering technology and traditional multimedia information filtering technology in the improved big data environment.
The analysis of the test results shows that, as shown in Figures 6 and 7, the massive multimedia information filtering method fused with data mining algorithms can accurately and effectively filter data packets, while ensuring a better recognition effect, and it takes a short time. The efficiency is high, and it meets the filtering requirements of multimedia information.
[figure omitted; refer to PDF][figure omitted; refer to PDF]6. Conclusions
The advent of the data age has brought explosive data. How to clean up useful information from these data is the top priority of the current work. Relying on the data mining algorithm, this paper proposes a massive information multimedia filtering analysis model, which analyzes content recognition and packet filtering, and performs matching and content analysis with the blacklist collected by the platform. If the match is consistent, direct filtering measures are taken to save money. Practice has proved that the data mining algorithm can effectively support the normal work of the system and realize the effective support of the filtering system.
[1] L. Di, S. Kar, J. M. F. Moura, H. V. Poor, "Distributed kalman filtering over massive data sets: analysis through large deviations of random riccati equations," IEEE Transactions on Information Theory, vol. 61 no. 3, pp. 1351-1362, 2015.
[2] J. D. Chadarevian, B. D. Fletcher, J. Chatten, H. H. Rabinovitch, "Massive infantile nephroblastomatosis: a clinical, radiological, and pathological analysis of four cases," Cancer, vol. 39 no. 5, pp. 2294-2305, 2015.
[3] H. Ates, I. Ates, H. Kundi, F. M. Yilmaz, "Diagnostic validity of hematologic parameters in evaluation of massive pulmonary embolism," Journal of Clinical Laboratory Analysis, vol. 5 no. 2, 2017.
[4] P. Ncube, M. Pidou, T. Stephenson, B. Jefferson, P. Jarvis, "Consequences of pH change on wastewater depth filtration using a multimedia filter," Water Research, vol. 128 no. 2, pp. 111-119, DOI: 10.1016/j.watres.2017.10.040, 2018.
[5] R. Krishnan, M. R. Khanzadi, N. Krishnan, Y. Wu, A. Graell i Amat, T. Eriksson, R. Schober, "Linear massive MIMO precoders in the presence of phase noise—a large-scale analysis," IEEE Transactions on Vehicular Technology, vol. 3 no. 1, 2015.
[6] T. Li, J. M. Corchado, S. Sun, J. Bajo, "Clustering for filtering: multi-object detection and estimation using multiple/massive sensors," Information Sciences, vol. 388-389 no. 89, pp. 172-190, DOI: 10.1016/j.ins.2017.01.028, 2017.
[7] M. K. Chung, J. L. Hanson, J. Ye, R. J. Davidson, S. D. Pollak, "Persistent homology in sparse regression and its application to brain morphometry," IEEE Transactions on Medical Imaging, vol. 34 no. 9, pp. 1928-1939, DOI: 10.1109/tmi.2015.2416271, 2015.
[8] Y. Chan, "Mathematical modeling and simulations on massive hydrogen yield using functionalized nanomaterials," Journal of Mathematical Chemistry, vol. 53 no. 5, pp. 1280-1293, DOI: 10.1007/s10910-015-0488-4, 2015.
[9] P. Pinilla, J. D. Boer, M. Benisty, A. Juhasz, M. D. J. Ovelar, C. Dominik, H. Avenhaus, T. Birnsttiel, J. H. V. Girard, N. Huelamo, A. Isella, J. Milli, "Variability and dust filtration in the transition disk J160421.7-213028 observed in optical scattered light," Astronomy and Astrophysics, vol. 584 no. 4, pp. 109-117, DOI: 10.1051/0004-6361/201526981, 2015.
[10] D. Ichikawa, T. Saito, H. Oyama, "Impact of predicting health-guidance candidates using massive health check-up data: a data-driven analysis," International Journal of Medical Informatics, vol. 106 no. 10, pp. 32-36, DOI: 10.1016/j.ijmedinf.2017.08.002, 2017.
[11] S. Rajoria, A. Trivedi, W. W. Godfrey, "Performance analysis of two tier HetNets with massive MIMO enabled wireless backhauling," Wireless Networks, vol. 26 no. 2, pp. 1459-1472, DOI: 10.1007/s11276-019-02205-1, 2020.
[12] F. Muramatsu, K. Nishimori, R. Taniguchi, T. Hiraguri, "Multi-beam massive MIMO with beam-selection using only amplitude information in uplink channel," IEICE—Transactions on Communications, vol. 101 no. 7, pp. 19-28, 2018.
[13] P. Sundaram, M. Luessi, M. Bianciardi, S. Stufflebeam, M. Hamalainen, V. Solo, "Individual resting-state brain networks enabled by massive multivariate conditional mutual information," IEEE Transactions on Medical Imaging, vol. 39 no. 6, pp. 1957-1966, DOI: 10.1109/tmi.2019.2962517, 2020.
[14] A. Bhamri, K. Hooli, T. Lunttila, "Massive carrier aggregation in LTE-advanced pro: impact on uplink control information and corresponding enhancements," IEEE Communications Magazine, vol. 54 no. 5, pp. 92-97, DOI: 10.1109/mcom.2016.7470941, 2016.
[15] J. Liu, N. Yao, Y. Zhao, "Improvement of telbivudine on renal function and massive proteinuria: a case report," Journal of Viral Hepatitis, vol. 24 no. 2, pp. 75-77, DOI: 10.1111/jvh.12789, 2017.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright © 2021 Bo Wang. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0/
Abstract
With the advent of the big data era, information presentation has exploded. For example, rich methods such as audio and video have integrated more information, but with it, a lot of bad information has been brought. In view of this situation, this paper relies on data mining algorithms, builds a multimedia filtering system model for massive information, and integrates content recognition, packet filtering, and other technologies to match the two to ensure the integrity and real time of filtering. Practice results prove that the method is effective.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer






