Results for 'Hadoop'

13 found
Order:
  1.  5
    Hadoop-Based Painting Resource Storage and Retrieval Platform Construction and Testing.Chenhua Zu - 2021 - Complexity 2021:1-11.
    This paper adopts Hadoop to build and test the storage and retrieval platform for painting resources. This paper adopts Hadoop as the platform and MapReduce as the computing framework and uses Hadoop Distributed Filesystem distributed file system to store massive log data, which solves the storage problem of massive data. According to the business requirements of the system, this paper designs the system according to the process of web text mining, mainly divided into log data preprocessing module, (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  2.  10
    Dynamic Prediction Research of Silicon Content in Hot Metal Driven by Big Data in Blast Furnace Smelting Process under Hadoop Cloud Platform.Yang Han, Jie Li, Xiao-Lei Yang, Wei-Xing Liu & Yu-Zhu Zhang - 2018 - Complexity 2018:1-16.
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  3.  3
    Internet jako pramen výzkumu: Přístup k archivovaným webovým zdrojům a možnosti jejich zpracování.Zdenko Vozár, Marie Haškovcová & Andrea Prokopová - 2022 - Teorie Vědy / Theory of Science 44 (1):59-87.
    The Internet has become a natural communication platform for modern society. Web archives, which began in the 1990s to capture and preserve changing web content, have thus become key sources for research in the recent past. The analysis of their data is complicated by, for example, insufficient competencies of researchers, the need for computing resources or legislation. One way to meet the needs of users is to develop tools and research interfaces that allow to work with data without the need (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  4.  6
    Big Data solutions on a small scale: Evaluating accessible high-performance computing for social research.Sawyer A. Bowman & Dhiraj Murthy - 2014 - Big Data and Society 1 (2).
    Though full of promise, Big Data research success is often contingent on access to the newest, most advanced, and often expensive hardware systems and the expertise needed to build and implement such systems. As a result, the accessibility of the growing number of Big Data-capable technology solutions has often been the preserve of business analytics. Pay as you store/process services like Amazon Web Services have opened up possibilities for smaller scale Big Data projects. There is high demand for this type (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  5.  4
    Study on data mining method of network security situation perception based on cloud computing.Rahul Neware, Vishal Jagota, Arshpreet Kaur & Yan Zhang - 2022 - Journal of Intelligent Systems 31 (1):1074-1084.
    In recent years, the network has become more complex, and the attacker’s ability to attack is gradually increasing. How to properly understand the network security situation and improve network security has become a very important issue. In order to study the method of extracting information about the security situation of the network based on cloud computing, we recommend the technology of knowledge of the network security situation based on the data extraction technology. It converts each received cyber security event into (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  6.  78
    Exploiting Sharing Join Opportunities in Big Data Multiquery Optimization with Flink.Xiao-Yan Gao, Radhya Sahal, Gui-Xiu Chen, Mohammed H. Khafagy & Fatma A. Omara - 2020 - Complexity 2020:1-25.
    Multiway join queries incur high-cost I/Os operations over large-scale data. Exploiting sharing join opportunities among multiple multiway joins could be beneficial to reduce query execution time and shuffled intermediate data. Although multiway join optimization has been carried out in MapReduce, different design principles are not considered. To bridge the gap of not considering the optimization of Big Data platforms, an end-to-end multiway join over Flink, which is called Join-MOTH system, is proposed to exploit sharing data granularity, sharing join granularity, and (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  7.  6
    Data Analysis Method of Intelligent Analysis Platform for Big Data of Film and Television.Youwen Ma & Yi Wan - 2021 - Complexity 2021:1-10.
    Based on cloud computing and statistics theory, this paper proposes a reasonable analysis method for big data of film and television. The method selects Hadoop open source cloud platform as the basis, combines the MapReduce distributed programming model and HDFS distributed file storage system and other key cloud computing technologies. In order to cope with different data processing needs of film and television industry, association analysis, cluster analysis, factor analysis, and K-mean + association analysis algorithm training model were applied (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  8.  2
    Design of metaheuristic rough set-based feature selection and rule-based medical data classification model on MapReduce framework.Sadanandam Manchala & Hanumanthu Bhukya - 2022 - Journal of Intelligent Systems 31 (1):1002-1013.
    Recently, big data analytics have gained significant attention in healthcare industry due to generation of massive quantities of data in various forms such as electronic health records, sensors, medical imaging, and pharmaceutical details. However, the data gathered from various sources are intrinsically uncertain owing to noise, incompleteness, and inconsistency. The analysis of such huge data necessitates advanced analytical techniques using machine learning and computational intelligence for effective decision making. To handle data uncertainty in healthcare sector, this article presents a novel (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  9.  1
    The Psychology Analysis for Post-production of College Students’ Short Video Communication Education Based on Virtual Image and Internet of Things.Wufeng Tang - 2022 - Frontiers in Psychology 13.
    To improve the understanding of film and television postproduction for college students in the era of intelligent media, a study is conducted on college students’ short video communication education and audience psychology based on the rapid development of virtual image and the Internet of Things. Primarily, the collaborative filtering algorithm is optimized and combined with the principle of Spark and Hadoop platforms as well as the IoT and virtual image technologies. Then, a hybrid computing model is proposed, and the (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  10.  7
    A MapReduce-Based Parallel Frequent Pattern Growth Algorithm for Spatiotemporal Association Analysis of Mobile Trajectory Big Data.Dawen Xia, Xiaonan Lu, Huaqing Li, Wendong Wang, Yantao Li & Zili Zhang - 2018 - Complexity 2018:1-16.
    Frequent pattern mining is an effective approach for spatiotemporal association analysis of mobile trajectory big data in data-driven intelligent transportation systems. While existing parallel algorithms have been successfully applied to frequent pattern mining of large-scale trajectory data, two major challenges are how to overcome the inherent defects of Hadoop to cope with taxi trajectory big data including massive small files and how to discover the implicitly spatiotemporal frequent patterns with MapReduce. To conquer these challenges, this paper presents a MapReduce-based (...)
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  11.  4
    Research on parallel data processing of data mining platform in the background of cloud computing.Lijun Wu, Haiyan Xing, Hui Zhang & Lingrui Bu - 2021 - Journal of Intelligent Systems 30 (1):479-486.
    The efficient processing of large-scale data has very important practical value. In this study, a data mining platform based on Hadoop distributed file system was designed, and then K-means algorithm was improved with the idea of max-min distance. On Hadoop distributed file system platform, the parallelization was realized by MapReduce. Finally, the data processing effect of the algorithm was analyzed with Iris data set. The results showed that the parallel algorithm divided more correct samples than the traditional algorithm; (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  12.  1
    Big Data Recommendation Research Based on Travel Consumer Sentiment Analysis.Zhu Yuan - 2022 - Frontiers in Psychology 13.
    More and more tourists are sharing their travel feelings and posting their real experiences on the Internet, generating tourism big data. Online travel reviews can fully reflect tourists’ emotions, and mining and analyzing them can provide insight into the value of them. In order to analyze the potential value of online travel reviews by using big data technology and machine learning technology, this paper proposes an improved support vector machine algorithm based on travel consumer sentiment analysis and builds an (...) Distributed File System system based on Map-Reduce model. Firstly, Internet travel reviews are pre-processed for sentiment analysis of the review text. Secondly, an improved SVM algorithm is proposed based on the main features of linear classification and kernel functions, so as to improve the accuracy of sentiment word classification. Then, HDFS data nodes are deployed on the basis of Hadoop platform with the actual tourism application context. And based on the Map-Reduce programming model, the map function and reduce function are designed and implemented, which greatly improves the possibility of parallel processing and reduces the time consumption at the same time. Finally, an improved SVM algorithm is implemented under the built Hadoop platform. The test results show that online travel reviews can be an important data source for travel big data recommendation, and the proposed method can quickly and accurately achieve travel sentiment classification. (shrink)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  13.  10
    Analysis of the Impact of Big Data on E-Commerce in Cloud Computing Environment.Rongrui Yu, Chunqiong Wu, Bingwen Yan, Baoqin Yu, Xiukao Zhou, Yanliang Yu & Na Chen - 2021 - Complexity 2021:1-12.
    This article starts with the analysis of the existing electronic commerce system, summarizes its characteristics, and analyzes and solves its existing problems. Firstly, the characteristics of the relational database My Structured Query Language and the distributed database HBase are analyzed, their respective advantages and disadvantages are summarized, and the advantages and disadvantages of each are taken into account when storing data. My SQL is used to store structured business data in the system, while HBase is used to store unstructured data (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark