Operating Hive Through the Java API (Big Data Introductory Tutorial Series)

To access Hive from Java, you connect to hiveserver2, the same service that beeline uses. hiveserver2 provides a new command-line tool, beeline. It supersedes the previous Hive server, has more powerful functions, and adds permission control. To use beeline, start hiveserver2 first and then connect with beeline. Operation steps: (1) Modi ...

Posted by blacklotus on Wed, 30 Jan 2019 02:54:15 -0800

Elasticsearch Installation Guide and Basic Operations of Its REST API

This article shows how to install Elasticsearch (ES) on Windows and how to operate it through its REST API. (1) Installation of ES under Windows Before installing ES, make sure that Java is installed on the machine, preferably a recent version. I have Java 10 installed on my machine. C:\>java -version java versi ...

Posted by angel_cowgirl on Wed, 30 Jan 2019 02:06:15 -0800

Spark Exercises: Finding the Top N Teachers in Each Subject

[Note] This article draws on learning videos from Calf School. Data format: http://bigdata.edu360.cn/laozhang 1. Data segmentation val func=(line:String)=>{   val index=line.lastIndexOf("/")   val teacher=line.substring(index+1)   val httpHost=line.substrin ...
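The segmentation step in the excerpt splits each URL at the last "/" to get the teacher name. A minimal plain-Python sketch of the same idea (not Spark, and not the article's own code) is shown here. It assumes that the first label of the host name, such as "bigdata", identifies the subject; that is a guess, because the excerpt is cut off before it says how the subject is obtained. The function names parse_line and top_n_per_subject are hypothetical.

```python
from collections import Counter, defaultdict
from urllib.parse import urlparse


def parse_line(line):
    """Split one URL such as http://bigdata.edu360.cn/laozhang into (subject, teacher)."""
    # The teacher is the text after the last "/", as in the excerpt.
    teacher = line[line.rfind("/") + 1:]
    # Assumption: the first label of the host name is the subject.
    host = urlparse(line).hostname
    subject = host.split(".")[0]
    return subject, teacher


def top_n_per_subject(lines, n):
    """Count visits per (subject, teacher) and keep the n most visited teachers per subject."""
    counts = defaultdict(Counter)
    for line in lines:
        subject, teacher = parse_line(line)
        counts[subject][teacher] += 1
    return {subject: counter.most_common(n) for subject, counter in counts.items()}
```

In Spark, the counting would typically be done with a map followed by a reduce-by-key and a per-subject sort; the sketch only shows the logic on an in-memory list.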

Posted by SureFire on Tue, 29 Jan 2019 23:33:14 -0800

Viewing and Debugging Generated Code in SparkSQL

Websites and some books describe Spark SQL (DataFrame) as generating the statements it finally runs from the corresponding operations. This article starts with a simple, low-level problem, then looks at the generated code to find the root cause, and briefly introduces how to debug SparkSQL. Sour ...

Posted by NikkiLoveGod on Tue, 29 Jan 2019 20:51:15 -0800

Configuring the Hadoop environment

1. Edit the profile file export JAVA_HOME=/usr/lib/jvm/jdk/ export HADOOP_INSTALL=/usr/local/hadoop export PATH=$PATH:$HADOOP_INSTALL/bin export PATH=$PATH:$HADOOP_INSTALL/sbin export HADOOP_MAPRED_HOME=$HADOOP_INSTALL export HADOOP_COMMON_HOME=$HADOOP_INSTALL export HADOOP_HDFS_HOME=$HADOOP_INSTALL export YARN_HOME=$HADOOP_INSTAL ...

Posted by activomate on Tue, 29 Jan 2019 20:42:14 -0800

Implementing MapReduce in Eclipse

1. Prepare the environment Update Hadoop's mapred-site.xml and yarn-site.xml configuration files under Windows to match those in the virtual machine. Copy both configuration files into the project. Add the dependency packages. 2. Operation mode Run locally (start multiple threads in local Eclipse to simul ...

Posted by Yanayaya on Tue, 29 Jan 2019 15:15:15 -0800

Use of FreeMarker in Java

1. What is FreeMarker? FreeMarker is a template engine written in Java that generates text output from templates. FreeMarker is independent of web containers; that is, it knows nothing about Servlets or HTTP. It can be used not only as a presentatio ...

Posted by BuzzStPoint on Tue, 29 Jan 2019 13:36:15 -0800

Kafka (12): Integration of Kafka and Flume

I. Functionality Flume monitors a log file and sends the data to Kafka, and then another Flume agent reads the data back from Kafka. II. Implementation steps 1. Environment: flume1.6.0 kafka_2.10-0.8.2.1 zookeeper3.4.5 2. Configuration file test1_1.6.conf, with which Flume monitors logs and sends them to Kafka [reference: http://archive. ...

Posted by cutups on Tue, 29 Jan 2019 11:15:15 -0800

DevOps Key Tools and Technologies (7) - Pipeline-based JMeter Performance Testing [2] (Pipeline Integration of JMeter Performance Testing)

In the last three articles, we used automation tools to integrate Jenkins Docker Cloud with Robot Framework + Selenium + Chrome and implemented automated Web testing in Pipeline, from the installation of the Web automated testing tools under Windows to the containerization of the Web automated testing tool Robot Frame ...

Posted by shiva on Tue, 29 Jan 2019 01:24:14 -0800

Recommendation Algorithms: Item-based Collaborative Filtering Algorithms

Reference: "Recommendation System Practice" by Xiang Liang. Concept: item-based collaborative filtering algorithm, optimization algorithm. Contrast: advantages and disadvantages compared with user-based collaborative filtering. Python coding implementation. 1. Algorithmic Definition User-based collaborative filtering algorithm Commod ...
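For orientation, here is a minimal sketch of item-based collaborative filtering in Python. It uses one common formulation, in which the similarity of items i and j is the number of users who interacted with both, divided by the square root of the product of each item's user count. The excerpt is truncated before its own definition, so this may differ from the article's version. The function names, the parameters k and n, and the input layout (a dict mapping each user to a set of items) are all assumptions made for illustration.

```python
import math
from collections import defaultdict
from itertools import combinations


def item_similarity(user_items):
    """Build item-to-item similarity from a dict of user -> set of items."""
    item_users = defaultdict(int)                      # how many users touched each item
    co_counts = defaultdict(lambda: defaultdict(int))  # how many users touched both items
    for items in user_items.values():
        for i in items:
            item_users[i] += 1
        for i, j in combinations(sorted(items), 2):
            co_counts[i][j] += 1
            co_counts[j][i] += 1
    # Normalise co-occurrence counts by the popularity of both items.
    return {
        i: {j: c / math.sqrt(item_users[i] * item_users[j]) for j, c in related.items()}
        for i, related in co_counts.items()
    }


def recommend(user, user_items, sim, k=3, n=5):
    """Score unseen items by summing similarity to the user's items (top k neighbours each)."""
    seen = user_items[user]
    scores = defaultdict(float)
    for i in seen:
        neighbours = sorted(sim.get(i, {}).items(), key=lambda kv: kv[1], reverse=True)[:k]
        for j, weight in neighbours:
            if j not in seen:
                scores[j] += weight
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)[:n]
```

The sketch treats every interaction as implicit feedback with weight 1. With explicit ratings, each similarity would usually be multiplied by the user's rating of item i before summing.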

Posted by centerwork on Mon, 28 Jan 2019 23:15:14 -0800