Are the following descriptions correct about the HBase file storage module (HBase FileStream, HFS for short)?
Compared with open-source Sqoop, what are the enhanced features of Loader?
In the Output stage, Structured Streaming can define different data writing methods, including which of the following methods?
A. Append Mode
B. Update Mode
C. General Mode
D. Complete Mode
The DataNode is the worker node of HDFS. Which of the following are its functions? (multiple choice)
What are the main characteristics of big data analysis related technologies?
Which of the following belongs to DDL (Data Definition Language) in Hive SQL?
Which of the following components does the platform architecture of Huawei's big data solution include?
Which of the following statements about CarbonData in FusionInsight is correct?
After a topology is submitted using the Streaming client shell command in the FusionInsight HD system, the Storm UI shows that the topology has not processed data for a long time. What are the possible reasons?
Which data needs to be read to perform an HBase read operation?
In the FusionInsight HD product, which statement about the Kafka component is correct?
Which of the following OS versions is recommended for building a FusionInsight V1R2C60 cluster?
The following figure shows the label storage strategy of HDFS. Based on the figure, on which DataNodes will HBase data be stored?
ZooKeeper enhancements include audit logs for adding ephemeral nodes and audit logs for deleting them.
The ResourceManager adopts a high-availability scheme. When the active ResourceManager fails, the standby ResourceManager can only be started and switched to the active state through the built-in ZooKeeper.
Which of the following conversion rules are supported by Loader jobs? (multiple choice)
The FusionInsight tool is a set of health check tools provided for technical support engineers and maintenance engineers. It can check the health status of cluster-related nodes and services, discover potential problems in the cluster in advance, and generate health check reports, so that technical support engineers and maintenance engineers can quickly understand the health status of the system.
On the FusionInsight Manager interface, when a Kafka insufficient disk capacity alarm is received and disk hardware failure has been ruled out as the cause of the alarm, the system administrator needs to consider capacity expansion to solve the problem.
In the MRS interface, Loader can specify a variety of data sources, configure data cleaning and conversion steps, configure cluster storage systems, etc.
Spark divides Stages according to the dependencies of RDDs. The scheduler starts from the end of the DAG graph and traverses the entire dependency chain in reverse. When encountering narrow dependencies, it is disconnected, and when encountering wide dependencies, it is added to the current Stage.
The overall process of a Kafka Producer reading data is that the Producer connects to any surviving Broker, requests the leader metadata of the specified topic and partition, and then connects directly to the corresponding Broker to publish the data.
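The connection flow named in the statement above (contact any surviving Broker, ask for the partition leader, then talk to that leader directly) can be modeled as a toy simulation. This is only a sketch: the `Broker` and `Cluster` classes and the `publish` function are hypothetical names made up for illustration, not the Kafka client API, and no real network calls are made.

```python
from dataclasses import dataclass, field


@dataclass
class Broker:
    """Hypothetical in-memory stand-in for a Kafka broker."""
    broker_id: int
    alive: bool = True
    # (topic, partition) -> list of messages stored on this broker
    log: dict = field(default_factory=dict)


class Cluster:
    """Hypothetical cluster model: brokers plus a leader map."""

    def __init__(self, brokers, leaders):
        self.brokers = {b.broker_id: b for b in brokers}
        self.leaders = leaders  # (topic, partition) -> leader broker_id

    def any_alive_broker(self):
        # Step 1: the client may bootstrap from any surviving broker.
        for broker in self.brokers.values():
            if broker.alive:
                return broker
        raise RuntimeError("no surviving broker")

    def fetch_leader(self, via, topic, partition):
        # Step 2: ask the bootstrap broker for the partition leader.
        if not via.alive:
            raise RuntimeError("bootstrap broker is down")
        return self.brokers[self.leaders[(topic, partition)]]


def publish(cluster, topic, partition, message):
    bootstrap = cluster.any_alive_broker()
    leader = cluster.fetch_leader(bootstrap, topic, partition)
    # Step 3: send the message directly to the leader broker.
    leader.log.setdefault((topic, partition), []).append(message)
    return bootstrap.broker_id, leader.broker_id


# Usage: broker 1 is down, so broker 2 answers the metadata request,
# while broker 3 (the leader) receives the message.
demo_cluster = Cluster(
    [Broker(1, alive=False), Broker(2), Broker(3)],
    {("bills", 0): 3},
)
demo_result = publish(demo_cluster, "bills", 0, "record-1")
```

The point of the model is that the broker used for metadata lookup and the broker that stores the message need not be the same node.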
The Watermark mechanism in Flink is used to solve the out-of-order problem. In which of the following ways can a Watermark be generated?
HFS was introduced to address scenarios that require storing a large number of small files (below 10 MB) in HDFS while also storing some large files (above 10 MB).
Which of the following statements about Huawei's big data solution is correct?
Which of the following descriptions about the deployment of big data components on Kunpeng and x86 servers is correct?
What are the key features of Streaming in Huawei's big data product FusionInsight HD?
In the FusionInsight product, which statement about the Kafka component is correct?
Enterprise-level metadata management is provided under the DGC platform architecture. Data asset management visually supports drill-down, tracing, etc. Which module provides intelligent data search and operation monitoring through the data map, so as to visualize the data lineage and the panoramic view of data assets?
What methods or interfaces does Loader provide to implement job management?
Which of the following functions can the kafka-clustermirroring tool implement?
The FusionInsight HD cluster contains many kinds of services, and each service consists of several roles. Which of the following are roles of a service? ( )
Which of the following descriptions about the deployment of big data components on Kunpeng and x86 servers is correct?
In FusionInsight HD, which of the following components does Flink mainly interact with?
In an MRS cluster, which of the following components does Spark mainly interact with?
Which of the following statements about the read/write process of the leader node of Zookeeper after receiving the data change request is correct?
What component does HBase use by default as its underlying file storage system?
Which of the following factors contributed to the vigorous development of the era of big data?
When viewing the partition details of a Kafka topic, which command should be used?
In the FusionInsight Manager interface, which of the following options is not included in the operations on Loader?
Which of the following is not a role or service involved in the process of reading data in HBase?
Flink uses the ( ) interface for stream data processing and the ( ) interface for batch processing.
Which of the following descriptions about Hive log collection on the FusionInsight Manager interface is incorrect?
Which of the following operations cannot be recorded in the FusionInsight HD system audit log? ( )
In FusionInsight HD, an HBase client writes 10 records in a batch. A RegionServer node hosts two Regions of the table, A and B. Of the 10 records, 2 belong to A and 4 belong to B. How many RPC requests must be sent to this RegionServer to write these 10 records?
Which of the following descriptions of the Hive architecture in FusionInsight HD is incorrect?
In the FusionInsight HD system, which component does a Flume data flow not need to pass through within a node?
In the FusionInsight product, which of the following descriptions of Kafka topics are incorrect?
Which of the following descriptions about the basic operations of Hive SQL is correct?
In the task scheduling process of YARN, which of the following tasks is the ApplicationMaster responsible for?
Which of the following programming languages is Spark implemented in?
A. C
B. C++
C. Java
D. Scala
In order to ensure the reliability of snapshot storage of streaming applications, where are the snapshots mainly stored?
Given the existing setting server.channels=ch1, which of the following configurations correctly sets the Channel type to File Channel?
Regarding the Kafka alarm about insufficient disk capacity, which of the following analyses of the possible causes is incorrect?
Watermark is a mechanism proposed by Apache Flink to process EventTime window calculation, which is essentially a timestamp.
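The idea that a watermark is just a timestamp that trails the event times seen so far can be illustrated with a small sketch. It assumes a simplified bounded out-of-orderness rule (watermark = largest event time seen minus a fixed allowed delay); the class and function names are hypothetical and are not Flink's API, and Flink's own generator may differ in details.

```python
class BoundedOutOfOrdernessWatermark:
    """Hypothetical watermark tracker: a timestamp that lags the max event time."""

    def __init__(self, max_delay_ms):
        self.max_delay_ms = max_delay_ms
        self.max_event_time = None

    def on_event(self, event_time_ms):
        # Late (out-of-order) events never move the watermark backwards.
        if self.max_event_time is None or event_time_ms > self.max_event_time:
            self.max_event_time = event_time_ms
        return self.current()

    def current(self):
        if self.max_event_time is None:
            return None
        return self.max_event_time - self.max_delay_ms


def first_firing_index(event_times, window_end_ms, max_delay_ms):
    """Index of the first event after which the watermark reaches window_end_ms."""
    tracker = BoundedOutOfOrdernessWatermark(max_delay_ms)
    for index, event_time in enumerate(event_times):
        if tracker.on_event(event_time) >= window_end_ms:
            return index
    return None


# Usage: events arrive out of order (3000 after 4000); the window ending at
# 10000 ms is only considered complete once the watermark reaches 10000 ms.
demo_events = [1000, 4000, 3000, 9000, 12000, 15000]
demo_index = first_firing_index(demo_events, window_end_ms=10000, max_delay_ms=2000)
```

In this model an EventTime window is evaluated when the watermark passes the window end, which is how a plain timestamp can decide when out-of-order data is considered complete.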
LdapServer in Huawei's big data platform can support different types of operations such as query, update, and authentication.
TaskSlot in Flink is mainly used for resource isolation, including memory resources and CPU resources.
Which of the following statements about Kafka message delivery methods is correct?
Which of the following contents can be viewed in the Loader historical job record?
Flink is a unified computing framework that combines batch processing and stream processing. Its core is a stream data processing engine that provides data distribution and parallel computing.
As an authentication server center, Kerberos can provide unified authentication services to all services in the cluster and to customers' secondary-development applications.
During the system authentication process, all Kerberos credentials, including the user's password and the user's affiliated information, need to be obtained from ( ).
In Streaming, the exactly-once message reliability level is achieved through the ACK mechanism.
In the Flume architecture, a Source can be connected to multiple Channels.
The Streaming component of FusionInsight HD is developed based on the open-source Apache Storm and is a distributed offline computing framework.
Which of the following supervision (regulatory) scenarios can the big data platform be applied to?
A telecommunications company plans to launch big data services. The target services include customer grouping, historical bill analysis, real-time call charge analysis, and other services. Which of the following options is the most appropriate in terms of functionality and cost to meet the business needs?
The UNION ALL operator in Hive is used to combine the result sets of two or more SELECT statements. Duplicate values are not allowed in the result set.
Flume is a distributed, highly reliable, and highly available service used to efficiently collect, aggregate, and move large amounts of log data.
FusionInsight is Huawei's unified platform for enterprise-level big data storage, query, and analysis. It can help enterprises quickly build massive data information processing systems, and discover new value points and business opportunities through real-time and non-real-time analysis and mining of massive information data.