New Year Special Sale - Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: mxmas70

Home > Huawei > HCIA-Big Data > H13-711_V3.0

H13-711_V3.0 HCIA-Big Data V3.0 Question and Answers

Question # 4

What are the main features of the YARN capacity scheduler

A.

flexibility

B.

multiple tenancy

C.

Dynamically update configuration files

D.

Capacity Guarantee

Full Access
Question # 5

Are the following descriptions correct about the HBase file storage module (HBase FileStream, HFS for short)?

A.

Applied in the upper layer of Fusioninsight HD

B.

HFS encapsulates the interface between HBase and HDFS

C.

Provide functions such as file storage, reading, and deletion for upper-layer applications

D.

HFS is a separate module of HBase

Full Access
Question # 6

Compared with the open source Sqoop, what are the enhanced features of Loader

A.

High reliability

B.

high performance

C.

safety

D.

Graphical

Full Access
Question # 7
A.

redistributing stream

B.

one-to-one

C.

one-to-many stream

D.

distributingi flow

Full Access
Question # 8

In the Output stage, Structured Streaming can define different data writing methods, including which of the following methods?

AAppend Mode

B. Update Mode

C. General Mode

D. Ccomplete Mode

Full Access
Question # 9

The data node is the working node of HDFS. Which of the following are its functions? (multiple choice)

A.

Responsible for storing and reading data

B.

Data storage and retrieval are performed according to the scheduling of the client or the name node

C.

Records all operations for file creation, deletion, renaming, etC.

D.

Periodically send the namenode a list of its own stored blocks.

Full Access
Question # 10

What are the main characteristics of big data analysis related technologies?

A.

machine learning, full features

B.

Event correlation analysis behind the data

C.

Based on massive data

D.

based on exact samples

Full Access
Question # 11

Which of the following belongs to DDL (Data Definition Language) in Hive SQL?

A.

Modify table

B.

delete table

C.

build table

D.

data import

Full Access
Question # 12

Which of the following components does the platform architecture of Huawei's big data solution include?

A.

Hadoop layer

B.

GaussDB 200

C.

Datafarm layer

D.

Fusiolnght Manager

Full Access
Question # 13

Which of the following statements about CarbonData in Fusioninsight is correct?

A.

cArbon is also a high-performance analytics engine that integrates data sources with spark.

B.

cArbon uses a combination of lightweight compression and heavyweight compression to compress data, which can reduce data storage space by 60%-80% and greatly save hardware storage costs.

C.

arbon is a new Apache Hadoop native file format that uses advanced columnar storage, indexing, compression, and encoding techniques to improve computational efficiency to help accelerate data queries over petabytes of magnitude, and can be used for faster interactive queries.

D.

The purpose of using carbon is to provide ultra-fast responses to ad-hoc queries on big data.

Full Access
Question # 14

Which of the following windows can F1ink perform statistics on?

A.

time window

B.

sliding window

C.

session window

D.

Cout window

Full Access
Question # 15

Which of the following conversion rules can be implemented by Loader

A.

Null conversion

B.

splice conversion

C.

long time conversion

D.

Incremental conversion

Full Access
Question # 16

After submitting the topology using the Streaming client shell command in the Fusioninsight HD system, use Strom The UI view shows that the topology has not processed data for a long time. What are the possible reasons?

A.

Supervisor is the component that receives data in topology and then performs processing

B.

There is a logic error in the topology business, and it cannot run normally after submission

C.

The topology is too complex or the number of concurrent users is too large, resulting in workerThe startup time is too long, exceeding the waiting time of Supervisort

D.

The supervisor's slots resources are exhausted, and after the topology is submitted, the slots cannot be allocated to start the worker process.

Full Access
Question # 17

What are the following main functions of FusionInsightManager?

A.

data integration

B.

System Management

C.

safety management

D.

Service Governance

Full Access
Question # 18

Which parts of the data need to be read to execute the HBase data reading business?

A.

HLog

B.

MemStore

C.

HFile

D.

HMaster

Full Access
Question # 19

In the FusionlnsightHD product, which statement about the Kafka component is correct?

A.

When deleting TopicE, make sure that Kafkal's service configuration delete.topiC. enable is set to true

B.

Kafka installation and operation log storage path is /srv/Bigdata/kafka

C.

Unavailable ZooKeeper service will cause Kafka service to become unavailable

D.

Topic must be created using the admin user or the Kafka admin group user

Full Access
Question # 20

Which of the followingOSWhich version is recommended for building a Fusioninsight V1R2C60 cluster?

A.

SUSE11 SP1/SP2/SP3 for AMD64&Inter64

B.

CentOS6.6

C.

redhat-6.4-x86_64

D.

RedHat-6.5-x86_64

E.

RedHat-6.7-x86_64F Ubuntu6.3

Full Access
Question # 21

The following figure shows the label storage strategy of HDFS. Observe the figure below, which data nodes will HBasel data be stored on

A.

DataNode A

B.

DataNode B

C.

DataNode E

D.

DataNode F

Full Access
Question # 22

The nodes in the ElasticSearch cluster are divided into master and slave.

A.

True

B.

False

Full Access
Question # 23

Zookeeper enhancements include adding ephemeral nodes to audit logsDeleted audit logs.

A.

True

B.

False

Full Access
Question # 24

The ResourceManager adopts a high-availability scheme. When the Active resourcemanager finds a fault, it can only start the standby resourcemanager through the built-in zookeeper and switch its state to active.

A.

True

B.

False

Full Access
Question # 25

Select which of the following conversion rules are supported by Loader jobs? (multiple choice)

A.

Modulo conversion

B.

Null conversion

C.

splice conversion

D.

Add constant field

Full Access
Question # 26

Fusioninsight tool is a set of health detection tools provided for technical support engineers and maintenance engineers. It can check the health status of cluster-related nodes and services, discover potential problems in the cluster in advance, and generate health check reports. It is convenient for technical support engineers and maintenance engineers to quickly understand the health status of the system.

A.

True

B.

False

Full Access
Question # 27

On the Fusionlnsight Manager interface, when the kafka disk capacity insufficient alarm is received, and the cause of the read alarm has been eliminated from the disk hardware failure, the system administrator needs to consider expanding the capacity to solve the problem. )

A.

True

B.

False

Full Access
Question # 28

In the MRS interface, Loader can specify a variety of different data sources, configuration data cleaning and conversion steps, and configure cluster storage systems, etC. .

A.

TRUE

B.

FALSE

Full Access
Question # 29

Spark divides Stages according to the dependencies of RDDs. The scheduler starts from the end of the DAG graph and traverses the entire dependency chain in reverse. When encountering narrow dependencies, it is disconnected, and when encountering wide dependencies, it is added to the current Stage.

A.

True

B.

False

Full Access
Question # 30

The overall process of Kafka Produceri reading data is that the Producer connects to any surviving Broker, requests the leader metadata information of the specified topic and partition, and then directly connects with the corresponding Brokerl to publish the data.

A.

True

B.

False

Full Access
Question # 31

RedisThe commands in are case-sensitive.

A.

TRUE

B.

FALSE

Full Access
Question # 32

FlinkmiddleWatermarkmechanism useduntiedecidechaossequenceproblem.WatermarkCan it be produced in the following way?

A.

inheritassiknerWithPunctuateca termmarks

B.

Inherit assigner Timestamp WithWatermark

C.

Inherit get Current Watermark

D.

Inherit assignerWithPeriodicWatermarks

Full Access
Question # 33

The emergence of HFS solves the need to store a large number of small files (below 10MB) in HDFS. At the same time, it is necessary to store some mixed scenes of large files (above 10MB)

A.

True

B.

False

Full Access
Question # 34

Which of the following statements about Huawei's big data solution is correct?

A.

Farmer is a data service framework

B.

GaussDB is an open source database product

C.

Fusioninsight Manager is a distributed system management framework, administrators can control distributed clusters through multiple access points

D.

Fusioninsight HD is an enhanced version based on the open source big data software Hadoopl

Full Access
Question # 35

Which of the following descriptions about the deployment of big data components on Kunpeng and X86 servers is correct?

A.

No shortcomings in performance

B.

Single component (for example: HDFS) supports mixed deployment of Kunpeng server and X86 server

C.

Supports mixed deployment of Kunpeng servers and ordinary X86 servers in a single cluster

D.

Realize the autonomous control of some equipment

Full Access
Question # 36

What are the key features of Streaming in Huawei's big data product Fusioninsight HD?

A.

flexibility

B.

Scalability

C.

Disaster recovery capability

D.

message reliability

Full Access
Question # 37

In the Fusioninsight product, which statement is correct about the Kafka component?

A.

When creating a topic, the number of replicas must not be greater than the number of currently surviving Broker instances, otherwise the topic creation will fail

B.

When the Producer of Kafkal sends a message, it can specify which Consumer consumes the message

C.

Kafka will store metadata information in Zookeeper for

D.

After Kafka is installed, the sensitive data storage directory cannot be configured.

Full Access
Question # 38

Provide enterprise-level metadata management under the DGC platform architecture. Data asset management can visually support drilling, traceability, etC. Through the data map, which module can provide data intelligent search and operation monitoring to realize the data blood relationship and data panorama visualization of data assets?

A.

data development

B.

Data Asset Management

C.

Specification design

D.

data integration

Full Access
Question # 39

What processes are included in the HBase service of Fusioninsight HD?

A.

HMaster

B.

Slave

C.

HRegionServer

D.

Data Node

Full Access
Question # 40

What methods or interfaces does Loader provide to implement job management?

A.

WEB UI

B.

Linuxt command line

C.

REST interface

D.

Java API

Full Access
Question # 41

Which of the following functions can the kafka-clustermirroring tool implement?

A.

Kafka cluster data synchronization scheme

B.

Kafka data backup in a single cluster

C.

Kafka data recovery in a single cluster

D.

to all wrong

Full Access
Question # 42

Which way is correct to load data into Hive table?

A.

Load the file of the local path directly into the Hive table

B.

Hive supports insert into! The method of a single record. So you can insert a single record directly on the command line

C.

Insert result set from other table into Hive table

D.

Load files on HDFS into Hive tables.

Full Access
Question # 43

The Fusionlnsight HD cluster contains many kinds of services, and each service consists of thousands of roles. Which of the following are the roles of the service?( )

A.

HDFS

B.

NameNode

C.

DataNode

D.

Hbase

Full Access
Question # 44

Which of the following descriptions about the deployment of big data components on Kunpeng and x86 servers is correct?

A.

No shortcomings in performance

B.

Single component (eg HDFS) supports mixed deployment of Kunpeng server and x86 server

C.

Supports mixed deployment of Kunpeng servers and common x86 servers in a single cluster

D.

Realize the autonomous control of some equipment

Full Access
Question # 45

In FusionlnsightHD, which of the following components does Flink mainly interact with

A.

zookeeper

B.

HDFS

C.

Kafka

D.

Yarr

Full Access
Question # 46

In an MRS cluster, which of the following components does Spark mainly interact with?

A.

Zookeeper

B.

Yarin

C.

Hive

D.

HDFS

Full Access
Question # 47

What is the physical storage unit of Region in HBasel

A.

Region

B.

ColumnFamily

C.

olumn

D.

Row

Full Access
Question # 48

Which object cannot be managed by Fusioninsight manager?

A.

Spark

B.

host OS

C.

YARN

D.

HDFS

Full Access
Question # 49

Which of the following statements about the read/write process of the leader node of Zookeeper after receiving the data change request is correct?

A.

Simultaneous writes to disk and memory

B.

Write to disk first, then write to memory

C.

write to memory only

D.

Write to memory first, then write to disk

Full Access
Question # 50

What component does HBase use by default as its underlying file storage system?

A.

File

B.

Kafka

C.

Memory

D.

HDFS

Full Access
Question # 51

Which of the following factors contributed to the vigorous development of the era of big data?

A.

Reduced hardware costs and increased network bandwidth

B.

The rise of cloud computing

C.

The popularization of smart terminals and the improvement of social demands

D.

all of the aboveA. True

Full Access
Question # 52

When viewing the partition details of a TopicE of Kafka, which command should be used?

A.

bin/kafka-topiC. sh-create

B.

bin/kafka-topiC. sh -list

C.

bin/kafka-topiC. sh -describe

D.

bin/kafka-topiC. sh -delete

Full Access
Question # 53

In the Fusioninsight Manager interface, which of the following options is not included in the operation of the loader?

A.

Switch the active and standby Loader nodes

B.

Start the loader instance

C.

Configure loader parameters

D.

View loader service status

Full Access
Question # 54

Which of the following is not a role or service involved in the process of reading data in HBasei?

A.

HDFS

B.

Zookeeper

C.

HMaster

D.

HRegionServer

Full Access
Question # 55

Which of the following types of data is not semi-structured data?

A.

HTML

B.

XML

C.

two-dimensional table

D.

JSON

Full Access
Question # 56

In Hive, which of the following statements about partitions is incorrect

A.

There can be further partitions or buckets under the partition

B.

The data table can be partitioned by the value of a field

C.

Each partition is a directory

D.

The number of partitions is fixed

Full Access
Question # 57

Which of the following is not a role or service involved in the process of reading data in HBasei?

A.

HDFS

B.

HRegionServer

C.

LHMaster

D.

ZooKeeper

Full Access
Question # 58

The following descriptions about Kafkaf are wrong( )

A.

Used as the basis for activity streams and operational data processing pipelines

B.

Developed by Apache Hadoop and open sourced in 2011

C.

It has the characteristics of information persistence, high throughput, real-time, etC.

D.

Implemented using Scala, Java language

Full Access
Question # 59

F1ink in( )interface for streaming data processing,( )interface for batch processing?

A.

Datastream API, DataSet API

B.

Data batch API.DataStream API

C.

Stream API.Batch API

D.

Batch API, Stream API

Full Access
Question # 60

Which of the following descriptions about Hive log collection on the Fusioninsight Manager interface is incorrect?

A.

You can specify a specific user for log collection, for example, only download logs generated by UserA.

B.

You can specify a time period for log collection, for example, only collect logs from 2016-1-1 to 2016-1-10.

C.

You can specify an instance for log collection, for example, specify to collect metstore logs.

D.

The node IP can be specified for log collection, for example, only the logs of a certain IP can be downloaded.

Full Access
Question # 61

Which of the following operations cannot be recorded in the Fusioninsight HD system audit log( )

A.

delete service instance

B.

Start and stop the service instance

C.

Manually clear the camp

D.

Query history monitoring

Full Access
Question # 62

FusionlnsightHD uses the HBase client to write 10 pieces of data in batches. A Regionserver node contains 2 Regions of the table, A and B, respectively. Two of the 10 pieces of data belong to A. 4 belong to B. Clearly write How many RPC requests do I need to send to the Regionserver to enter these 10 pieces of data?

A.

1

B.

2

C.

3

D.

4

Full Access
Question # 63

Which of the following descriptions about Hive features is incorrect?

A.

Flexible and convenient ETL

B.

Only supports MapReduce computing engine

C.

Direct access to HDFS files and HBase

D.

Easy to use and easy to program

Full Access
Question # 64

What is wrong about the architecture description of Hive in Fusionlnsight HD?

A.

As long as one HiveServer is unavailable, the entire HiveEcluster is unavailable

B.

HiveServert is responsible for accepting client requests, parsing, executing HQL commands and returning query results

C.

MetaStore is used to provide raw data services and depends onDBServer

D.

At the same time, only one HiveServer is in Active state, and the other is in Standby state

Full Access
Question # 65

Which of the following scenarios does Hive not apply to?

A.

Non-real-time analysis. Such as log analysis, statistical analysis

B.

data mining. Such as user behavior analysis, interest analysis, regional display

C.

Data summary. such as clicks per user per day,Click to rank

D.

Real-time online data analysis

Full Access
Question # 66

Which of the following commands deletes files?

A.

dfs-clear

B.

dfs -del

C.

dfs -rm

D.

dfs -Is

Full Access
Question # 67

In the Fusioninsight HD system, which component does the flume data flow not need to pass through in the node?

A.

sink

B.

topic

C.

Source

D.

Channel

Full Access
Question # 68

Which of the following commands can be used to create node data?

A.

get/node

B.

create /node

C.

set/node data

D.

1s/node

Full Access
Question # 69

In the Fusioninsight product, about the Kafka topic, which of the following descriptions are incorrect?

A.

Each topic can only be divided into one partition (area)

B.

The number of Topic partitions can be configured at creation time

C.

The storage layer of each Partition corresponds to a log file, and the log file records all information data

D.

Each message published to Kafkab has a category, which is called Topic, which can also be understood as a queue for storing messages.

Full Access
Question # 70

The description of HBase's Region Split splitting process is incorrect( )

A.

The table will be suspended during Spliti

B.

Split splits a Region into two Regions in order to reduce the data size in Regiont

C.

The Region that is split during the Spliti process will suspend the service

D.

The Spliti process does not actually split the file, it just creates the reference file

Full Access
Question # 71

Which of the following descriptions about the basic operations of Hive SQL is correct?

A.

When loading data into Hive, the source data must be a path in HDFS

B.

To create an external table, you must specify location information

C.

Column delimiters can be specified when creating a table

D.

Create an external table using the external keyword. To create a normal table, you need to specify the internal keyword

Full Access
Question # 72

In the task scheduling process of YARN, which of the following is the task that ApplicationMastert is responsible for?

A.

Apply for and receive resources

B.

Set up the running environment for the task

C.

Allocate Container

D.

Start a Map or Reduce task

Full Access
Question # 73

Which of the following programming languages is Spark implemented in?

AC

B.C++

C. JAVA

D. Scala

Full Access
Question # 74

In order to ensure the reliability of snapshot storage of streaming applications, where are the snapshots mainly stored?

A.

In the memory of the JobManager

B.

In a single-machine database with high reliability

C.

in the local file system

D.

in HDFS

Full Access
Question # 75

How is the HBaseM master elected?

A.

Randomly selected

B.

Adjudicated by RegionServer

C.

Adjudication via Zookeeper

D.

HMaster is a dual-master mode and does not need to be adjudicated

Full Access
Question # 76

Existing server.channelsa=ch1, set the Channel type to File Channel, which of the following configurations is correct?

A.

server.channels.ch1.type file

B.

server.channels.chl.type memory

C.

server.channels.type memory

D.

server.channels.type file

Full Access
Question # 77

Regarding the alarm about insufficient disk capacity of Kafkat, which of the following analysis is incorrect for the possible reasons?

A.

The disk configuration used to store Kafka data (such as the number of disks, size, etC. ) cannot meet the current business data stream declaration, resulting in the disk usage reaching the upper limit

B.

The data storage time is configured too long, and the accumulated data reaches the upper limit of the disk usage.

C.

Unreasonable business planning results in uneven data distribution and makes some disks reach the upper limit of usage

D.

Caused by the failure of the rocker node

Full Access
Question # 78

How many shards does an index library of ElasticSearchl have by default?

A.

5

B.

6

C.

3

D.

4

Full Access
Question # 79

Watermark is a mechanism proposed by Apache Flink to process EventTime window calculation, which is essentially a timestamp.

A.

True

B.

False

Full Access
Question # 80

LdapServe in Huawei's big data platform can support different types of operations such as query, update, and authentication.

A.

True

B.

False

Full Access
Question # 81

The topology will automatically end running after the task is completed.

A.

True

B.

False

Full Access
Question # 82

TaskSlot in Flink is mainly used for resource isolation, including memory resources and CPU resources

A.

True

B.

False

Full Access
Question # 83

The following offin Kafka messagesWhich one of the speed transmission methods is still correct?

A.

Postingonesubscriptioninformationsystem, the sameNumber of barsData can be consumed by multiple consumers. data isConsumptionnot laterdelete immediately

B.

Distributed Messaginghand overThere are two mainwantofmessage passing pattern,peer to peertransferpattern, haircloth-subscription model

C.

point-to-pointinformation systemmedium, cancanhave multiple consumptionsat the same timeremovefeedata, becauseThis does not guarantee the order in which data is processed.

D.

In a point-to-point messaging system, when a messagefeeByremovefeeteamone of the columnsdataAfter that, thedata rulefromdelete from message queue

Full Access
Question # 84

Which of the following contents can be viewed in the Loader historical job record?

A.

job status

B.

Job start/run time

C.

Error lines/number of files

D.

dirtydata link

Full Access
Question # 85

F1ink is a unified computing framework that combines batch processing and stream processing. Its core is a stream data processing engine for data distribution and parallel computing.

A.

True

B.

False

Full Access
Question # 86

As an authentication server center, Kerberos1 can provide unified authentication services to all services in the cluster and secondary development applications of customers.

A.

True

B.

False

Full Access
Question # 87

systemDuring the authentication process, all Kerbero evidence,BagContains the user's password. The user's affiliated information needs to be downloaded from( )Obtain.

Full Access
Question # 88

In Streaming, exactly one message reliability level is achieved through the ACK mechanism.

A.

True

B.

False

Full Access
Question # 89

Spark Streaming has higher real-time performance than Storm.

A.

True

B.

False

Full Access
Question # 90

In the flume architecture, a Source can be connected to multiple channels.

A.

True

B.

False

Full Access
Question # 91

What types of data sources does F1ink stream processing include?

A.

Socket streams

B.

JDBC

C.

Files

D.

Ccollections

Full Access
Question # 92

The Streaming of Fusioninsight HD is developed based on the open source Apache Storm, which is a distributed offline computing framework.

A.

True

B.

False

Full Access
Question # 93

Which of the following can the big data platform be applied to?Supervisionclass scene?

A.

Public Security Network Supervisor

B.

foodTraceability

C.

Yusituation monitoring

D.

satellite remote sensing

Full Access
Question # 94

A telecommunications companyplanopenexhibitionBigdatabusiness,target businesshavecustomer grouping,calendarHistorical bill analysis, real-time call charge analysis and other services. Which of the following options is the most appropriate in terms of functionality and cost to meet business needs?

A.

Deploy Soark separately

B.

Deploy Map Rabace

C.

Deploy Storm

D.

Deploy MapReduce with Stare

Full Access
Question # 95

The UNION ALL operator in Hive is used to combine the result sets of two more SELECT statements. Duplicate values are not allowed in the result set.

A.

True

B.

False

Full Access
Question # 96

Fl?Me is akind of distributed. highreliable and highavailableofClothesservice. useto be effectivecompare?,polymerization?Move a lot of logs? ?.

A.

TRUE

B.

FALSE

Full Access
Question # 97

Fusioninsight is Huawei's unified platform for enterprise-level big data storage, query, and analysis. It can help enterprises quickly build massive data information processing systems, and discover new value points and business opportunities through real-time and non-real-time analysis and mining of massive information data.

A.

True

B.

False

Full Access