Winter Special Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: myex65

Home > Huawei > HCIP-Big Data > H13-723_V2.0

H13-723_V2.0 HCIP-Big Data Developer V2.0 Question and Answers

Question # 4

existKafkain, aboutConsumerWhich statement is correct? (multiple choice)

A.

ConsumerMainly for data consumption

B.

ConsumerNeed to specify when consuming dataTopic

C.

multipleConsumerconfigure the samegroup.idwhen theseConsumerConsumed data will not be duplicated

D.

newConsumerAPIConsumption data does not need to be specifiedZooKeeperaddress

Full Access
Question # 5

Suppose there is an application that needs to be accessed frequentlyOracleThe user table in the database, in order to improve performance, introduce

Redisto cache user information. For this scene,RedisWhich of the following is the best data structure choice for ?

A.

List

B.

Set

C.

Hash

D.

Sorted set

Full Access
Question # 6

existFusionInsight HDin, useSparkSQL, which of the following methods (or tools) can be used to performSQLstatement?

(multiple choice)

A.

JDB, C

B.

spark-beeline

C.

spark-sql

Full Access
Question # 7

HDFSIt adopts a "write once, read many" file access model. So it is recommended that a file be created, written and

After closing, do not modify it again.

A.

True

B.

False

Full Access
Question # 8

existStreamingin application development,BoltUse which of the following interfaces to sendTuple?

A.

to emit

B.

execute

C.

open

D.

nextTuple

Full Access
Question # 9

HDFSclient withNcopy toHDFSWhen writing a file, if one of the replicas fails to write, all replicas will return to writing

Entry failed.

A.

True

B.

False

Full Access
Question # 10

FusionInsight HDin, aboutHive UFDSecondary development, is the following description correct? (multiple choice)

A.

user definedUDFBefore being used, it needs to beHivecreate this in the systemUDonly

B.

user definedUDF, information such as summary and status is not allowed.

C.

user definedUDF, can be added according to the actual situationdeterministicandstatefulletc. Notes.

D.

Under the secure cluster, user-definedUDF, it is recommended to create it once before each use.

Full Access
Question # 11

.existSparkmiddle,SparkSQLis an independent module that does not depend onSparkCorefinish independentlySQLSentence parsing, optimization

operations such as transformation and execution.

A.

True

B.

False

Full Access
Question # 12

RDDasSparkThe core object, which has the following characteristics? (multiple choice)

A.

read only

B.

partition

C.

fault tolerance

D.

efficient

Full Access
Question # 13

A project needs to save the Internet access data in a certain area, and search the full text of these Internet access records to see if there is any sensitive data.

Sensitive information is used to prevent crimes in this area. In this scenario, which of the following options is the best?

A.

Create an index on solr, save the data, and return all the data during full-text search

B.

Store data on HBase, and satisfy fuzzy matching queries through HBase's filtering feature

C.

When storing data, build an index on solr, and store complete data in HBase: query full text through solr

D.

Retrieve the key information of the record, and obtain the complete record on HBase through the key information

Full Access
Question # 14

set the block storage size to128M,HDFSWhen the client is writing a file, when writing a100M

How much storage space does a file of this size actually occupy?

A.

128M

B.

100M

C.

64M

D.

50M

Full Access
Question # 15

existFusionInsight HDofHBase, which of the following scenarios will not triggerFlushoperate?

A.

When the client initiates a scan request, it willscan cachingset too large

B.

Initiate by client-specified methodFlushdo

C.

RegionServerTotal memory exceeds threshold

D.

RegionofMemStoreset over threshold

Full Access
Question # 16

FusionInsight HD V100R002C60in, aboutHiveofPythonInterface type, which of the following descriptions is incorrect?

A.

RequirePythonversion no lower than2.6.6,not higher than2.7.0

B.

Client must havesetuptools, and the version is not lower than5.0

C.

hive_python_clientscript provides direct executionSQLfunctions, such ashive_python_client'show tables'

D.

PythonSecondary development does not support secure clusters

Full Access
Question # 17

FusionInsight HDofHiveIn the application, there are the following scenarios:? ? ?Storage files have higher? ?efficiency, and most

Minute? ?Only a part of the letter is involved in the file, this scenario is suitable for using a column file (ORC F??)storage.

A.

True

B.

False

Full Access
Question # 18

existFusionInsight HDin, aboutHivepartition (partition) function, which is wrong as described below?

A.

Partition fields are defined when the table is created

B.

There can only be one partition field, and multi-level partitions cannot be created

C.

Using partitions can reduce the data scan range of some queries, thereby improving query efficiency

D.

Partition fields can be used aswherecondition of the sentence

Full Access
Question # 19

existFusionInsight HDmiddle,SparkBy defaultYARNAs a cluster resource management system.

A.

True

B.

False

Full Access
Question # 20

FusionInsight HDmiddle,StreamingWhich of the following scenarios is applicable? (multiple choice)

A.

Streaming data monitoring

B.

Real-time visit statistics of the website

C.

Offline log analysis

D.

Traffic flow analysis

Full Access
Question # 21

writingMapReduceWhich two interfaces are usually required to be implemented by developers?

A.

mapandcombine

B.

reduceandcombine

C.

combineandsort

D.

mapandreduce

Full Access
Question # 22

existKafka, which of the following commands can view aTopicHow many partitions are there?

A.

Kafka-topics.sh–describe

B.

Kafka-topics.sh–list

C.

kafka-topics.sh–alter

D.

kafka-topics.sh–create

Full Access
Question # 23

Which of the following causesHDFSofNameNodeEntersafemode(safe read-only mode)? (multiple choice)

A.

Active and standbyNameNodeThe disk space where the metadata is located is insufficient.

B.

The number of missing blocks exceeds the threshold.

C.

Lost replicas exceed the threshold.

D.

The corrupted replica exceeds the threshold.

Full Access
Question # 24

existSpark, the accumulator can realize high-speed parallel counter and variable summation; inSparkduring application development,

only inDriverGet the value of this counter on .

A.

True

B.

False

Full Access
Question # 25

existFusionInsight HDcluster, aboutkinitOperation command, which of the following statements is wrong? (multiple choice)

A.

Only the HMI account can be used.

B.

Only the machine account can be used.

C.

A client does not support the simultaneous use of multiple accounts.

D.

The ticket obtained by executing this command is intwenty fourIt will time out when it is small and needs to be executed againkinitcommand to log in again.

Full Access
Question # 26

YarnWhich role is to manage individual node resources (CPU/Memory)of?

A.

NodeManager

B.

Resource Manager

C.

Data Node

D.

NameNode

Full Access
Question # 27

when aMapReduceWhen the application is executed, which of the following actions ismapoccurred before the stage?

A.

split

B.

combine

C.

partition

D.

sort

Full Access
Question # 28

useFusionInsight HDofHiveQuery data, when the amount of data in the intermediate results of the query is large, you can choose to compress the intermediate data

Structured data for better performance.

Which of the following areHiveRelevant parameters for intermediate result compression? (multiple choice)

A.

hive.exec.compress.intermediate

B.

hive.intermediate.compression.codec

C.

hive.exec.compress.output

twenty three

D.

hive.intermediate.compression.type

Full Access
Question # 29

In the online log query scheme, the?processing to complete the calculation work. During the whole calculation process, the intermediate calculation results need to be

For temporary storage, which of the following components are suitable for storing intermediate calculation results? (multiple choice)

A.

HDfS

B.

HBase

C.

Kafka

D.

Redis

Full Access
Question # 30

Which of the following measures can improveHBasequery performance? (multiple choice)

A.

pre-scoreregion,MakeregionEvenly distributed, improve concurrency

B.

Query frequent attributes are placed inrowkeyThe front part

C.

Use secondary indexes to adapt to more query scenarios

D.

Data read at the same time are stored in the samecellmiddle

Full Access
Question # 31

existFlumeDuring cascaded transfers, you can usefail overmode transfer, so that if the next hop isFlumenode failure or

When the data is received abnormally, it can automatically switch to another way to continue transmission.

A.

True

B.

False

Full Access
Question # 32

HBaseFilters can set column names or column values as filter conditions, and support the use of multiple filters at the same time.

A.

True

B.

False

Full Access
Question # 33

In useSolrWhen performing a full-text search, you canwtThe parameter specifies the response format of the query result.

aboutSolrThe response format of the query result, which of the following statements is wrong?

A.

supportCSVandJSON

B.

supportCSV,JSONandHTML

C.

supportCSV,JSONandXML

Full Access
Question # 34

which come downRedisIs the command an atomic operation?

A.

INCR(increment the stored digital value by one)

B.

HINCRBY(for the hash tablekeydomain infieldvalue plus incrementincrement)

C.

LPUSH(for linked listspushoperate)

D.

All of the above

Full Access
Question # 35

FusionInsight HDsystematicV100R002C60version,HiveOnly supports based onMapReduceEngine query service,

not supported based onSparkEngine query service.

A.

True

B.

False

Full Access
Question # 36

FlumewriteHDFSWhen the file is generated, what are the ways of generating the file? (multiple choice)

A.

Generate files directly according to the source file name

B.

Set the file generation time interval, close the old file when the time interval is reached, and the new data will be written to the new file

C.

Set the size of a single file, when the file size reaches the set value, the old file will be closed, and the new data will be written to the new file

D.

Set up single file storageeventquantity, when the file is storedeventWhen the number reaches the set value, the old file will be closed, and the new data will be

is written to a new file

Full Access
Question # 37

FlinksupportLocalpattern andClusterpattern deployment(and cloud deployment), other deployment modes are not currently supported.

A.

True

B.

False

Full Access
Question # 38

Fusionlnsigt HD one deployed inTomcatapplication on theHBaseservice, it is recommended to use in this scenario machine account.

A.

True

B.

False

Full Access
Question # 39

FusionInsight HDproduct, aboutKafkaThe component statement is correct?

(multiple choice)

A.

KafkaofProducerWhen sending a message, you can specify which message is sent byConsumerConsumption.

B.

while creatingTopc, the number of replicas must not be greater than the currently survivingBrokerThe number of instances, otherwise createTopicWill will fail.

C.

KafkaAfter the installation is complete, you can no longer configure the data storage directory.

D.

KafkaMetadata information will be stored inZooKeepersuperior.

Full Access
Question # 40

In ThereNnodesFusionInsightHDDeploy in a clusterHBase, it is recommended to deploy()HMaster process, ( )RegionServerprocess?

A.

2, 2

B.

3, N

C.

2, N

D.

N, N

Full Access
Question # 41

FusionInsight HDWhat are the supported ways to submitOozieOperation? (multiple choice)

A.

ShellOrder

B.

Java API

C.

Hue UI

D.

Rest API

Full Access
Question # 42

FusionInsight ManagerRegarding the management operations of services, which of the following statements is wrong?

A.

Can start, stop and restart the service

B.

Services can be added and uninstalled.

C.

Uncommon services can be set to hide or show

D.

Can view the current status of the service

Full Access
Question # 43

FusionInsight HDin, aboutHivethe data load function (viaHiveofLOADcommand guide

input data), which of the following descriptions is wrong?

A.

available fromHDFSdirectly into thehivesurface

B.

available fromHiveServerThe local hard disk of the node is directly loaded into thehivesurface

C.

It can be directly loaded from the local hard disk of the node where the client is located.hivesurface

D.

HiveThe data loading process does not parse the specific file content, mainly the process of file transfer

Full Access
Question # 44

existMapReduceDuring application development,setMapOutputCompressorClassWhat is the role of classes?

A.

set upmapinput compression method

B.

set upmapHow the output is delivered

C.

set upmapoutput compression method

D.

set upreduceoutput compression method

Full Access
Question # 45

existFusionInsight HDproduct,KafkaWhat is the role name of the service?

A.

Producer

B.

Broker

C.

Consumer

D.

ZooKeeper

Full Access
Question # 46

HDFS"Write once, read many"__________File access model. Therefore, it is recommended that a document

A.

After creating,

B.

writing and closing,

C.

do not modify it again.

D.

right or wrong

Full Access
Question # 47

existHBaseWhich of the following interfaces or classes does not need to be involved in the implementation of business logic for writing data?

A.

Put

B.

HTable

C.

HBaseAdmin

D.

PutList

Full Access
Question # 48

existSpark, which of the following statements about broadcast variables is correct? (multiple choice)

A.

broadcast variable in eachTasksave a copy

B.

broadcast variable in eachExecutorsave a copy

C.

Broadcast variables are read-only

D.

broadcast variable viaSparkContext.broadcast) method to generate

Full Access
Question # 49

Which of the following causesHDFSofNameNodeEntersafemode(safe read-only mode)?

(multiple choice)

A.

Active and standbyNameNodeof metadata is running out of disk space.

B.

The number of missing blocks exceeds the threshold.

C.

Lost replicas exceed the threshold.

D.

The corrupted copy exceeds the threshold.

Full Access
Question # 50

aboutKafkaThe characteristics of the following description are correct? (multiple choice)

A.

KafkaIt is a high-throughput, distributed, publish-subscribe-based messaging system

B.

KafkaPersistence of messages

C.

KafkaApplicable to offline and online message consumption scenarios

D.

Kafkaguarantee eachPartitionmessages in order

Full Access
Question # 51

FusionInsght ManagerWhat interfaces are supported when interfacing with external management platforms?

(multiple choice)

A.

SNMP

B.

VPN

C.

BGP

D.

syslog

Full Access
Question # 52

FusionInsight HD HBasein the cluster,Table1belongNamespace1,Table2belong

Namespace2,Table1There are two column families, namelycf11,c12,Table2There is a column family namedcf21,

Which of the following options will allow the user accountAalso havecf11andcf21read and write permissions. (multiple choice)

A.

Assign this user accountglobalread permission

B.

Give this user accountNamespace1read and write permissions

C.

Assign this user accountTable1andTable2read and write permissions

D.

Assign this user accountNamespace1andNamespace2read and write permissions

Full Access
Question # 53

Fusionlngight HDin the system,SolkaExample projectsdk-sxamgelconlf\udlr-eamgple

propertyamiddleZX.URLandZ0OKEEPER DEFAULT SERVER PRINCIPLThe value can be obtained from the following

Which is the quickest pick?

A.

fromSolr Adimin Ulinterface acquisition

B.

existsdr-ecawnpletconftsolr-esamole-properts, has always defaulted to these two parameters

C.

fromFuwnloninuight Manager SoService configuration interface acquisition

Full Access
Question # 54

useHBaseClient batch write10piece of data, aHRegionServercontains the table on the node

of2indivualRegion, respectivelyAandB,10in the data2Article belongs toA,4Article belongs toB, please write this

10pieces of data need to be sent to theHRegionServersend several timesRPCask?

A.

1

B.

2

C.

6

D.

10

Full Access
Question # 55

FusionInsight HDin the systemHDFSdefaultBlock Sizehow many?

A.

32MB

B.

64MB

C.

128MB

D.

256MB

Full Access
Question # 56

FusionInsight HDmiddle,StreamingPackaging tools are used to package business codejarpackages and other dependenciesjarpackage, etc.

a completeStreamingAppliedjarBag

A.

True

B.

False

Full Access
Question # 57

FusionInsight HD assuming a topology, set the roadspoutConcurrency is3,bolt1Concurrency

for2,bolt2Concurrency is3,workerThe number is2,Sobolt1ofexecutorexistworkerhow to divide

Cloth?

A.

twoworkerone of theworkerdistribute two

B.

twoworkerone for each

C.

no fixed rules

D.

twoworkerBoth are distributed

Full Access
Question # 58

passing throughFusionindight ManagerWhen the interface creates a user, you want the user to have access to allTopichave

Read and write permissions, but not createTopicpermissions should be addedkafkaWhich group of services?

A.

KafkaadminGroup

B.

KakfaGroup

C.

kafkasuperuserGroup

Full Access
Question # 59

existFusionInsight HDmiddle,FlumeWhich of the following are supportedsourceTypes of? (multiple choice)

A.

Avro

B.

HBase

C.

Kafka

D.

HTTP

Full Access
Question # 60

existFlumemiddle,sourceWhat is the main function of the function module?

A.

Get data and convert raw data into data objects that you process yourself

B.

Cache data and save data in memory or files according to different reliability policies

C.

Output data to the destination, support multiple output protocols

D.

Split the data and send the data to different destinations according to the characteristics of the data

Full Access
Question # 61

FusionInsight HDofLoader, a connector can only be assigned to one job.

A.

True

B.

False

Full Access
Question # 62

aboutKafkaInsufficient disk capacity alarm, which of the following analysis is incorrect for the possible reasons?

A.

The disk configuration (such as the number of disks, disk size, etc.) used to store Kafka data cannot meet the current industry requirements.

service data traffic, causing the disk usage to reach the upper limit

B.

The data storage time is configured too long, and the accumulated data reaches the upper limit of the disk usage.

C.

Unreasonable business planning results in uneven data distribution and some disks reach the upper limit of usage

D.

Broker node failure caused

Full Access
Question # 63

existSparkmiddle,SparkSQLis an independent module that does not depend onSparkCoretstandtasQaon.eaR. operation, etc.

A.

True

B.

False

Full Access
Question # 64

useFusionInsight HDofHiveQuery data, when the intermediate result of the query has a large amount of data.

You can choose to compress the network result data to obtain better performance. Which of the following areHiveCorrelation of intermediate result compression

parameter? (multiple choice)

A.

hive.exec.compress.intermediate

B.

hive.intermediate.compression.codec

C.

hive.exec.compress.output

D.

hive.intermediate.compression.type

Full Access
Question # 65

Which of the following descriptions about massive small file storage is correct ( )?

(multiple choice)

A.

HDFSStoring a large number of small files will giveNameNodebring great pressure.

B.

HBaseStore a large number of small files,Compactionwill bringIOWaste of resources.

C.

HuaweiHFSIt is suitable for storing a large number of small files, and files can be selectively stored inHDFSorMOBmiddle.

D.

The above statement is not correct.

Full Access
Question # 66

SparkThe calculation logic of the application will be parsed intoDAG, this parsing operation is completed by which of the following function modules

become?

A.

Client

B.

ApplicationMaster

C.

Executor.

D.

Driver

Full Access
Question # 67

FlinkThe two key elements of the program arestreamdata andtransformationoperator.

A.

True

B.

False

Full Access
Question # 68

existHBaseWhich of the following interfaces or classes does not need to be involved in the implementation of business logic for writing data?

A.

Put

B.

HTable

C.

HBaseAdmin

D.

Puttist

Full Access
Question # 69
A.

The port preferentially downloads data from the nearest DataNode.

B.

The port file data is first returned from the DataNode to the NameNode, and then downloaded from the NameNode to the client.

C.

If the client and the connected DataNode fail while reading, the client will abandon the failure

node, to connect to the node where its replica is located.

D.

The port supports multiple clients reading the same file data from the DataNode at the same time.

Full Access
Question # 70

Which of the following is notHBaseThe roles or services involved in the read data process? (Writing isHMaster)

A.

ZooKeeper

B.

HDFS

C.

HMaster

D.

RegionServer

Full Access
Question # 71

forFusionInsight HDplatformHBasecomponent, adding a secondary index needs to define the secondary index Which properties? (multiple choice)

A.

index name

B.

index column

C.

Index column type (and index column length)

D.

the name of the column family to which the indexed column belongs,

Full Access
Question # 72

InstallFusionInsight HDofStreamingcomponents,NimbusHow many nodes does the role require to install?

A.

1

B.

2

C.

3

D.

4

Full Access
Question # 73

existFusionInsight HDclient, executeskinit{account number}command is to getKDCwhich of the item content?

A.

Krb5

B.

conf

C.

TGT

D.

jaas.conf

Full Access
Question # 74

existFusionInsight HDproduct,KafkaWhat is the role name of the service?

A.

Producer

B.

Broker

C.

Consumer

D.

ZooKeeper

Full Access
Question # 75

Rcboft0system, abouteWhich of the provided file formats is not in column format?

A.

ORC(High-performance column store)

B.

Parquetcolumn store

C.

RCFileRow-column storage

D.

TextFile

Full Access
Question # 76

FusionInsight HDin, aboutOoziejob management for suspended jobs

Which of the following operations? (multiple choice)

A.

mouthkill

B.

mouthrun

C.

mouthresume(wake)

D.

mouthsubmit

Full Access
Question # 77

FusionInsight HDin, aboutHivethe data load function (viaHiveofLOADcommand guide

input data), which of the following descriptions is wrong?

A.

available fromHDFSdirectly into thehivesurface

B.

available fromHiveServerThe local hard disk of the node is directly loaded into thehivesurface,

C.

It can be directly loaded from the local hard disk of the node where the client is located.hivesurface

D.

HiveThe data loading process does not parse the specific file content, mainly the process of file transfer

Full Access
Question # 78

HBasedata fileHFileone ofKeyValueWhat information does the format contain?

(multiple choice)

A.

Key

B.

Value

C.

timeStamp

D.

KeyType

Full Access