H13-723_V2.0 HCIP-Big Data Developer V2.0 Questions and Answers

Questions 4

Which of the following scenarios is Hive not suitable for?

Options:

A.

Real-time online data analysis.

B.

Data mining, such as user behavior analysis, interest division, regional display.

C.

Data summary, such as daily and weekly user clicks, click ranking.

D.

Non-real-time analysis, such as log analysis, statistical analysis.

Questions 5

Which components in the FusionInsight HD platform support table and column encryption?

(multiple choice)

Options:

A.

HDFS

B.

HBase

C.

Hive

D.

Flink

Questions 6

In FusionInsight HD, in which ways can you view the debugging results of an Oozie job? (multiple choice)

Options:

A.

Check whether the result returned by the Oozie Java API is as expected

B.

View the program's running results through the workflow Dashboard in Hue

C.

Use Hue's file browser to check whether the expected files are produced in the specified HDFS directory

D.

View the job execution results in Oozie's built-in web interface

Questions 7

Multiple channels can be configured in Flume's properties.properties configuration file to transmit data.

Options:

A.

True

B.

False

Questions 8

In FusionInsight HD, assume a topology in which the spout concurrency is 3, the bolt1 concurrency is 2, the bolt2 concurrency is 3, and the number of workers is 2. How are the executors of bolt1 distributed among the workers?

Options:

A.

Both are placed on one of the two workers

B.

One on each of the two workers

C.

No fixed rule

D.

Distributed on both workers

Questions 9

In Spark, which of the following is the DataFrame operator that takes the intersection?

Options:

A.

filter(conditionExpr: String): DataFrame, which filters data based on a condition

B.

join(right: DataFrame, joinExprs: Column): DataFrame, which joins according to the specified expression

C.

intersect(other: DataFrame): DataFrame, which takes the intersection of two DataFrames

D.

dropDuplicates(): DataFrame, which removes duplicate data

Questions 10

Spark Streaming can receive data from Kafka and perform calculations, but the calculation results can only be stored in HDFS and cannot be written back to Kafka.

Options:

A.

True

B.

False

Questions 11

In Spark, assume lines is a DStream object and the filter statement filters out 80% of the data. Which statement about the following two expressions is correct?

X: lines.filter(...).groupByKey(...)

Y: lines.groupByKey(...).filter(...)

Options:

A.

X is more performant than Y

B.

X is less performant than Y

C.

X and Y have the same performance

D.

The performance difference between X and Y is uncertain

Questions 12

Which of the following cause the HDFS NameNode to enter safemode (safe read-only mode)? (multiple choice)

Options:

A.

The disk space where the metadata of the active and standby NameNodes is stored is insufficient.

B.

The number of missing blocks exceeds the threshold.

C.

The number of lost replicas exceeds the threshold.

D.

The number of corrupted replicas exceeds the threshold.

Questions 13

In FusionInsight HD, an application deployed on Tomcat accesses the HBase service. It is recommended to use a machine account in this scenario.

Options:

A.

True

B.

False

Questions 14

Which of the following is not a Flume channel type?

Options:

A.

Memory Channel

B.

File Channel

C.

JDBC Channel

D.

HDFS Channel

Questions 15

What information does a KeyValue in an HBase HFile data file contain?

(multiple choice)

Options:

A.

Key

B.

Value

C.

timeStamp

D.

KeyType

Questions 16

In FusionInsight HD, which of the following descriptions of Hive UDF secondary development are correct? (multiple choice)

Options:

A.

A user-defined UDF must be created in the Hive system before it is used.

B.

A user-defined UDF is not allowed to add information such as a summary or status.

C.

A user-defined UDF can add annotations such as deterministic and stateful as needed.

D.

In a secure cluster, it is recommended to create a user-defined UDF once before each use.

Questions 17

If a Loader job fails, the data imported while the job was running is not deleted automatically and must be deleted manually.

Options:

A.

True

B.

False

Questions 18

In FusionInsight HD, which of the following statements about the ACK mechanism of Streaming are true? (multiple choice)

Options:

A.

After Acker is enabled, Streaming identifies Tuples that failed to be sent and resends them automatically, without manual intervention.

B.

Acker marks messages whose processing timed out or failed as fail.

C.

If any link in the Tuple tree that starts from the spout fails, the entire tree is marked as failed.

D.

The application needs to implement the message resend logic in the fail() interface method of the spout.

Questions 19

Which of the following measures can improve HBase query performance? (multiple choice)

Options:

A.

Pre-split regions so that regions are evenly distributed, improving concurrency

B.

Place frequently queried attributes at the front of the rowkey

C.

Use secondary indexes to adapt to more query scenarios

D.

Store data that is read together in the same cell

Questions 20

Suppose an application needs to frequently access the user table in an Oracle database. To improve performance, Redis is introduced to cache account information.

For this scenario, which of the following Redis data structures is the best choice?

Options:

A.

list

B.

set

C.

hash

D.

sorted set

Questions 21

In Kafka, a Producer can ensure that data is sent in order by configuring the synchronization parameter (producer.type).

Options:

A.

True

B.

False

Questions 22

When designing a Solr collection, its schema must be designed, which is done by configuring the schema.xml file. Which of the following statements about the schema is wrong?

Options:

A.

The schema is similar to a table structure in a relational database. It defines the structure of the index data and mainly consists of three parts: uniqueKey, Field, and FieldType

B.

For fields whose original value does not need to be stored, the stored attribute can be set to false

C.

If a field of the same document has multiple values, multiValued must be set to true

D.

copyField can be defined as follows:

<field name="cat" type="string" indexed="true" stored="true" docValues="true" multiValued="false"/>
<field name="dog" type="string" indexed="true" stored="true" docValues="true" multiValued="false"/>
<field name="text" type="text_general" indexed="true" stored="false" multiValued="false"/>
<copyFie

Questions 23

In FusionInsight HD, when using SparkSQL, which of the following methods (or tools) can be used to execute SQL statements?

(multiple choice)

Options:

A.

JDBC

B.

spark-beeline

C.

spark-sql

Questions 24

In FusionInsight HD, assume a topology in which the spout concurrency is 3, the bolt1 concurrency is 2, the bolt2 concurrency is 3, and the number of workers is 2. How are the executors of bolt1 distributed among the workers?

Options:

A.

Both are placed on one of the two workers

B.

One on each of the two workers

C.

No fixed rule

D.

Distributed on both workers

Questions 25

In a FusionInsight HD cluster, which service does Flume not support writing collected data to?

Options:

A.

HDFS

B.

HBase

C.

Kafka

D.

Redis

Questions 26

Which of the following is the core function of Yarn?

Options:

A.

storage

B.

Resource management

C.

search

D.

data transmission

Questions 27

In FusionInsight HD, which method of the OozieClient class does the Oozie client's Java API call to run a task?

Options:

A.

suspend

B.

getJobInfo

C.

submit

D.

run

Questions 28

In FusionInsight HD, a Flume configuration file can contain multiple sources, but the source names cannot be repeated.

Options:

A.

True

B.

False

Questions 29

Which of the following table types does Hive support? (multiple choice)

Options:

A.

Partition Table

B.

Bucket table

C.

Skewed table

D.

Partition + bucket table

Questions 30

In FusionInsight HD, the Streaming command line is used to submit the com.huawei.example.WordCount task in example.jar, with the task name wcTest. Which of the following commands is correct?

Options:

A.

storm jar example.jar wcTest com.huawei.example.WordCount

B.

storm jar example.jar wcTest WordCount

C.

storm jar example.jar com.huawei.example.WordCount wcTest

D.

storm jar example.jar WordCount wcTest

Questions 31

For an application running on the MapReduce platform, where are the jar packages that the application depends on placed?

Options:

A.

HIVE

B.

HBASE

C.

HDFS

D.

DB

Questions 32

When writing a MapReduce program, which two interfaces do developers usually need to implement?

Options:

A.

map and combine

B.

reduce and combine

C.

combine and sort

D.

map and reduce

Questions 33

In the FusionInsight HD system, which of the following descriptions of Hive's JDBC interface is correct?

Options:

A.

In a network isolation environment, the ZooKeeper IP can be configured as either the service IP or the management IP.

B.

Hive is different from traditional databases, so connections established through JDBC do not need to be closed.

C.

In a secure cluster environment, you must log in through unified authentication before establishing a connection to Hive.

D.

When submitting SQL through JDBC, a semicolon ";" can be added after the SQL statement.

Questions 34

In FusionInsight HD, which roles belong to the Streaming service? (multiple choice)

Options:

A.

Nimbus

B.

Supervisor

C.

Broker

D.

quorumpeer

Questions 35

When an HDFS client writes a file with N replicas, which of the following statements about the writing process are true? (multiple choice)

Options:

A.

Each DataNode stores at most 1 replica.

B.

Multiple users can write to the same file at the same time.

C.

The first replica of a data block is placed preferentially on the node where the client writing the block is located.

D.

By default, all replicas of a file block are stored on the same rack.

Questions 36

Regarding Spark in FusionInsight HD, which of the following programming languages can be used to develop Spark applications? (multiple choice)

Options:

A.

C

B.

Scala

C.

Java

D.

Python

Questions 37

A project needs to save the Internet access data of a certain area and run full-text searches over these access records to check for sensitive information, which is used to prevent crime in the area. In this scenario, which of the following options is the best?

Options:

A.

Create an index on Solr, save the data there, and return all the data during full-text search

B.

Store the data on HBase, and satisfy fuzzy matching queries through HBase's filtering feature

C.

When storing data, build an index on Solr and store the complete data in HBase; query the full text through Solr

D.

Retrieve the key information of the record, and obtain the complete record from HBase through the key information

Questions 38

In which of the following is a MapReduce task ultimately executed?

Options:

A.

NodeManager

B.

container

C.

ResourceManager

D.

AppMaster

Questions 39

For HBase, which of the following properties can be set? (multiple choice)

Options:

A.

Versions

B.

TTL

C.

Coprocessor

D.

Compression

Questions 40

During MapReduce application development, what does setMapOutputCompressorClass do?

Options:

A.

Sets the compression method for map input

B.

Sets how map output is delivered

C.

Sets the compression method for map output

D.

Sets the compression method for reduce output

Questions 41

In FusionInsight HD, Oozie supports HDFS Action.

Options:

A.

True

B.

False

Questions 42

When the cluster is normal and a Redis client initiates one get call, how many message interactions occur between the client and the server?

Options:

A.

1

B.

2

C.

3

D.

4

Questions 43

An application needs to interact with two FusionInsight clusters at the same time: it needs to access the HBase service of cluster 1 and the Hive service of cluster 2.

Which of the following operations are required? (multiple choice)

Options:

A.

On cluster 1, create the account required by this application for the HBase service

B.

On cluster 2, create an account with the same name as in option A, and allow this account to access the cluster's Hive service

C.

The accounts created in options A and B must belong to the supergroup group

D.

Cluster 1 and cluster 2 complete the mutual trust operation

Questions 44

Neither Spark nor Hadoop is suitable for iterative calculation scenarios.

Options:

A.

True

B.

False

Questions 45

In Kafka, which of the following statements about the Producer sending data are wrong? (multiple choice)

Options:

A.

The Producer is responsible for producing data and the Consumer for consuming data; a Socket connection must be established between the Producer and the Consumer

B.

A Producer can send data either to a Broker or to a Consumer

C.

As a message producer, it can write data directly to ZooKeeper

D.

A Producer can produce data by connecting to any normal Broker instance

Questions 46

In FusionInsight HD, which of the following scenarios is Streaming suitable for? (multiple choice)

Options:

A.

Streaming data monitoring

B.

Real-time visit statistics of the website

C.

Offline log analysis

D.

Traffic flow analysis

Questions 47

In HDFS application development, which of the following are interfaces supported by the HDFS service? (multiple choice)

Options:

A.

BufferedOutputStream.write

B.

BufferedOutputStream.flush

C.

FileSystem.create

D.

FileSystem.append

Questions 48

In HBase of FusionInsight HD, which of the following scenarios does not trigger a Flush operation?

Options:

A.

The client sets scan caching too large when initiating a scan request

B.

The client explicitly initiates a Flush operation through a specified method

C.

The total memory of the RegionServer exceeds the threshold

D.

The MemStore size of a Region exceeds the threshold

Questions 49

Regarding the topology (Topology) of Streaming, which of the following descriptions is wrong?

Options:

A.

A Topology is a directed acyclic graph (DAG) composed of a group of Spout components and Bolt components connected through Stream Groupings

B.

A Topology runs until it is explicitly killed

C.

Business logic is encapsulated in the Topology

D.

Only one Worker process can be specified to start a Topology

Questions 50

Which of the following statements about loading data into Hive is wrong?

Options:

A.

Data can be loaded directly from HDFS into a Hive table

B.

Data can be loaded directly from the local disk of the node where HiveServer is located into a Hive table

C.

Data can be loaded directly from the local disk of the node where the client is located into a Hive table

D.

Hive's data loading process does not parse the file content; it is mainly a file transfer process

Questions 51

Which Flume source type supports real-time data collection?

Options:

A.

taildir

B.

Log4j

C.

JMS

D.

Thrift

Questions 52

Which of the following descriptions of Redis features are correct?

Options:

A.

low latency

B.

Rich data structures

C.

Support data persistence

D.

Supports an SQL query interface

Questions 53

Which of the following descriptions of HBase rowkey design principles are correct? (multiple choice)

Options:

A.

Attribute values with high access weight are placed in the first half of the rowkey

B.

Attribute values with high access weight are placed in the second half of the rowkey

C.

Attribute values with good dispersion are placed in the first half of the rowkey

D.

Attribute values with good dispersion are placed in the second half of the rowkey

Questions 54

Which of the following descriptions of Redis cluster topology information is correct?

Options:

A.

The client caches the topology information of the cluster

B.

The server caches the topology information of the cluster

C.

both are

D.

more than two

Questions 55

In FusionInsight HD, regarding the creation of Solr resources and the use of read and write permissions, which of the following statements is wrong?

Options:

A.

Users in the Solr user group can create configuration sets and specify a configuration set when creating a Collection

B.

Users in the Solr user group can create Collections

C.

The Collection creator, roles with read and write permissions on the Collection, and the Solr admin role can update the Collection

D.

Only the Collection creator and the Solr admin role can delete the Collection

Questions 56

In FusionInsight HD, which of the following source types does Flume support? (multiple choice)

Options:

A.

Avro

B.

HBase

C.

Kafka

D.

HTTP

Questions 57

In the FusionInsight HD product, what is the role name of the Kafka service?

Options:

A.

Producer

B.

Broker

C.

Consumer

D.

ZooKeeper

Questions 58

When HDFS is running, the NameNode loads all file system metadata from disk into memory, so the total number of files the file system can store is limited by the NameNode's memory capacity.

Options:

A.

True

B.

False

Questions 59

During Solr application development in FusionInsight HD, you can use the Solr Admin UI to run some checks on a Collection.

Which of the following statements about the Solr Admin UI are correct? (multiple choice)

Options:

A.

Click Tree under Cloud to view metadata in SolrCloud such as Collections, configuration sets, and live_nodes

B.

Click Cloud to view the distribution and status of each Shard Replica under each Collection

C.

Open the Core Overview page of a Shard Replica of a Collection to view the actual number of indexed documents, the storage size, and the location information of that Replica

D.

Users in the Solr user group, the Solr admin role, and the supergroup group have access to the Solr Admin UI

Questions 60

In Kafka, which of the following commands shows how many partitions a Topic has?

Options:

A.

kafka-topics.sh --describe

B.

kafka-topics.sh --list

C.

kafka-topics.sh --alter

D.

kafka-topics.sh --create

Questions 61

When Flume writes to HDFS, in which ways can files be generated? (multiple choice)

Options:

A.

Generate files directly according to the source file name

B.

Set the file generation time interval, close the old file when the time interval is reached, and the new data will be written to the new file

C.

Set the size of a single file, when the file size reaches the set value, the old file will be closed, and the new data will be written to the new file

D.

Set the number of events stored in a single file. When the number of events in the file reaches the set value, the old file is closed and new data is written to a new file
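
The time-, size-, and count-based triggers described in options B to D can be pictured as one roll check. The sketch below is only an illustration, not Flume's implementation: `should_roll` and its parameter names are hypothetical, and it assumes that a threshold of 0 disables the corresponding trigger.

```python
def should_roll(age_seconds, size_bytes, event_count,
                roll_interval=0, roll_size=0, roll_count=0):
    """Return True when the open file should be closed and a new one started.

    Hypothetical helper: each threshold is checked independently, and a
    threshold of 0 is assumed to mean "this trigger is disabled".
    """
    if roll_interval > 0 and age_seconds >= roll_interval:
        return True   # time-based roll
    if roll_size > 0 and size_bytes >= roll_size:
        return True   # size-based roll
    if roll_count > 0 and event_count >= roll_count:
        return True   # event-count-based roll
    return False


# Example: roll every 60 seconds or every 1000 events, whichever comes first.
print(should_roll(age_seconds=10, size_bytes=500, event_count=1000,
                  roll_interval=60, roll_count=1000))
```

In Apache Flume's HDFS sink, settings along these lines are exposed as hdfs.rollInterval, hdfs.rollSize, and hdfs.rollCount; check the documentation of the deployed version for their exact semantics.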

Questions 62

On the FusionInsight Manager interface, when a Kafka insufficient disk capacity alarm is received and hard disk hardware failure has been ruled out as the cause, the system administrator needs to consider capacity expansion to solve the problem.

Options:

A.

True

B.

False

Questions 63

In which of the following is a MapReduce task ultimately executed?

Options:

A.

NodeManager

B.

container

C.

ResourceManager

D.

AppMaster

Questions 64

Solr is a high-performance full-text search service based on Lucene. Solr extends Lucene and supports the SolrCloud mode.

Options:

A.

True

B.

False

Questions 65

Which of the following Redis commands are atomic operations?

Options:

A.

INCR (increments the stored numeric value by one)

B.

HINCRBY (adds the increment to the value of the field in the hash table key)

C.

LPUSH (push operation on a linked list)

D.

All of the above

Questions 66

During Flume cascaded transfers, failover mode can be used so that if the next-hop Flume node fails or receives data abnormally, transmission automatically switches to another path and continues.

Options:

A.

True

B.

False

Questions 67

In FusionInsight HD, where can you view the results of running a MapReduce application?

Options:

A.

The FusionInsight Manager web interface

B.

The Loader web interface

C.

The Solr web interface

D.

The Yarn web interface

Questions 68

When performing a full-text search with Solr, the wt parameter specifies the response format of the query result. Which of the following statements about Solr's query response formats is wrong?

Options:

A.

Supports CSV and JSON

B.

Supports CSV, JSON, and HTML

C.

Supports CSV, JSON, and XML

Questions 69

In Spark, assume lines is a DStream object and the filter statement filters out 80% of the data. Which statement about the following two expressions is correct?

X: lines.filter(…).groupByKey(…)

Y: lines.groupByKey(…).filter(…)

Options:

A.

X has higher performance than Y

B.

X has lower performance than Y

C.

X and Y have the same performance

D.

The performance difference between X and Y is uncertain
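
To make the difference between X and Y concrete, the following plain-Python sketch counts how many records each ordering has to group. It is a local stand-in under stated assumptions, not Spark code: `group_by_key`, `pipeline_x`, and `pipeline_y` are hypothetical helpers, the batch is made up, and the filter in Y is assumed to apply the same per-record predicate to the grouped values. In Spark, grouping by key involves a shuffle, so the number of records entering the grouping step is the quantity worth watching.

```python
from collections import defaultdict


def group_by_key(pairs):
    """Group (key, value) pairs by key, mimicking groupByKey on one batch."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return dict(groups)


def pipeline_x(pairs, keep):
    """X: filter first, then group. Returns (result, records grouped)."""
    kept = [(k, v) for k, v in pairs if keep(k, v)]
    return group_by_key(kept), len(kept)


def pipeline_y(pairs, keep):
    """Y: group first, then filter the grouped values."""
    grouped = group_by_key(pairs)
    result = {}
    for key, values in grouped.items():
        kept = [v for v in values if keep(key, v)]
        if kept:
            result[key] = kept
    return result, len(pairs)


# Hypothetical batch of 10 records; the predicate drops 8 of them (80%).
batch = [("u%d" % (i % 3), i) for i in range(10)]
keep = lambda key, value: value >= 8

x_result, x_grouped = pipeline_x(batch, keep)
y_result, y_grouped = pipeline_y(batch, keep)
print(x_result == y_result, x_grouped, y_grouped)
```

Both orderings produce the same groups here, but X groups 2 records while Y groups all 10.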

Questions 70

In FusionInsight HD, regarding the creation of Solr resources and the use of read and write permissions, which of the following statements is wrong?

Options:

A.

Users in the Solr user group can create configuration sets and specify a configuration set when creating a Collection

B.

Users in the Solr user group can create Collections

C.

The Collection creator, roles with read and write permissions on the Collection, and the Solr admin role can update the Collection

D.

Only the Collection creator and the Solr admin role can delete the Collection

Questions 71

In FusionInsight HD, which of the following descriptions of Hive UDF secondary development are correct? (multiple choice)

Options:

A.

A user-defined UDF must be created in the Hive system before it is used.

B.

A user-defined UDF is not allowed to add information such as a summary or status.

C.

A user-defined UDF can add annotations such as deterministic and stateful as needed.

D.

In a secure cluster, it is recommended to create a user-defined UDF once before each use.

Questions 72

In the FusionInsight HD product, what is the role name of the Kafka service?

Options:

A.

Producer

B.

Broker

C.

Consumer

D.

ZooKeeper

Questions 73

In FusionInsight HD, the Streaming command line is used to submit the com.huawei.example.WordCount task in example.jar, with the task name wcTest. Which of the following commands is correct?

Options:

A.

storm jar example.jar wcTest com.huawei.example.WordCount

B.

storm jar example.jar wcTest WordCount

C.

storm jar example.jar com.huawei.example.WordCount wcTest

D.

storm jar example.jar WordCount wcTest

Questions 74

In Kafka, a Producer can ensure that data is sent in order by configuring the synchronization parameter (producer.type).

Options:

A.

True

B.

False

Questions 75

When developing applications for the secure version of FusionInsight HD, a keytab file can be used for security authentication.

Options:

A.

True

B.

False

Questions 76

For a Spark Streaming application, only one StreamingContext can be active in a JVM at a time.

Options:

A.

True

B.

False

Questions 77

Which FusionInsight HD components provide SQL or SQL-like capabilities externally? (multiple choice)

Options:

A.

HBase

B.

Hive

C.

Spark

D.

Redis

Questions 78

Regarding the Kafka insufficient disk capacity alarm, which of the following analyses of possible causes is incorrect?

Options:

A.

The disk configuration used to store Kafka data (such as the number and size of disks) cannot meet the current service data traffic, causing disk usage to reach the upper limit

B.

The data storage time is configured too long, and the accumulated data reaches the upper limit of the disk usage.

C.

Unreasonable business planning results in uneven data distribution and some disks reach the upper limit of usage

D.

Caused by a Broker node failure

Questions 79

The two key elements of a Flink program are stream data and transformation operators.

Options:

A.

True

B.

False

Questions 80

Which of the following Redis commands are atomic operations?

Options:

A.

INCR (increments the stored numeric value by one)

B.

HINCRBY (adds the increment to the value of the field in the hash table key)

C.

LPUSH (push operation on a linked list)

D.

All of the above

Questions 81

In FusionInsight HD, which of the following descriptions of the Hive partition function is wrong?

Options:

A.

Partition fields are defined when the table is created

B.

There can only be one partition field, and multi-level partitions cannot be created

C.

Using partitions can reduce the data scan range of some queries, thereby improving query efficiency

D.

Partition fields can be used as conditions in the where clause

Questions 82

Consider the following business scenario: users' online log files are stored on HDFS. Each online record has three fields (name, gender, and online time), separated by ",".

All female netizens who spend more than two hours online must be printed. Which of the following code snippets achieve this? (multiple choice)

Options:

A.

sc.textFile("/data/file/path").map(_.split(",")).map(p => FemaleInfo(p(0), p(1), p(2).trim.toInt)).toDF.registerTempTable("FemaleInfoTable")
sqlContext.sql("select name, sum(stayTime) as stayTime from FemaleInfoTable where gender = 'female' group by name").filter("stayTime >= 120").collect().foreach(println)

B.

sc.textFile("/data/file/path").map(_.split(",")).map(p => FemaleInfo(p(0), p(1), p(2).trim.toInt)).toDF.registerTempTable("FemaleInfoTable")
sqlContext.sql("select name, sum(stayTime) as stayTime from FemaleInfoTable where gender = 'female'").filter("stayTime >= 120").collect().foreach(println)

C.

val text = sc.textFile("/data/file/path")
val data = text.filter(_.contains("female"))
val femaleData: RDD[(String, Int)] = data.map { line => val t = line.split(',')
(t(0), t(2).toInt)
}.reduceByKey(_ + _)
val result = femaleData.filter(line => line._2 > 120)
result.collect.map(x => x._1 + ',' + x._2).foreach(println)

D.

val text = sc.textFile("/data/file/path")
val data = text.filter(_.contains("female"))
val femaleData: RDD[(String, Int)] = data.map { line => val t = line.split(',')
(t(0), t(2).toInt)
}
val result = femaleData.filter(line => line._2 > 120)
result.collect.map(x => x._1 + ',' + x._2).foreach(println)
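
The business rule in this question (sum each female user's online time, then keep totals above two hours) can be checked locally without a cluster. The sketch below is a plain-Python illustration, not one of the answer options: `female_over_threshold` and the sample records are hypothetical, and the online time is assumed to be recorded in minutes, as the comparison against 120 in the options suggests.

```python
from collections import defaultdict


def female_over_threshold(lines, threshold_minutes=120):
    """Sum online time per female user and keep totals above the threshold.

    Assumes each line is "name,gender,minutes" (minutes is an assumption).
    """
    totals = defaultdict(int)
    for line in lines:
        name, gender, minutes = [field.strip() for field in line.split(",")]
        if gender == "female":
            totals[name] += int(minutes)
    # Aggregate per user first, then filter on the aggregated total.
    return sorted((n, t) for n, t in totals.items() if t > threshold_minutes)


# Hypothetical sample records.
sample = [
    "Alice,female,50",
    "Bob,male,200",
    "Alice,female,90",
    "Carol,female,60",
    "Carol,female,60",
]
for name, total in female_over_threshold(sample):
    print(f"{name},{total}")
```

Note that the per-user aggregation has to happen before the threshold filter: no single record of Alice exceeds 120 minutes, but her total does.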

Questions 83

Which of the following descriptions of Flume is correct?

Options:

A.

(1) Data goes from the source to the channel; (2) the sink fetches data from the channel

B.

(1) Data goes from the channel to the source; (2) the sink fetches data from the source

C.

(1) Data goes from the sink to the channel; (2) the source fetches data from the channel

D.

(1) Data goes from the channel to the source; (2) the sink fetches data from the source
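
As background for the options above, a Flume agent is built from three component types: a source that accepts events, a channel that buffers them, and a sink that takes them out of the channel and delivers them. The sketch below models that hand-off with an in-memory queue. It is an illustration only, not Flume code: `run_source`, `run_sink`, and the event names are hypothetical.

```python
from queue import Queue, Empty


def run_source(lines, channel):
    """Source side: put every incoming event into the channel."""
    for line in lines:
        channel.put(line)


def run_sink(channel, destination):
    """Sink side: drain events from the channel and deliver them."""
    while True:
        try:
            event = channel.get_nowait()
        except Empty:
            break   # channel is empty, nothing left to deliver
        destination.append(event)


channel = Queue()       # stands in for a memory channel
delivered = []          # stands in for the final destination
run_source(["event-1", "event-2", "event-3"], channel)
run_sink(channel, delivered)
print(delivered)
```

The channel decouples the two sides, so the source and the sink never talk to each other directly.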

Questions 84

Which of the following statements about the Kafka Producer are correct? (multiple choice)

Options:

A.

The Producer is a message producer

B.

A Topic must be specified when the Producer produces data

C.

Multiple Producer processes can be started at the same time to send data to the same Topic

D.

When producing data, the Producer must first connect to ZooKeeper and then connect to the Broker

Questions 85

In a FusionInsight HD HBase cluster, Table1 belongs to Namespace1 and Table2 belongs to Namespace2. Table1 has two column families, cf11 and cf12, and Table2 has one column family named cf21. Which of the following give a user account write permission on both cf11 and cf21 at the same time? (multiple choice)

Options:

A.

Grant this user account global read permission

B.

Grant this user account read and write permissions on Namespace1

C.

Grant this user account read and write permissions on Table1 and Table2

D.

Grant this user account read and write permissions on Namespace1 and Namespace2

Questions 86

Options:

A.

Spark

B.

Streaming

C.

MapReduce

D.

HDFS

Questions 87

In HBase, while an application is running, it can write data while creating the table.

Options:

A.

True

B.

False

Questions 88

When creating a table through HBase's createTable method, which parameters must be passed in?

Options:

A.

Table name

B.

Table name and columns

C.

Table name and column families

D.

Can be empty

Questions 89

On a FusionInsight HD client, which content does executing the kinit {account} command obtain from the KDC?

Options:

A.

krb5.conf

B.

TGT

C.

ST

D.

jaas.conf

Questions 90

In HBase of FusionInsight HD, which of the following scenarios does not trigger a Flush operation?

Options:

A.

The client sets scan caching too large when initiating a scan request

B.

The client explicitly initiates a Flush operation through a specified method

C.

The total memory of the RegionServer exceeds the threshold

D.

The MemStore size of a Region exceeds the threshold

Questions 91

In FusionInsight HD, which of the following are configuration items of Oozie's MapReduce Action? (multiple choice)

Options:

A.

name-node

B.

source

C.

mapred.mapper.class

D.

job-tracker

Questions 92

In Spark, assume lines is a DStream object. Which of the following statements periodically counts the words on this stream?

Options:

A.

lines.flatMap(_.split(" ")).map(word => (word, 1)).reduce(_ + _).print()

B.

lines.flatMap(_.split(" ")).map(word => (word, word.length())).reduceByKey(_ + _).print()

C.

lines.flatMap(_.split(" ")).map(word => (word, 1)).reduceByKey(_ + _).print()

D.

lines.flatMap(_.split(" ")).flatmap(word => (word, 1)).groupByKey(_ + _).print()
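
The three steps that a per-batch word count needs (split lines into words, pair each word with 1, sum the ones per word) can be traced on small in-memory batches. The sketch below is a plain-Python stand-in, not Spark Streaming code: `count_words` and the sample batches are hypothetical, and whitespace is assumed as the word separator because the split argument in the options is garbled in the source.

```python
from collections import Counter


def count_words(batch_lines):
    """Count words in one micro-batch of text lines."""
    # Step 1 (flatMap-like): split every line and flatten into one word list.
    words = [word for line in batch_lines for word in line.split()]
    # Step 2 (map-like): pair each word with an initial count of 1.
    pairs = [(word, 1) for word in words]
    # Step 3 (reduceByKey-like): add up the counts of identical words.
    counts = Counter()
    for word, one in pairs:
        counts[word] += one
    return dict(counts)


# Two hypothetical micro-batches, processed one after the other.
batches = [["a b a"], ["b c", "c c"]]
for batch in batches:
    print(count_words(batch))
```

Each batch is counted independently, which corresponds to counting periodically rather than keeping a running total across the whole stream.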

Questions 93

Which of the following is not a characteristic of MapReduce?

Options:

A.

easy to program

B.

good scalability

C.

real-time computing

D.

High fault tolerance

Questions 94

Regarding the WebHCat development interface of the Hive service on the FusionInsight HD platform, which of the following descriptions is incorrect?

Options:

A.

Supports REST-based query requests

B.

The data format returned by WebHCat is SML

C.

WebHCat provides services externally over the HTTP and HTTPS protocols

D.

Tables can be created, queried, and so on through WebHCat

Questions 95

In a FusionInsight HD cluster, which service does Flume not support writing collected data to?

Options:

A.

HDFS

B.

HBase

C.

Kafka

D.

Redis

Questions 96

In Spark, which of the following is the DataFrame operator that takes the intersection?

Options:

A.

filter(conditionExpr: String): DataFrame

B.

join(right: DataFrame, joinExprs: Column): DataFrame

C.

intersect(other: DataFrame): DataFrame

D.

dropDuplicates(): DataFrame

Questions 97

Which of the following scenarios is the Redis LIST data structure suitable for? (multiple choice)

Options:

A.

Build queuing systems, such as message queues

B.

uniq operations, such as getting the deduplicated values of all data in a certain period of time

C.

Getting the latest N data items: for example, getting the latest 10 comments on a certain microblog post

D.

Simulate stack operations
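
Several of these options depend only on which end of a list is pushed to and popped from. The sketch below models a single list in memory to show that behaviour. It is not a Redis client: `ListModel` is a hypothetical stand-in whose method names merely mirror the Redis commands LPUSH, LPOP, RPOP, LTRIM, and LRANGE, and it handles only non-negative indexes.

```python
from collections import deque


class ListModel:
    """In-memory stand-in for one Redis LIST (hypothetical, not a client)."""

    def __init__(self):
        self._items = deque()

    def lpush(self, value):
        """Insert at the head and return the new length."""
        self._items.appendleft(value)
        return len(self._items)

    def lpop(self):
        """Remove and return the head, or None when empty."""
        return self._items.popleft() if self._items else None

    def rpop(self):
        """Remove and return the tail, or None when empty."""
        return self._items.pop() if self._items else None

    def ltrim(self, start, stop):
        """Keep only positions start..stop (inclusive, non-negative)."""
        self._items = deque(list(self._items)[start:stop + 1])

    def lrange(self, start, stop):
        """Return positions start..stop (inclusive, non-negative)."""
        return list(self._items)[start:stop + 1]


# Latest-N pattern: push at the head, then trim to the newest 10 entries.
comments = ListModel()
for i in range(1, 16):
    comments.lpush(f"comment-{i}")
    comments.ltrim(0, 9)
latest = comments.lrange(0, 9)
print(latest[0], latest[-1], len(latest))

# Queue (FIFO): push at the head, pop from the tail.
jobs = ListModel()
jobs.lpush("a")
jobs.lpush("b")
first_out = jobs.rpop()

# Stack (LIFO): push and pop at the same end.
stack = ListModel()
stack.lpush("a")
stack.lpush("b")
top = stack.lpop()
print(first_out, top)
```

Pushing and popping at opposite ends gives queue behaviour, the same end gives stack behaviour, and a push followed by a trim keeps a bounded list of the newest items.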

Questions 98

In a Hive application in FusionInsight HD, consider the following scenario: ? ? ? storage files have higher ? ? efficiency, and most ? ? involve only part of the fields in the file. This scenario is suitable for columnar file (ORC File) storage.

Options:

A.

True

B.

False

Questions 99

In Streaming application development, which of the following interfaces does a Bolt use to send a Tuple?

Options:

A.

emit

B.

execute

C.

open

D.

nextTuple

Questions 100

In the online log query solution, ? processing completes the calculation work. Throughout the calculation, intermediate results need to be stored temporarily. Which of the following components are suitable for storing intermediate calculation results? (multiple choice)

Options:

A.

HDFS

B.

HBase

C.

Kafka

D.

Redis

Questions 101

forFusionInsight HDplatformHBaseComponent, which properties of the secondary index need to be defined to add a secondary index? (multiple choice)

Options:

A.

index name

B.

index column

C.

index column type

D.

The name of the column family to which the indexed column belongs

Questions 102

When writing a MapReduce program, which two interfaces do developers usually need to implement?

Options:

A.

map and combine

B.

reduce and combine

C.

combine and sort

D.

map and reduce

Questions 103

For HBase, which of the following properties can be set? (multiple choice)

Options:

A.

Versions

B.

TTL

C.

Coprocessor

D.

Compression

Questions 104

When a table is created with HBase's createTable method, which parameters must be passed in?

Options:

A.

Table name

B.

Table name and columns

C.

Table name and column families

D.

Can be empty

Questions 105

A file is stored with N (N > 1) replicas, and an HDFS client initiates a request to read it. If the replica node being read fails, the client will not go to other replica nodes to read the data.

Options:

A.

True

B.

False

Questions 106

An application needs to interact with two FusionInsight clusters at the same time: it needs to access the HBase service of cluster 1 and the Hive service of cluster 2. Which of the following operations are required? (multiple choice)

Options:

A.

On cluster 1, create the account required by this application, with access to the HBase service

B.

On cluster 2, create an account with the same name as the one in option A, and allow this account to access that cluster's Hive service

C.

The accounts created in options A and B must belong to the supergroup group

D.

Cluster 1 and cluster 2 complete the mutual trust operation

Questions 107

A project needs to save the Internet access records of a certain area and run full-text searches over these records to see whether they contain any ?? information, in order to prevent crime in the region. In this scenario, which of the following options is the best?

Options:

A.

Create an index and save the data in Solr, and return all the data during full-text search.

B.

Store the data in HBase and use HBase's filter feature to satisfy fuzzy-match queries.

C.

When storing data, build the index in Solr and store the complete data in HBase. Then use Solr full-text search to obtain the key information of matching records, and use that key information to fetch the full records from HBase.

Questions 108

Which of the following descriptions of Flume's data collection characteristics are correct?

Options:

A.

Supports distributed and streaming collection

B.

Collects data in parallel with the help of MapReduce tasks

C.

Only structured data can be collected

D.

Triggers data collection with external scheduled tasks

Questions 109

The system time of the node where the HDFS client runs should be consistent with the system time of the FusionInsight HD cluster. If there is a difference, it should be less than how many minutes?

Options:

A.

5

B.

10

C.

20

D.

30

Questions 110

In the MapReduce development framework, what is the function of the InputFormat class?

Options:

A.

Merge the key-value pairs that have the same key in the results of the map stage

B.

Merge all the results of the map stage to get the final result

C.

Sort and merge the results of the map stage

D.

Split the input data into individual splits, and further split each split into <key, value> pairs that serve as input to the map function

Questions 111

Consider the following scenario: the online system generates 500 GB of new data every day, and this data needs to be aggregated by day, week, month, and other dimensions. Which kind of Hive table is suitable for handling it?

Options:

A.

Bucket table, bucketed by day

B.

Partition table, partitioned by day

C.

Skew table, with date as the skew field and random skew values

D.

Partition + bucket table, partitioned by time, with 2560 buckets per partition

Questions 112

There is a file test.txt in the root directory of an HDFS cluster. Which of the following commands can show information about the DataNode nodes where the file is stored?

Options:

A.

hdfs fsck /test.txt -files

B.

hdfs fsck /test.txt -locations

C.

hdfs fsck /test.txt -blocks

D.

hdfs fsck /test.txt -list-corruptfileblocks

Questions 113

Which of the following table types does Hive support? (multiple choice)

Options:

A.

Partition table

B.

Bucket table

C.

Skew table

D.

Partition + bucket table

Questions 114

In FusionInsight HD Hive, a user-defined UDF can have the same name as a Hive built-in UDF. In this case, the user-defined UDF is used.

Options:

A.

True

B.

False

Questions 115

On a FusionInsight HD client, running the kinit {account} command obtains which content from the KDC?

Options:

A.

krb5.conf

B.

TGT

C.

ST

D.

jaas.conf

Questions 116

HBase table rowkey design is a very important part of development and design. Suppose the most frequent query is to look up the historical call records of a mobile phone number for a given month or half-year. Which of the following rowkey designs is optimal?

Options:

A.

Name + mobile number

B.

Date + mobile number

C.

Mobile number + date

D.

Mobile number + name
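
HBase keeps rows sorted by rowkey, so the order of the rowkey components decides which rows end up next to each other. The sketch below imitates that with a sorted Python list and a prefix range scan. The helper names, the `_` separator, and the phone numbers and dates are all made up for illustration; this is not HBase client code.

```python
from bisect import bisect_left
from typing import List


def make_rowkey(phone: str, date: str) -> str:
    """Compose a rowkey with the phone number first and the date second."""
    return f"{phone}_{date}"


def prefix_scan(sorted_keys: List[str], prefix: str) -> List[str]:
    """Return the contiguous run of keys that start with the given prefix."""
    start = bisect_left(sorted_keys, prefix)
    # "\uffff" sorts after every character used in these keys.
    end = bisect_left(sorted_keys, prefix + "\uffff")
    return sorted_keys[start:end]


# Made-up call records: (phone number, call date as yyyymmdd).
records = [
    ("13800000001", "20240105"),
    ("13800000002", "20240106"),
    ("13800000001", "20240220"),
    ("13800000001", "20240117"),
    ("13800000003", "20240109"),
]

# Sorting stands in for the ordered storage of rows by rowkey.
table = sorted(make_rowkey(phone, date) for phone, date in records)

# One number, one month: a single contiguous range of keys.
january_calls = prefix_scan(table, "13800000001_202401")
print(january_calls)
```

With the date placed first, the records of one phone number would be scattered across every date prefix, and the same query would have to touch many separate ranges.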

Questions 117

In Spark application development, which of the following code snippets counts words correctly?

Options:

A.

val counts = textFile.map(line => line.split(" ")).map(word => (word, 1)).reduceByKey(_ + _)

B.

val counts = textFile.flatMap(line => line.split(" ")).map(word => (word, 1)).reduceByKey(_ + _)

C.

val counts = textFile.map(line => line.split(" ")).map(word => (word, 1)).groupByKey()

D.

val counts = textFile.flatMap(line => line.split(" ")).map(word => (word, 1)).groupByKey()
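
The four options differ in two places: map versus flatMap, and reduceByKey versus groupByKey. The plain-Python sketch below imitates those operators on an in-memory list to show the difference. It is not Spark code; `reduce_by_key` and `group_by_key` are hypothetical stand-ins, and the two input lines are made up.

```python
from collections import defaultdict
from itertools import chain
from typing import Dict, Iterable, List, Tuple

lines = ["to be or not", "to be"]

# map: one output element per line, so each element is a whole list of words.
mapped: List[List[str]] = [line.split(" ") for line in lines]

# flatMap: the per-line lists are flattened into a single sequence of words.
flat_mapped: List[str] = list(chain.from_iterable(line.split(" ") for line in lines))


def reduce_by_key(pairs: Iterable[Tuple[str, int]]) -> Dict[str, int]:
    """Combine the values of each key with addition."""
    totals: Dict[str, int] = defaultdict(int)
    for key, value in pairs:
        totals[key] += value
    return dict(totals)


def group_by_key(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Collect the values of each key without combining them."""
    groups: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return dict(groups)


counts = reduce_by_key((word, 1) for word in flat_mapped)
grouped = group_by_key((word, 1) for word in flat_mapped)
print(counts)
```

In this sketch only the flattened sequence yields one pair per word, and only the reducing step turns those pairs into totals; grouping leaves a list of ones per word that still has to be summed.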

Questions 118

In a FusionInsight HD system, which of the following descriptions of Hive's JDBC interface is correct?

Options:

A.

In a network-isolated environment, the ZooKeeper IP can be configured as either the service IP or the management IP.

B.

Hive is different from traditional databases, so a connection established through JDBC does not need to be closed.

C.

In a secure cluster environment, you need to log in with unified authentication before establishing a connection to Hive.

D.

When submitting SQL through JDBC, a semicolon ";" can be added after the SQL statement.

Questions 119

To serve an HBase data read, which parts of the data need to be read? (multiple choice)

Options:

A.

HFile

B.

HLog

C.

MemStore

D.

HMaster

Questions 120

Options:

A.

The client preferentially downloads data from the nearest DataNode.

B.

File data is first returned from the DataNode to the NameNode, and then downloaded from the NameNode to the client.

C.

If the connected DataNode fails while the client is reading, the client abandons the failed node and connects to a node that holds a replica.

D.

Multiple clients can read the same file data from the DataNode at the same time.

Questions 121

Spark SQL tables often contain many small files (much smaller than the HDFS block size). In this case, Spark starts Tasks to process these small files, and when the SQL involves a Shuffle operation, this greatly increases the number of hash buckets and seriously affects performance.

Options:

A.

True

B.

False

Questions 122

Which of the following is not a characteristic of MapReduce?

Options:

A.

Easy to program

B.

Good scalability

C.

Real-time computing

D.

High fault tolerance

Questions 123

Regarding Spark in FusionInsight HD, which of the following programming languages can be used to develop Spark applications? (multiple choice)

Options:

A.

C

B.

Scala

C.

Java

D.

Python

Questions 124

For an application running on the MapReduce platform, where are the jar packages that the application depends on placed?

Options:

A.

HIVE

B.

HBASE

C.

HDFS

D.

DB

Questions 125

Which of the following scenarios is the Flink component not good at? (multiple choice)

Options:

A.

Batch processing

B.

Iterative computing

C.

Stream processing

D.

Data storage

Questions 126

Spark is a memory-based computing engine; all data of a Spark program can only be stored in memory while the program runs.

Options:

A.

True

B.

False

Questions 127

An HBase client batch-writes 10 rows of data. One HRegionServer node hosts 2 Regions of the table, A and B. Of the 10 rows, 2 belong to A and 4 belong to B. How many RPC requests need to be sent to this HRegionServer to write these 10 rows?

Options:

A.

1

B.

2

C.

6

D.

10

Questions 128

In Spark, which of the following statements about broadcast variables are correct? (multiple choice)

Options:

A.

A copy of the broadcast variable is saved in each Task

B.

A copy of the broadcast variable is saved in each Executor

C.

Broadcast variables are read-only

D.

Broadcast variables are created with the SparkContext.broadcast() method

Questions 129

Which of the following descriptions of HBase rowkey design principles are correct? (multiple choice)

Options:

A.

Attribute values with high access weight are placed in the first half of the RowKey

B.

Attribute values with high access weight are placed in the second half of the RowKey

C.

Attribute values with good dispersion are placed in the first half of the RowKey

D.

Attribute values with good dispersion are placed in the second half of the RowKey

Questions 130

Kerberos security authentication.

Options:

A.

True

B.

False

Questions 131

Solr is a high-performance full-text search service based on Lucene. Solr extends Lucene, provides a richer query language than Lucene, implements powerful full-text search, and is highly scalable. Starting with Solr 4.0, it also supports the SolrCloud mode.

Options:

A.

True

B.

False

Questions 132

When a table is created with HBase's createTable method, which parameters must be passed in?

Options:

A.

Table name

B.

Table name and column families (or table name and family)

C.

Table name and columns

D.

Can be empty

Questions 133

During Solr application development on FusionInsight HD, you can use the Solr Admin UI to verify a Collection. Which of the following statements about the Solr Admin UI are correct? (multiple choice)

Options:

A.

Click Tree under Cloud to view the Collections, configuration sets, live_nodes, and other information in Solr Cloud

B.

Click Cloud to see the distribution and status of each Replica of each Shard under each Collection

C.

Enter the Core Overview page of a Shard Replica of a Collection to view the actual number of documents indexed by the Replica, its storage size, and its location information

D.

Users of the Solr user group, the Solr admin role, and the supergroup have access to the Solr Admin UI

Questions 134

When an HDFS Client writes a file, the location of the first replica is determined by the NameNode, and the locations of the other replicas are determined by the DataNode.

Options:

A.

True

B.

False

Questions 135

On which distributed computing frameworks can the Hive component of FusionInsight HD run? (multiple choice)

Options:

A.

Spark

B.

Streaming

C.

MapReduce

D.

HDFS

Questions 136

Oozie's workflow.xml file is the workflow configuration file.

Options:

A.

True

B.

False

Questions 137

There is a file test.txt in the root directory of an HDFS cluster. Which of the following commands can show information about the DataNode nodes where the file is stored?

Options:

A.

hdfs fsck /test.txt -files

B.

hdfs fsck /test.txt -locations

C.

hdfs fsck /test.txt -blocks

D.

hdfs fsck /test.txt -list-corruptfileblocks

Questions 138

Which interfaces does FusionInsight Manager support for interfacing with external management platforms? (multiple choice)

Options:

A.

SNMP

B.

VPN

C.

BGP

D.

syslog

Questions 139

Which Flume source type supports __________ real-time data collection?

Options:

A.

taildir

B.

Log4j

C.

JMS

D.

Thrift

Questions 140

Flink supports deployment in Local mode and Cluster mode (and cloud deployment); other deployment modes are not currently supported.

Options:

A.

True

B.

False

Questions 141

Which of the following statements about the characteristics of HBase's BloomFilter is incorrect?

Options:

A.

Used to filter data

B.

Used to optimize random performance

C.

Will increase storage consumption

D.

Can accurately determine that a certain data does not exist
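
The general Bloom filter idea behind these statements can be sketched in a few lines. The class below is a minimal, assumed implementation for illustration only: the bit-array size, the number of hash functions, and the SHA-256-based hashing are arbitrary choices, not what HBase uses. It shows the one-sided guarantee of the structure: a key that was added is always reported as possibly present, a negative answer is always reliable, and the bit array is extra space that has to be stored.

```python
import hashlib
from typing import List


class BloomFilterSketch:
    """Minimal Bloom filter; sizes and hashing scheme are illustrative choices."""

    def __init__(self, num_bits: int = 256, num_hashes: int = 3) -> None:
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits: List[bool] = [False] * num_bits

    def _positions(self, key: str) -> List[int]:
        """Derive num_hashes bit positions from seeded SHA-256 digests."""
        positions = []
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{key}".encode("utf-8")).digest()
            positions.append(int.from_bytes(digest[:8], "big") % self.num_bits)
        return positions

    def add(self, key: str) -> None:
        for pos in self._positions(key):
            self.bits[pos] = True

    def might_contain(self, key: str) -> bool:
        """False means definitely absent; True means possibly present."""
        return all(self.bits[pos] for pos in self._positions(key))


bloom = BloomFilterSketch()
stored_keys = [f"row-{i}" for i in range(20)]
for key in stored_keys:
    bloom.add(key)

# Every added key is reported as possibly present (no false negatives).
all_found = all(bloom.might_contain(key) for key in stored_keys)

# An empty filter has no bits set, so any lookup is a definite "absent".
empty_lookup = BloomFilterSketch().might_contain("row-0")
```

A lookup for a key that was never added may still return True when its bit positions happen to be set by other keys, which is why a positive answer only narrows down where to read.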

Questions 142

In the MapReduce development framework, what is the function of the InputFormat class?

Options:

A.

Merge the key-value pairs that have the same key in the results of the map stage

B.

Merge all the results of the map stage to get the final result

C.

Sort and merge the results of the map stage

D.

Divide the input data into individual splits, and further divide each split into key-value pairs that serve as input to the map function

Questions 143

In FusionInsight HD, which of the following descriptions of the Hive partition function is wrong?

Options:

A.

Partition fields are defined when the table is created

B.

There can only be one partition field, and multi-level partitions cannot be created

C.

Using partitions can reduce the data scan range of some queries, thereby improving query efficiency

D.

Partition fields can be used as conditions in a where clause

Questions 144

In a FusionInsight HD system, which of the following descriptions of Hive's JDBC interface is correct?

Options:

A.

In a network-isolated environment, the ZooKeeper IP can be configured as either the service IP or the management IP.

B.

Hive is different from traditional databases, so a connection established through JDBC does not need to be closed.

C.

In a secure cluster environment, you need to log in with unified authentication before establishing a connection to Hive.

D.

When submitting SQL through JDBC, a semicolon ";" can be added after the SQL statement.

Questions 145

Oozie's workflow.xml file is the workflow configuration file.

Options:

A.

True

B.

False

Questions 146

In the FusionInsight HD product, what is the role name of the Kafka service?

Options:

A.

Producer

B.

Broker

C.

Consumer

D.

ZooKeeper

Questions 147

In FusionInsight HD, Oozie supports HDFS Action.

Options:

A.

True

B.

False

Questions 148

In FusionInsight HD, which of the following descriptions of Oozie are correct? (multiple choice)

Options:

A.

The submit command changes the job status to PREP

B.

The start command changes a job in PREP status to RUNNING

C.

The run command changes the job status to RUNNING

D.

The kill command can act on jobs in PREP/SUSPEND/RUNNING status

Questions 149

Which of the following descriptions of HBase rowkey design principles are correct? (multiple choice)

Options:

A.

Attribute values with high access weight are placed in the first half of the RowKey

B.

Attribute values with high access weight are placed in the second half of the RowKey

C.

Attribute values with good dispersion are placed in the first half of the RowKey

D.

Attribute values with good dispersion are placed in the second half of the RowKey

Questions 150

Regarding FusionInsight Manager's service management operations, which of the following statements is wrong?

Options:

A.

Can start, stop and restart the service

B.

Services can be added and uninstalled.

C.

Uncommon services can be set to hide or show

D.

Can view the current status of the service

Questions 151

Which parameter needs to be configured to enable the log aggregation function of the YARN component on the Hadoop platform?

Options:

A.

yarn.nodemanager.local-dirs

B.

yarn.nodemanager.log-dirs

C.

yarn.acl.enable

D.

yarn.log-aggregation-enable

Questions 152

Regarding Spark in FusionInsight HD, which of the following programming languages can be used to develop Spark applications? (multiple choice)

Options:

A.

C

B.

Scala

C.

Java

D.

Python

Questions 153

Which of the following Redis commands is an atomic operation?

Options:

A.

INCR (increments the stored numeric value by one)

B.

HINCRBY (adds increment to the value of field in the hash table key)

C.

LPUSH (push operation on a list)

D.

All of the above

Questions 154

In FusionInsight HD, an application deployed on Tomcat needs to access the HBase service. In this scenario, it is recommended to use a machine account.

Options:

A.

True

B.

False

Questions 155

Suppose an application has 10 tables, each with tens of millions of records and about 20 fields. Redis is now used to cache the data of these 10 tables. Which of the following data structure designs is the best?

Options:

A.

Use the hash structure, with one hash KEY per table; each row record in the table is one field of the hash key.

B.

Use the hash structure, with one hash KEY per row record of each table; the fields of the hash KEY correspond to the fields of the table record, and the KEYs of each table are given a different prefix to distinguish them.

C.

Use the string structure, with one KEY per field of each row record of each table.

D.

Use the m3 structure, with one KEY per row record of each table; the value is the concatenation of all field values of the row.

Questions 156

In FusionInsight HD, what are the characteristics of Streaming? (multiple choice)

Options:

A.

Stream processing

B.

High performance

C.

Scalable

D.

High fault tolerance

Questions 157

Which of the following descriptions of ZooKeeper features is wrong?

Options:

A.

Updates sent by the client are applied in the order in which they were sent.

B.

Once a message is received by more than half of the servers, it can be successfully written to disk.

C.

Message updates can only succeed or fail, with no intermediate states.

D.

The number of ZooKeeper nodes must be an odd number.
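
The "more than half of the servers" rule mentioned in these options can be made concrete with a little arithmetic. The sketch below assumes a simple majority quorum; the function names are hypothetical and no ZooKeeper API is involved. It tabulates, for a few ensemble sizes, how many servers form a majority and how many can fail while a majority remains.

```python
from typing import Dict, Tuple


def quorum_size(num_servers: int) -> int:
    """Smallest number of servers that is more than half of the ensemble."""
    return num_servers // 2 + 1


def tolerated_failures(num_servers: int) -> int:
    """How many servers can fail while a majority is still reachable."""
    return num_servers - quorum_size(num_servers)


# ensemble size -> (servers needed for a majority, failures tolerated)
quorum_table: Dict[int, Tuple[int, int]] = {
    n: (quorum_size(n), tolerated_failures(n)) for n in range(3, 7)
}

for size, (needed, tolerated) in quorum_table.items():
    print(f"{size} servers: majority = {needed}, tolerated failures = {tolerated}")
```

Under this rule an even-sized ensemble still works, but it tolerates no more failures than the odd size just below it, which is the usual reason odd sizes are recommended.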

Exam Code: H13-723_V2.0
Exam Name: HCIP-Big Data Developer V2.0
Last Update: May 15, 2026
Questions: 526
