H13-723_V2.0 HCIP-Big Data Developer V2.0 Questions and Answers
FusionInsight HDWhich components in the platform support table and column encryption?
(multiple choice)
FusiontnsightHD in which ways can you viewOozieDebug results of the job? (multiple choice)
FHumeofproperties.propertiesMultiple configurations can be configured in the configuration filechannelto transmit data.
FusionInsight HD assuming a topology that setsspoutConcurrency is3,bolt1Concurrency
for2,bok2Well degree is3.workerThe number is2,Sobolt1ofexecutorexistworkerhow to divide Cloth?
existSpark, which of the following is trueDataFrameThe operator that takes the intersection?
Spark Streamingavailable fromKafkaReceive data and perform calculations, and the calculation results can only be stored inHDFS,
can ' t write backKafka.
existSpark, assuminglinesIs anDStreamobject,filterStatements can be filtered out80%the number of
According to the following two sentences, the law is correct:X:lines.filter( ).groupByKey(.)Y:
lines.groupByKey( ).filter( )
Which of the following causesHDFSofNameNodeEntersafemode(install " aroot form)? (multiple choice)
FusionInsigt HD one deployed inTomcatapplication on theHBaseservice, this
It is recommended to use a machine account in this scenario.
HBasedata fileHFileone ofKeyValueWhat information does the format contain?
(multiple choice)
FusionInsight HDin, aboutHive UDFsSecondary development, is the following description correct? (multiple choice)
LoaderIf the job execution fails, the data imported during the running of this job will not be deleted automatically.
must be deleted manually.
Fusioninsight HDin useStreamingofA, CKWhich of the following statements is true
of? (multiple choice)
Which of the following measures can improveHBasequery performance? (multiple choice)
Suppose there is an application that needs to be accessed frequentlyOracleThe user table in the database, in order to improve performance, introduceRedisto cache
account information.
For this scene,RedisWhich of the following is the best data structure choice for ?
existKafkamiddle,ProducerThis can be done by configuring the synchronization parameters (producer.type) to ensure that data is sent in order.
when carried outsolrone ofcollectionWhen designing, it is necessary to design itsschema, by configuringschema.xml
file implementation pairschemadesign, below aboutschemaWhich statement is wrong?
existFusionInsight HDin, useSparkSQL, which of the following methods (or tools) can be used to performSQLstatement?
(multiple choice)
FusionInsight HD assuming a topology, set the roadspoutConcurrency is3,bolt1Concurrency
for2,bolt2Concurrency is3,workerThe number is2,Sobolt1ofexecutorexistworkerhow to divide
Cloth?
existFusionInsight HDin the cluster,FlumeWhich service does not support writing collected data to the cluster?
Fusionlnsiht HDmiddle,Oozieclient ' sJava APIwill be called when the task is runOozieClientWhich method of the class?
existFusionInsight HDmiddle,FlumeIn a configuration file, if there are multiplesource,butsourceThe name cannot be repeated.
FusionInsight HDin, useStreamingThe command? way to submitexample.jarmiddleom huawei example
WrodCounttask, task name iswcTeat, is the following execution command correct?
for running onMapReduceThe application on the platform that this application depends onjarWhere will the bag be placed?
writingMapReduceWhich two interfaces are usually required to be implemented by developers?
FusionInsight HDsystem, aboutHiveofJDB, CInterface type, which of the following descriptions is correct?
FusionInsight HDin, belonging toStreamingWhat are the roles of the service? (multiple choice)
HDFSclient withNWhen a copy writes a file, which of the following is true about the writing process? (multiple choice)
aboutFusionInsight HDofSpark, which of the following programming languages can be used to developSparkapplication? (multiple choice)
A project needs to save the Internet access data in a certain area, and search the full text of these Internet access records to see if there is any sensitive data.
Sensitive information is used to prevent crimes in this area. In this scenario, which of the following options is the best?
existMapReduceDuring application development,setMapOutputCompressorClassWhat is the role of classes?
When the cluster is normal,RedisClient initiates oncegetCall, the client has () times of message interaction with the server?
An application requires simultaneous and twoFusionInsightsCluster interaction: both need to access the cluster1ofHBaseservice, need to visit
Ask the cluster2ofHiveServe;
So which of the following operations are required? (multiple choice)
existKafka, as follows aboutProducerWhat is wrong with the statement of sending data? (multiple choice)
FusionInsight HDmiddle,StreamingWhich of the following scenarios is applicable? (multiple choice)
HDFSIn application development of , which of the following areHDFSInterfaces supported by the service? (multiple choice)
existFusionInsight HDofHBase, which of the following scenarios will not triggerFlushoperate?
aboutStreamingthe topology (Topology), which of the following descriptions is wrong?
forHBase rowkeyThe design principles described below are correct? (multiple choice)
FusionInsight HDin, yesSolrThe creation of various resources and the use of read and write permissions, which of the following statements is wrong?
existFusionInsight HDmiddle,FlumeWhich of the following are supportedsourceTypes of? (multiple choice)
HDFSRuntime,NameNodewill load all the metadata of the file system from disk into memory, so the file system can
The total number of files stored is limited byNameNodememory capacity.
existFusionInsight HDproductSolrDuring application development, you canSolr Admin UIrightCollectionDo some verification.
Below aboutSolr Admin UIIs the statement correct? (multiple choice)
existKafka, which of the following commands can view aTopicHow many partitions are there?
FlumewriteHDFSWhen the file is generated, what are the ways of generating the file? (multiple choice)
FusionInsight Managerinterface, when receivedKafkaInsufficient disk capacity alarm, and the alarm ' s
When the cause has been ruled out for the hard disk hardware failure, the system administrator needs to consider expanding the capacity to solve this problem.
solris a high-performance, basedLucenefull-text search service.SolrxrightLuceneto expand,
Loss of fruit supportSolrCloudmodel.
existFlumeDuring cascaded transfers, you can usefail overmode transfer, so that if the next hop isFlumenode failure or
When the data is received abnormally, it can automatically switch to another way to continue transmission.
existFusionInsight HD where can I viewMapReduceThe result of running the application?
In useSolrWhen performing a full-text search, you canwtThe parameter specifies the response format of the query result. close
AtSolrThe response format of the query result, which of the following statements is wrong?
existSpark, assuminglinesIs anDStreamobject,filterStatements can be filtered out80%data for the following two
The correct statement is:
X: lines.filter(…).groupByKey(…)
Y: lines.groupByKey(…).filter(…)
FusionInsight HDin, yesSolrThe creation of various resources and the use of read and write permissions, which of the following statements is wrong?
FusionInsight HDin, aboutHive UFDSecondary development, is the following description correct? (multiple choice)
FusionInsight HDin, useStreamingThe command?way to submitexample.jarmiddleom huawei example.
WrodCounttask, task name iswcTeat, is the following execution command correct?
existKafkamiddle,ProducerThis can be done by configuring the synchronization parameters (producer.type) to ensure that data is sent in order.
existFusionInsight HDWhen developing applications with a secure version, you can usekeytabDocuments are authenticated securely.
forSpark Streamingapplication, in aJVM, there can only be one at a timeStreamingContextactive condition.
FusionInsight HD which components are provided externallySQLor classSQLability? (multiple choice)
aboutKafkaInsufficient disk capacity alarm, which of the following analysis is incorrect for the possible reasons?
FlinkThe two key elements of the program arestreamdata andtransformationoperator.
FusionInsightHDin, aboutHivepartition (partation) function, which is wrong as described below?
There are the following business scenarios: User online log files have been stored inHDFSabove, the log file content format the formula is: each online record has three fields, namely name, gender, and online time, and the fields are separated by " , " ;
It is required to print out all female netizens who spend more than two hours online. Which of the following code snippets can achieve
The above business scenario? (multiple choice)
Fasionlrnight HD Heenein the group,Table1GenusFNamespacel,Table2GenusFNamespace2,
Table1there are two1columns, respectivelyRdf11.df12,TablezThere is a column home namedf21, then which of the following
user account9AAt the same time have|ef18d21oftwrite rights. (multiple choice)
existHBaseWhile the application is running, the application can write data while creating the table.
pass throughHBaseofcreateTableThe method creates a table, what parameters must be passed in?
existFusionInsight HDclient, executeskinit{account}command is to getKDCwhich content?
existFusionInsight HDofHBase, which of the following scenarios will not triggerFlushoperate?
FusionInsight HDWhich of the following belong toOozieofMapReduce Actionconfiguration item? (multiple choice)
existSpark, assuminglinesIs anDStreamObject, which of the following statements can periodically count the number of words on this stream?
aboutFusionInsight HDplatformHiveservice, itsWebHCatDevelopment interface, which of the following descriptions is incorrect?
existFusionInsight HDin the cluster,FlumeWhich service does not support writing collected data to the cluster?
existSpark, which of the following is trueDataFrameThe operator that takes the intersection?
RedisofLISTData structure, suitable for which of the following scenarios? (multiple choice)
FusionInsight HDofHiveIn the application, there are the following scenarios:? ? ?Storage files have higher? ?efficiency, and most
Minute? ?Only a part of the letter is involved in the file, this scenario is suitable for using a column file (ORC F??)storage.
existStreamingin application development,BoltUse which of the following interfaces to sendTuple?
In the online log query scheme, the?processing to complete the calculation work. During the whole calculation process, the intermediate calculation results need to be
For temporary storage, which of the following components are suitable for storing intermediate calculation results? (multiple choice)
forFusionInsight HDplatformHBaseComponent, which properties of the secondary index need to be defined to add a secondary index? (multiple choice)
writingMapReduceWhich two interfaces are usually required to be implemented by developers?
pass throughHBaseofcreateTableThe method creates a table, what parameters must be passed in?
forN(N > 1) copies of stored documents,HDFSThe client initiates a read file request. If the read replica node fails, the
If the connection fails, it will not go to other replica nodes for reading.
An application requires simultaneous and twoFusionInsightsCluster interaction: both need to access the cluster1ofHBaseservice, need to visit
Ask the cluster2ofHiveServe;
So which of the following operations are required? (multiple choice)
A project requires Internet access to a certain area? ?Save it, and search the full text of these Internet records to see if there is any? ?information, with
to prevent crime in the region.
In this scenario, which of the following options is the best?
aboutFlumeThe characteristics of the collected data, which of the following descriptions are correct?
HDFSThe system time of the node where the client is located is the same as theFusionInsight HDThe system time of the cluster should be consistent. If there is a time difference, So the time difference should be less than a few minutes?
existMapReduceIn the development framework,InputFormatWhat is the function of the class?
There are the following scenarios: new data is generated by the online system every day500G, you need to make statistics on these data by day, week, month and other dimensions summary.
ask if it is suitable for useHiveWhat kind of table to handle?
HDFSThere is a file in the cluster and directorytext.txt, which of the following commands can find theDatNodenode
information?
FusionInsight HDofHive, user-definedUDFcan andHiveBuilt-inUDFduplicate name, in this case,
will use user-definedUDF.
existFusionInsight HDclient, executeskinit{account}command is to getKDCwhich content?
HBasetablerowkeyDesign is a very important development and design link. Suppose there is the following scenario,
The most frequent query scenario is to query the historical call records of each month and half a year based on the mobile phone number. Which of the followingrowkey
Design is optimal?
existSparkIn application development, which of the following codes can correctly count words?
FusionInsight HDsystem, aboutHiveofJDB, CInterface type, which of the following descriptions is correct of?
implementHBaseWhat parts of the data need to be read for the data read service?
(multiple choice)
Spark SQLIn the table, there are often many small files (the size is much smaller thanHDFSblock size), in this case,Sparkwill enable aTaskto process these small files, whenSQLexist in operationShufleWhen operating, will greatly increasehashThe number of dynamic buckets will seriously affect the performance.
aboutFusionInsight HDofSpark, which of the following programming languages can be used to developSparkapplication? (multiple choice)
for running onMapReduceThe application on the platform that this application depends onjarpackage will be put where?
Which of the following scenarios is notflinkWhat does the component excel at()?
(multiple choice)
Sparkis a memory-based computing engine, allSparkData during program operation can only be stored in
in memory.
useHBaseClient batch write10piece of data, aHRegionServercontains the table on the node
of2indivualRegion, respectivelyAandB,10in the data2Article belongs toA,4Article belongs toB, please write this
10pieces of data need to be sent to theHRegionServersend several timesRPCask?
existSpark, which of the following statements about broadcast variables is correct? (multiple choice)
forHBase rowkeyThe design principles described below are correct?
(multiple choice)
Solris a high-performance, basedLucenefull-text search service.SolrrightLuceneexpanded,
provides a ratioLuceneA richer query language and a powerful full-text search function are implemented, with a high degree of reliability.
Extensibility. At the same time fromSolr 4.0Version starts, supportsSolrCloudmodel.
pass throughHBaseofcreateTableThe method creates a table, what parameters must be passed in?
existFusionInsight HDproductSolrDuring application development, you canSolr Admin UIright
CollectionDo some verification. Below aboutSolr Admin UIIs the statement correct? (multiple choice)
HDFSofClientWhen writing to a file, the first copy of the data is written to the location specified byNameNodeSure,
The other replicas are written to byDataNodeSure.
Fusionlnsigt HDofHiveWhat distributed computing frameworks can components run on? (multiple choice)
HDFSThere is a file in the cluster root directorytest, which of the following commands can find the file stored in ofDataNodeNode information?
FusionInsght ManagerWhat interfaces are supported when interfacing with external management platforms?
(multiple choice)
FlinksupportLocalpattern andClusterpattern deployment(and cloud deployment), other deployment modes are not currently supported.
about the followingHBaseofBloomFilterCharacter understanding, which statement is incorrect?
existMapReduceIn the development framework,InputFormatWhat is the function of the class?
FusionInsightHDin, aboutHivepartition (partition) function, which is wrong as described below?
FusionInsight HDsystem, aboutHiveofJDB, CInterface type, which of the following descriptions is correct?
FusionInsight HDin, aboutOozieWhich of the following descriptions is correct?
(multiple choice)
forHBase rowkeyThe design principles described below are correct? (multiple choice)
FusionInsight ManagerRegarding the management operations of services, which of the following statements is wrong?
Hadoopenabled in the platformYARNWhich parameter needs to be configured for the log aggregation function of the component?
aboutFusonInsight HDofSpark, which of the following programming languages can be used to developSparkapplication? (multiple choice)
Fusionlnsigt HD one deployed inTomcatapplication on theHBaseservice, it is recommended to use in this scenario machine account.
Suppose there is an application with10Tables, each table has tens of millions of records, and the number of fields is about20indivual. now
useRedisto cache this10The data of a table, the design of its data structure, which of the following is the best design?
FusionInsight HDmiddle,StreamingWhat are the characteristics of? (multiple choice)
