ACA-BigData1 ACA Big Data Certification Exam Questions and Answers

Questions 4

Alibaba Cloud E-MapReduce can be easily plugged with other Alibaba Cloud services such as Log Service, ONS, MNS that act as data ingestion channels from real-time data streams. Which of the following descriptions about real-time processing are correct? (Number of correct answers: 3)

Options:

A.

This data is streamed and processed using Apache Flume or Kafka in integration with Apache Storm using complex algorithms

B.

Kafka is usually preferred with Apache Storm to provide data pipeline

C.

The final processed data can be stored in HDFS, HBase or any other big data store service in real time.

D.

Apache Sqoop is used to do the real-time data transmission of structured data

Questions 5

_______ instances in E-MapReduce are responsible for computing and can quickly add computing power to a cluster. They can also scale up and down at any time without impacting the operations of the cluster.

Score 2

Options:

A.

Task

B.

Gateway

C.

Master

D.

Core

Questions 6

MaxCompute Tunnel provides high-concurrency data upload and download services. Users can use the Tunnel service to upload data to or download data from MaxCompute. Which of the following descriptions about Tunnel is NOT correct?

Score 2

Options:

A.

MaxCompute Tunnel provides the Java programming interface for users

B.

MaxCompute provides two data import and export methods: using Tunnel Operation on the console directly or using TUNNEL written with Java

C.

If data fails to be uploaded, use the restore command to restore the upload from where it was interrupted

D.

Tunnel commands are mainly used to upload or download data. They provide the following functions: upload, download, resume, show, purge, etc.

Questions 7

If a task node of DataWorks is deleted from the recycle bin, it can still be restored.

Options:

A.

True

B.

False

Questions 8

When odpscmd is used to connect to a project in MaxCompute, the command ______ can be executed to view the size of the space occupied by table table_a.

Score 2

Options:

A.

select size from table_a;

B.

size table_a;

C.

desc table_a;

D.

show table table_a;

Questions 9

In the DataWorks table permission system, you can revoke permissions only on the fields whose security level is higher than the security level of your account.

Score 1

Options:

A.

True

B.

False

Questions 10

MaxCompute supports two kinds of charging methods: Pay-As-You-Go and Subscription (CU cost). Pay-As-You-Go means each task is measured according to the input size by job cost. In this charging method the billing items do not include charges due to ______.

Score 2

Options:

A.

Data upload

B.

Data download

C.

Computing

D.

Storage
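
As a rough illustration of the input-size-based measurement described in the question above, the following is a minimal sketch of how a Pay-As-You-Go computing charge could be estimated. The function name, the complexity factor, and the unit price are illustrative assumptions and are not taken from this question or from an official price list.

```python
def estimate_job_cost(input_gb: float,
                      unit_price_per_gb: float,
                      complexity: float = 1.0) -> float:
    """Estimate a Pay-As-You-Go computing charge for one job.

    Assumption: the charge scales with the amount of input data scanned,
    multiplied by a complexity factor and a per-GB unit price. All three
    values are hypothetical inputs supplied by the caller.
    """
    if input_gb < 0 or unit_price_per_gb < 0 or complexity <= 0:
        raise ValueError("inputs must be non-negative and complexity positive")
    return input_gb * complexity * unit_price_per_gb


# Hypothetical job: 50 GB of input, complexity factor 1.5, 0.04 per GB.
cost = estimate_job_cost(input_gb=50.0, unit_price_per_gb=0.04, complexity=1.5)
print(f"Estimated computing charge: {cost:.2f}")
```

The sketch covers only the computing part of a bill; other billing items would need their own terms.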

Questions 11

In DataWorks, a scheduled task must be instantiated before each run; that is, a corresponding instance is generated and then executed to run the scheduled task. The status is different in each phase of the scheduling process, including ________. (Number of correct answers: 3)

Options:

A.

Not running

B.

Running

C.

Running Successfully

Questions 12

Which node type in DataWorks lets you edit Python code to operate on data in MaxCompute?

Score 2

Options:

A.

PyODPS

B.

ODPS MR Node

C.

ODPS Script Node

D.

SHELL node

Questions 13

In MaxCompute, you can use the Tunnel command line for data upload and download. Which of the following descriptions of Tunnel commands is NOT correct?

Score 2

Options:

A.

Upload: Supports file or directory (level-one) uploading. Data can only be uploaded to a single table or table partition each time.

B.

Download: You can only download data to a single file. Only data in one table or partition can be downloaded to one file each time. For partitioned tables, the source partition must be specified.

C.

Resume: If an error occurs due to the network or the Tunnel service, you can resume transmission of the file or directory after interruption.

D.

Purge: Clears the table directory. By default, use this command to clear information of the last three days.
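
The four Tunnel commands named in the options above can be illustrated with a small helper that only builds command strings to paste into odpscmd; it does not contact MaxCompute. The table, file, and partition names are hypothetical, and the sketch assumes the general forms `tunnel upload <path> <table[/partition]>`, `tunnel download <table[/partition]> <path>`, `tunnel resume [session_id]`, and `tunnel purge [n]`, which should be checked against the Tunnel command reference before use.

```python
from typing import Optional


def _target(table: str, partition: Optional[str]) -> str:
    # A partition spec, if given, is appended to the table name after "/".
    return f"{table}/{partition}" if partition else table


def tunnel_upload(path: str, table: str, partition: Optional[str] = None) -> str:
    """Build an upload command for one table or one table partition."""
    return f"tunnel upload {path} {_target(table, partition)};"


def tunnel_download(table: str, path: str, partition: Optional[str] = None) -> str:
    """Build a download command that writes one table or partition to one file."""
    return f"tunnel download {_target(table, partition)} {path};"


def tunnel_resume(session_id: Optional[str] = None) -> str:
    """Build a resume command, optionally for a specific session."""
    return f"tunnel resume {session_id};" if session_id else "tunnel resume;"


def tunnel_purge(days: Optional[int] = None) -> str:
    """Build a purge command, optionally with an explicit number of days."""
    return f"tunnel purge {days};" if days is not None else "tunnel purge;"


# Hypothetical names, for illustration only.
print(tunnel_upload("log.txt", "test_project.test_table", 'p1="b1"'))
print(tunnel_download("test_project.test_table", "out.txt", 'p1="b1"'))
print(tunnel_resume())
print(tunnel_purge(5))
```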

Questions 14

If the DataWorks (MaxCompute) tables in your request belong to two owners, Data Guard (a DataWorks component) automatically splits your request into two by table owner.

Score 1

Options:

A.

True

B.

False

Questions 15

Machine Learning Platform for Artificial Intelligence (PAI) node is one of the node types in DataWorks business flow. It is used to call tasks created on PAI and schedule production activities based on the node configuration. PAI nodes can be added to DataWorks only _________.

Score 2

Options:

A.

after PAI experiments are created on PAI

B.

after PAI service is activated

C.

after MaxCompute service is activated

D.

Spark on MaxCompute Machine Learning project is created

Questions 16

Data Migration Unit (DMU) is used to measure the amount of resources consumed by data integration, including CPU, memory, and network. One DMU represents the minimum amount of resources used for a data synchronization task.

Score 1

Options:

A.

True

B.

False

Questions 17

Your company stores user profile records in an OLTP database. You want to join these records with web server logs you have already ingested into the Hadoop file system.

What is the best way to obtain and ingest these user records?

Options:

A.

Ingest with Hadoop streaming

B.

Ingest using Hive

C.

Ingest with sqoop import

D.

Ingest with Pig's LOAD command

Questions 18

Function Studio is a web project coding and development tool independently developed by the Alibaba Group for function development scenarios. It is an important component of DataWorks. Function Studio supports several programming languages and platform-based function development scenarios except for ______.

Score 2

Options:

A.

Real-time computing

B.

Python

C.

Java

D.

Scala

Questions 19

DataV is a powerful yet accessible data visualization tool, which features geographic information systems allowing for rapid interpretation of data to understand relationships, patterns, and trends. When a DataV screen is ready, it can be embedded into the existing portal of the enterprise through ______.

Options:

A.

URL after the release

B.

URL in the preview

C.

MD5 code obtained after the release

D.

Jar package imported after the release

Questions 20

The FTP data source in DataWorks allows you to read/write data to FTP, and supports configuring synchronization tasks in wizard and script mode.

Score 1

Options:

A.

True

B.

False

Questions 21

DataWorks can be used to develop and configure data sync tasks. Which of the following statements are correct? (Number of correct answers: 3)

Score 2

Options:

A.

The data source configuration in the project management is required to add data source

B.

Some of the columns in source tables can be extracted to create a mapping relationship between fields, and constants or variables can't be added

C.

For the extraction of source data, "where" filtering clause can be referenced as the criteria of incremental synchronization

D.

Clean-up rules can be set to clear or preserve existing data before data write

Questions 22

Scenario: Jack is the administrator of project prj1. A new team member, Alice (who already has an Alibaba Cloud account, alice@aliyun.com), applies to join this project with the following permissions: view table lists, submit jobs, and create tables. Which of the following SQL statements is useless?

Score 2

Options:

A.

use prj1;

B.

add user aliyun$alice@aliyun.com;

C.

grant List, CreateTable, CreateInstance on project prj1 to user aliyun$alice@aliyun.com;

D.

flush privileges;

Questions 23

MaxCompute takes Project as a charged unit. The bill is charged according to three aspects: the usage of storage, computing resource, and data download respectively. You pay for compute and storage resources by the day with no long-term commitments.

Score 1

Options:

A.

True

B.

False

Exam Code: ACA-BigData1
Exam Name: ACA Big Data Certification Exam
Last Update: May 5, 2026
Questions: 78
