Playground

# Introduction to Teradata Aster Analytics

This a note for Teradata Aster Basics 6.10 Exam a.k.a TACP(Teradata Aster Certified Professional).

Recommended courses are followings and this note is for the 3rd course.

• Teradata Certification, What’s New and How to Prepare
• Introduction to Big Data and Teradata Aster*
• Introduction to Teradata Aster Analytics

Map function doesn’t have PARTITION BY
Reduce function has PARTITION BY
PARTITION BY affects SHUFFLE phases

Quiz
What are the two different types of SQL-MR functions? (Choose two.)
Partition functions (Reduce) & Row functions (Map)

In SQL-MR the ON clause can be what three things? (Choose three)
Function, Table & Query

Fill in the missing word. How do you distinguish between a Map vs. Reduce function? A REDUCE function has a PARTITION BY clause, whereas the Map function does not

What criteria would you use to determine if you want to run SQL versus SQL-MR? Select the four criteria that are better suited to run with SQL-MR. (Choose four)
Unstructured or multi-structured data, Machine learning algorithm, Recursive querie & Self-joins

Acquistion functions

load_from_hadoop
anydatabase2aster


Define foreign server

create foreign server hdp21
using server('192.168.100.21')

create foreign server td15
using tdpid('192.168.100.15')


Pull & Push-down query

--pull
select c1, sum(c2)
from t1@td15
group by 1;

--push down
select * from FOREIGN SERVER
($$select c1, sum(c2) from t1 group by 1$$)@td15;


Quiz
Which two Teradata QueryGrid connectors can acquire data for Aster? (Choose two.

Why move data between Teradata and Aster? Match Aster and Teradata to what each database is best designed for.
Aster - for analytics by limited number of data scientists
Teradata - for high concurrency (hundreds of users)

What are some Teradata Aster parser functions?
Apache logs, xml, json and pst

Using the Stream API, you can write functions in programming languages that are not native to Teradata Aster (e.g., write non SQL-MR or SQL-GR functions) and run them on Aster, generating output that Aster can receive, including:

• writing R functions to run on Aster
• write custom python, perl, C/C++/C# functions to run on Teradata Aster

Quiz
nPath is used for Pattern Matching across Time Series
What three expressions are used to specify input data for nPath? on, partition by and order by What three expressions are used to specify nPath search criteria? mode, pattern and symbols
What kind of function is Kmeans? clustering
What kind of function is Decision Tree? predictive function

Quiz
What visualization function(s) are in Teradata Aster AppCenter?
Visualizer (formerly nPathViz and cFilterViz)

What needs to be configured before building a new Application?
Create a JDBC connection

Name three Data Format types. (Choose three)
nPath, Table and cFilter

Name four different chart types that Teradata Aster AppCenter visualizations create. (Choose four)
Tree, SanKey, Sigma and Chord

How can users dynamically change Teradata Aster AppCenter chart visualizations?
By clicking on objects and/or by changing Layout/Format specs

Quiz
How do you connect to Aster via RStudio?
Aster ODBC driver

What is the name of the Aster package for Teradata Aster R?

You want to access Help for Teradata Aster R to see a list of commands. What syntax would accomplish this?

Final Exam SCORE: 96 PASSED Question 1 Correct

1. True or False: Map-Reduce is a programming model and an associated implementation for processing and generating large data sets.
2. Each Map function performs an ETL on ____ in the input.
3. The ___ gets a key and the array of values emitted with that key and produces the final result.
4. The SQL-MR syntax ON clause specifies the input rows, which can be a ___. (Choose four)
5. Does the syntax use a Map Function or a Reduce Function? Drag and Drop the Map Function and Reduce Function labels (at left) to the correct syntax (at right).
6. True or False: Functions can be Map and Reduce functions at the same time.
7. In the syntax below, click on the input.
8. Match the function (at left) with its description (at right):
9. The _____ is used for clustering. Clustering is a fast/simple method for grouping objects into preliminary clusters using an approximate distance method. Each point is represented as a point in a multidimensional feature space.
10. True or False: Map-Reduce is a programming model and an associated implementation for processing and generating large data sets.
11. The _____ can extract multiple columns of structured data from standard Apache Web Logs.
12. Match the function (at left) with what it’s used for (at right):
13. This question tests your knowledge of nPath pattern matching using the mode: non-overlapping and the pattern: ‘B+.C.A’. Given this input table and nPath syntax, which pattern matches will be in the output rows?
14. This question tests your knowledge of nPath pattern matching using the mode: overlapping and the pattern: ‘B+.C.A’. Given this input table and nPath syntax, which pattern matches will be in the output rows?