Get a Quote


Self-Service Data Transformation: Intro to Oracle Data Flow & Tableau Prep

Share This Post

In the world of self-service analytics, Tableau and Oracle Data Visualization are two tools that are often put on the same discussion platform. In the last year, the conversations surrounding these two tools have increased dramatically — with most of our clients using self-service analytics. In this blog, I am not going to do a comparison rundown between Tableau and Oracle DV. What I do want to show you is two similar tools which introduce exciting new possibilities: Tableau Prep and Oracle Data Flow.

First, let’s talk about the concept that both Tableau Prep and Oracle Data Flow share. They allow users to take one or more data sets (or sources) and organize, transform, and integrate them into a single data set to use for visualizations. Throughout this process, one can…

  • Join
  • Filter
  • Remove or add columns
  • Create new groupings
  • Perform calculations
  • Perform aggregations
  • And use a variety of integrated functions to accomplish the desired results

part 1_1

Now, you may be thinking “this sounds a bit familiar.” Isn’t that what ETL (or ELT for the ODI peeps) is? Yep. Due to the simplistic nature of both the Tableau Prep and Oracle Data Flow interfaces, it could potentially put this process in the hands of analysts, report developers, and high-level users.

Part 1_2

Before the Data Integration specialists start filling the comments section questioning my sanity and my mom and dad’s parenting decisions, let me expound a little more on where I think this fits in.

Rather than replacing the formal ETL process you may have in place, I believe that Tableau Prep and Data Flow are meant to play a complementary role to these processes. To put it simply, you aren’t going to be connecting to a source that contains millions upon millions of rows of data and then transforming everything to put it into a file to report off of. What you can do is quickly and efficiently create a curated dataset on a case-by-case basis without the need to request changes or additions to be made to the formal ETL process.

Now that we’ve covered what Tableau Prep and Oracle Data Flow do at high level, let’s take a look at each tool individually.

Tableau Prep

Formally called Project Maestro, Tableau Prep is the much-anticipated data preparation tool that is meant to be used in conjunction with Tableau Desktop, Server, and Online. Unlike Oracle Data Flow (which is a part of Oracle Data Visualization), Tableau Prep is its own tool and requires a separate installation.

Now, let’s take a look at the tool itself and do a brief overview of the interface and some of the Tableau Prep functions.

For those of you who are familiar with Tableau, the Tableau Prep interface will feel very familiar. Upon opening Tableau Prep, the user is presented with the homepage, where you can…

  • Choose to make connections to sources (files or database)
  • Open existing workflows
  • Explore documentation, tutorials, and blog posts about Tableau Prep

Part 1_3

Once a connection to a source has been made, the user is taken to the workflow. A workflow (or Flow) is where the user can design the transformation process to produce the desired output.

Part 1_4

When designing a Flow, the user can create joins between multiple sources…

Part 1_5

Use operations to clean, rename, remove, split, or filter columns…

Part 1_6

Create new calculated fields…

Part 1_7

Create aggregates…

Part 1_8

And create one or more output stages that produce files…

Part 1_9

Part 1_10

…that can be used with Tableau Desktop, Tableau Server, or Tableau Online.

Part 1_11

Once a Flow has been created and saved, it can be modified or run multiple times to update the source file with new data. In part two of this three-part series, I will take an in-depth look at Tableau Prep and its features. For now, let’s move on and take a quick look at Oracle Data Flow.

Oracle Data Flow

Data Flow is a part of the Oracle Data Visualization suite of products and is included with the latest on-prem release of OBIEE (provided you have the additional Data Visualization license), Oracle Data Visualization Desktop (Data Visualization license also required), and Oracle Analytics Cloud (OAC). Because it’s a feature and not a standalone tool, there is no additional configuration or installation required after OBIEE, Data Visualization Desktop, or OAC have been properly configured. Let’s take a brief look at Data Flow and some of its features.

Creating a Data Flow is done from the Oracle DV or DVD home screen from the create menu.

Part 1_12

From here, the user is prompted to select a source. Sources can be created from connections to files (.csv, .xlsx), databases, or other Oracle applications.

Part 1_13

 Part 1_14

Once a source is selected, the Data Flow edit window is displayed. Here, a user can bring in additional sources if needed…

Part 1_15

Create joins….

Part 1_16

Part 1_17

Add filters…

Part 1_18

Part 1_19

Remove and rename columns…

Part 1_20

Add calculated columns…

Part 1_21

Create aggregates…

Part 1_22

…or add a number of other functions such as machine learning, forecasting, custom scripts, custom grouping, or bins.

 Part 1_23

The last step in a Data Flow can be three different kinds of outputs.

  • An Essbase Cube – requires an Essbase connection to be established beforehand.
    Part 1_24
  • Data Set Storage – for use with Oracle DV.
    Part 1_25
  • Database Connection – requires a database connection to be set up beforehand.
    Part 1_26

Another feature in Oracle DV that goes hand in hand with Data Flow is Sequences. Sequences allow Data Flows to be kicked off in a specified order and allow for a more streamlined workflow when it comes to updating sources that are created from Data Flow.

Part 1_27

That concludes part one of our Self-Service Data Transformation blog series. In part two, I am going to take you through an in-depth look at Tableau Prep. Thanks for reading!

You might also be interested in…


Subscribe to Email Updates