All-in-one data and application integration platform.

Mobilize data to the cloud with visual ETL/ELT and reverse ETL.

Connect every application with our no-code/low-code iPaaS solutions.

API Management

Efficiently create, manage, and secure all your APIs at scale.

AutoSync

What is Data Integration?

Experience the leading self-service integration platform for yourself.

GenAI BuilderNEW

Create LLM-powered applications and automations in minutes.

Use Case Overview

Bring automation to every part of your organization.

By Industry

By Function

By Popular Workflow

By Solution

Generative Integration

Legacy Modernization

Enterprise Automation

Cloud Data Warehouse

Informatica Replacement

Partners Overview

Drive profitability and growth through joint sales and marketing strategies.

OEM/Embedded

Partnerships for ISV, MSP, OEM, and Embedded providers.

Become a Partner

Gain access to a world-class partner ecosystem.

Access the SnapLogic Partner Connect Portal.

Get free access to SnapLogic Partner resources.

Explore our Partners

Search for partners in our robust global network.

Customers Overview

Our customers’ accomplishments continue to shape SnapLogic’s success.

Integration Nation

Our community for thought leadership, peer support, customer education, and recognition.

MVP Program

Recognizing individuals for their contributions to the SnapLogic community.

SnapLogic Academy

Enhance your expertise about intelligent integration and enterprise automation.

Customer Awards

Highlighting customers and partners who have transformed their organizations with SnapLogic.

Case Studies

Learn more about how our customers benefit from using SnapLogic.

Resource Library

Our home for eBooks, white papers, videos, and more.

SnapLogic is here to support you throughout your entire experience.

Evolving the Enterprise Virtual Summit

Watch the sessions on-demand!

Generative AI Survey Report

Insights from 900+ respondents

Home ❯ Machine Learning Showcase ❯ The Decision Tree

The Decision Tree

Problem: Train a model to: 1) distinguish between different species of the Iris flower based on four features; and 2) predict which passengers survived on the Titanic based on eight different features –all using the decision tree method.

Context: The decision tree is a simple yet powerful machine learning algorithm. It is easy to understand and has been in circulation for a long time.

Model type: Decision tree

What we did: In developing the machine learning model, we started with the k-fold cross-validation process, in which we first split the training dataset into k-chunks. We then trained the model on the k-1 chunks and evaluated the model on the last chunk. We repeated this process while computing the average accuracy of the model’s outputs. When the cross-validation results were satisfactory, we then trained the model on the whole training dataset.

In this demo, we have two datasets: the Iris Flower and Titanic. For the Iris Flower dataset, the model reads four flower measurements (a.k.a., inputs or features) to determine which species of Iris Flower is in question. The four inputs are: sepal length, sepal width, petal length, and petal width.

For the Titanic dataset, the model reads eight features about each passenger to determine whether a given passenger did or did not survive the sinking of the Titanic.

Choose a dataset below and then try cross-validating, training, and/or training and testing the decision tree model.

Training Set:

Test Set:

Select the dataset and operation.

Try self-service machine learning today

Start Free Trial