A Hadoop Data Lake For Banking: A SnapLogic Story

Last week, part of the SnapLogic team was in New York City for the Strata/Hadoop World conference. It’s one of the largest big data events in the U.S. and has grown steadily larger over recent years. The agenda has shifted a bit as well – from largely academic discussions and how-to presentations by open source committers to real-world case studies by non-ISV enterprises.

With that in mind, I’d like to share a story from one of our enterprise customers. In fact, this customer is a 100+ year old financial institution. Perhaps not a company that you would associate with the cutting edge of data management technologies… Due the nature of their industry, I can’t share their name.

Like many established companies, this bank’s data processing and storage systems have been acquired or added over the years based on the most pressing needs and compliance requirements at the time. They ultimately found themselves trying to manage an unwieldy mix of 240+ interfaces and applications. Continue reading “A Hadoop Data Lake For Banking: A SnapLogic Story”

New Podcast Episode: Navigating the Data Lake – Tips From a Practitioner

In this episode of the SnapTalk podcast series, enterprise architect Ravi Dharnikota talks with Rakesh Raghavan, Director of Snap Engineering at SnapLogic. Rakesh comes to SnapLogic having designed, developed and managed data lakes for several leading online retailers and consumer-facing websites. He has successfully navigated enterprise data lakes using open source tools and manual techniques, and in this episode shares his first-hand experiences.

Ravi and Rakesh discuss the pitfalls of jumping into a data lake without a clear architecture, the challenges of supporting both traditional reporting and ad hoc data exploration use cases in the same environment, and the often-overlooked, often manual data engineering tasks involved in data lake implementation.

Subscribe to the series: https://soundcloud.com/snaplogic/sets/snaptalk

Customer Spotlight: AstraZeneca

It’s always a treat to be able to share our customers’ inspiring success stories. MWD Advisors, a UK-based industry research and advisory services firm, recently spent time with our customer, global pharmaceuticals giant AstraZeneca. MWD published the results of that interview to their research subscribers in a report titled “AstraZeneca supports strategic cloud shift with ‘citizen integrator’ platform” (reprint provided here).

AstraZeneca early iPaaS success
AstraZeneca’s early iPaaS success

AstraZeneca started their journey with SnapLogic as part of an IT initiative to make their business process infrastructure more agile, and to incorporate cloud-based platforms and applications. They wanted to efficiently integrate SaaS systems and on-premises platforms without investing in a large, specialized integration team.

Continue reading “Customer Spotlight: AstraZeneca”

New Podcast Episode: The Lifecycle of Data

Next up in our ongoing podcast series: an episode on the “lifecycle of data” featuring our guest, Enterprise Solution Architect Rich Dill. The series is hosted by our own head of enterprise architecture, Ravi Dharnikota.

In this episode, Ravi Dharnikota and Rich Dill discuss the lifecycle of data, including the transition of data storage and processing to the cloud, the implications of distributed data, a “multi-tiered data lifecycle,” and the evolution of the data lake.

You can view and subscribe to the entire series here.

SnapLogic Summer 2016 Release Now Available

Another release is in the books – today we announced the Summer 2016 SnapLogic platform update, along with several additions and improvements to our Snap library.  The release brings additions for big data integration, self-service integration, and enterprise governance and control.

As our VP Engineering Vaikom Krishnan put it:

SnapLogic Summer 2016 Release
SnapLogic Summer 2016 Release

“SnapLogic continues to break down the barriers between data and application integration in the enterprise with a converged platform that is built for self-service. The Summer 2016 release further enhances our Snap library and resources for Snap developers to help support our vision of ‘anything, anytime, anywhere’ integration.”

Highlights of this “Snappy” release include:

  • New Snaps for Apache Hive and Teradata
  • Major updates to Snaps for Anaplan and Tableau
  • Enhancements to the Mapper Snap that make it faster and simpler to search, filter and map the entries in a complex schema tree
  • User-defined pipeline parameters can now be logged and retained with runtime history in order for administrators to audit API usage and quickly debug pipeline performance issues
  • A new, seamless way to auto-shard documents across all nodes in a SnapLogic data processing Snaplex, leveraging the power of all nodes and boosting data integration performance 
  • Users can now limit invocation of triggered tasks to one instance at a time for more granular control and to avoid overloading resources.

We’re also excited for the launch of the new  Snap developer site. It’s easy to use, mobile-friendly and full of practical guidance for our customers and partners building and maintaining their own Snaps.

For more information on the Summer 2016 release, including demo videos, see: https://www.snaplogic.com/summer2016

SnapTalk Podcast: Streaming Data, part 2

Next up in our ongoing podcast series: episode 2 on streaming data and the unique challenges of stream data processing. The series is hosted by our own head of enterprise architecture, Ravi Dharnikota.

In this episode Ravi again talks with Dr. Greg Benson, SnapLogic’s Chief Scientist and professor of computer science at University of San Francisco. In this episode they continue the conversation on the challenges of streaming data, including a discussion of Apache Kafka, its advantages, and how it differs from older messaging systems.

Here is part two of the two-part episode:

You can view and subscribe to the entire series here.

SnapTalk Podcast: Streaming Data With Dr. Greg Benson

New in our ongoing podcast series — called SnapTalk, of course — is a two-part episode on streaming data and the unique challenges of stream data processing. The series is hosted by our own head of enterprise architecture, Ravi Dharnikota.

In this episode Ravi talks with Dr. Greg Benson, SnapLogic’s Chief Scientist and professor of computer science at University of San Francisco. Holding these dual roles, Greg has a unique perspective on the world of big data and the rapidly-changing landscape of big data technologies.

Here is part one of the two-part series:

Next up: part 2 of streaming data.

You can view and subscribe to the entire series here.