May 22, 2017 | Data Analysis

Detecting stolen AWS credential usage with Apache Spark – Webinar Recording

As a final piece of our recent blog series about Apache Spark on 16 May we have presented details of a use-case about using Spark Structured Streaming to generate real-time alerts of suspicious activity in an AWS-based infrastructure.

This blog is written exclusively by the OpenCredo team. We do not accept external contributions.

WRITTEN BY

Danielle Blake

Marketing & Brand Manager

If you missed the webcast or would like to watch it again, here is a recording. You can also view this on our YouTube channel

Check out the full blog series, too: Spark, the pragmatic bits

Data analytics using Cassandra and Spark by David Borsos

Cassandra is a highly performant database when used to store large amounts of data, and performing queries for which it has been optimized. However, when it comes to trying to analyze and gain broader insight from the data captured, Cassandra can be cumbersome to work with, and may not be as performant and scalable as needed. This article demonstrates how you can practically combine Apache Spark with Apache Cassandra in order to better deal with such scenarios.

Deploy Spark with an Apache Cassandra cluster by David Borsos

This post will show how you can deploy the open source version of Apache Spark alongside an Apache Cassandra cluster. It also includes a programmable infrastructure code example.

Testing Spark by Matt Long

The ability to write and run adhoc Spark queries is helpful for getting immediate insight into certain data problems, but what happens when these queries needs to form part of a bigger software system? Matt takes you on a journey looking at how you may need to take existing Spark code (in fact the same demo code used in David’s first article), and refactor it in order to make it more testable.

RETURN TO BLOG

SHARE

SIMILAR POSTS

Let’s Flink on EKS: Data Lake Primer

Check out the latest blog by Our Senior Consultant Howard Hill where he offers an engineer’s guide to streamlining real-time data using an open-model infrastructure.…

Yow! London – Searching for Research Fraud in OpenAlex with Graph Data Science (Recording)

Check out our Lead Consultant Ebru Cucen and Sage Publishing Data Scientist Adam Day co-present on “Research Fraud Detection in OpenAlex with Graph Data Science”…

Ingesting Big Data into Neo4j – Part 3

Check out the last part of Ebru Cucen and Fahran Wallace’s blog series, in which they discuss their experience ingesting 400 million nodes and a…

Blog