As the final piece of our recent blog series on Apache Spark, on 16 May we presented the details of a use case for Spark Structured Streaming to generate real-time Read more
Get an insight from the minds behind OpenCredo.
Navigate through the thoughts and advice of our experienced consultants, who possess rich capabilities across the entire software development spectrum.
This is the third article in our Spark-related blog series, which covers some basic approaches to testing Cassandra/Spark code. It will show you how to restructure your code to be testable, covering unit testing, integration testing, and acceptance testing Read more
This is the second article in our Spark-related blog series which moves on from demonstrating functionality and describes the details of how to set up an infrastructure capable of running such analytical processing Read more
Interested in learning more about the practical and pragmatic aspects of using Apache Spark for your data processing challenges? Please join us for our new blog and webinar series: Apache Spark - The Pragmatic Bits. The blog series will aim to cover topics including how Spark can be used to get the most out of your Cassandra setup, how to actually deploy a Spark and Cassandra cluster through programmable infrastructure, and how to ensure you write testable Spark code which will play nicely with the rest of your system. The series finishes with a webinar which explores the use case of “Detecting stolen AWS credential usage with Spark”. Read more
Recently I've been doing a lot with Terraform; having briefly flirted with it in the past, it's only now with v0.8.x that I've been seriously stepping out with it (and Read more
Cassandra is a great distributed datastore for high-volume transactional systems, but it is challenging to use for certain types of queries or data analytics. Adopting Spark alongside Cassandra, however, results in an extremely powerful data platform. This is an introduction to what these technologies are capable of together. Read more
This post is part of a series which introduces key concepts in successful test automation. Each post contains sample code using the test-automation-quickstart project, a sample Java test automation framework available from GitHub. Read more
In this article, I argue that programmable infrastructure is similar to software, and needs testing too. I describe some of the unique challenges the industry faces in adopting infrastructure testing: both cultural and technical. I end by pointing out where we can go from here: we need to talk about it more, identify best practice, and improve our tooling. Read more
Google has recently made its internal Spanner database available to the wider public, as a hosted solution on Google Cloud. This is a distributed relational/transactional database used inside Google for various projects (including F1, the advertising backend), promising high throughput, low latency and 99.999% availability. As such it is an interesting alternative to many open source or other hosted solutions. This blog takes a closer look at Google Cloud Spanner. Read more
Introducing Type Aliases: As Kotlin's 1.1 release draws closer, I've been looking at some of the new language features it supports. Type aliases may seem like a relatively minor feature next Read more