Author: Cassandra Sampson

Troubleshoot a Failed Spark Job – Troubleshoot Data Storage ProcessingTroubleshoot a Failed Spark Job – Troubleshoot Data Storage Processing

It is inherently difficult to train or document how to troubleshoot technical problems because of the wide variety of symptoms one is exposed to. That means when an example is used to teach troubleshooting, it will most likely not be one that the person being trained will experience. Instead, there are two points to make […]

Optimize Pipeline for Descriptive versus Analytical Workloads – Troubleshoot Data Storage ProcessingOptimize Pipeline for Descriptive versus Analytical Workloads – Troubleshoot Data Storage Processing

The “Analytics Types” section in Chapter 2 described the numerous categories of data analytics—descriptive, diagnostic, predictive, preemptive, and prescriptive—each of which is an analytical workload. This is concluded by what you learned in the previous section: that OLTP operations are transactional, and OLAP operations are analytical. With the review of those five data analytics types, […]