Big Data

Home/Posts/Big Data

Data Transformations: Box-Cox

Data transformations are useful and necessary in some statistical applications. We will discuss one of the most famous ones, the Box-Cox transformation.  We will use two of the most often used software programs for Six Sigma analysis: SigmaXL and Minitab to implement the Box-Cox Transformation. First, let’s briefly remind you of why a transformation is [...]

Cleaning Data for Analysis

When you acquire data for analysis, it is highly unlikely that it will be in a form ready to analyze. Experienced data analysts and statisticians know that data preparation or data cleaning is the most time-consuming part of data analysis but incredibly important, otherwise any conclusions drawn from your analysis will be suspect. We’ll explore [...]

Cluster Analysis: K-Means and Hierarchical

Cluster analysis is an extremely important set of techniques to determine groups with similar characteristics. Often cluster analysis is used to determine similar groups for marketing products. This is called market segmentation. In this excellent video, Luis Serrano, Quantum Artificial Intelligence Research Scientist, introduces the two of the most common methods for splitting data into [...]

Top 10 Mistakes in Data Management

With today’s myriad ways for generating huge amounts of data, organizations need more than ever to have a data management plan. Jared Hillam, VP of Emerging Technologies at INTRICITY, discusses the top ten mistakes made in data management. Here is his list. Flaky Data Management Plan No strategy in place for managing data Tools Used [...]

Go to Top