Louis Mugnano 3/2/26

Our Blogs: A Curated Guide to Our Technical Posts

This curated guide follows the journey from installation and onboarding to automation, data modeling, advanced analytics, and machine learning. The posts are organized to reflect the natural progression of a Greenplum deployment.

Louis Mugnano 3/2/26

When “Nothing Changed” Breaks Your Greenplum Performance

Experiencing Greenplum performance issues after a network change? If Informatica ETL jobs are suddenly slow but CPU, memory, and disk look normal, the problem may be hidden in the Greenplum interconnect layer. This post explains how subtle backend network behavior, UDP retries, and inter-segment communication issues can quietly degrade MPP database performance.

Greg Spiegelberg 1/3/26

Rethinking Disaster Recovery for MPP Databases - Part 2

Part 2 dives into the discovery phase of a real-world DR redesign, uncovering the constraints that matter most at scale—WAL volume, retention windows, and storage behavior. It shows why understanding these realities is critical before any recovery architecture can succeed.

Nicholas Diglio 12/1/25

Putting It All Together: ClearVesta – The Property Management Data Pipeline Demo

Property-management platforms provide rich APIs, but inconsistent schemas make analytics difficult. The real work is standardizing REST, JSON, and CSV feeds into a clean Type-2 dimensional model that supports KPIs and compliance reporting.

Mugnano Data Consulting’s demo pipeline shows how this can be done automatically with minimal custom code.
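
For readers new to the term, a Type-2 dimension keeps every historical version of a record instead of overwriting it. The sketch below is not taken from the ClearVesta pipeline; it is a minimal illustration, with hypothetical names (`DimRow`, `apply_type2`) and made-up sample values, of how an incoming change closes the current row and opens a new one.

```python
from dataclasses import dataclass, replace
from datetime import date
from typing import Optional


@dataclass
class DimRow:
    """One version of a dimension member (hypothetical layout)."""
    business_key: str
    attributes: dict
    valid_from: date
    valid_to: Optional[date] = None  # None marks the current version


def apply_type2(history, business_key, attributes, as_of):
    """Return a new history list with the incoming record applied as a Type-2 change."""
    current = next(
        (r for r in history if r.business_key == business_key and r.valid_to is None),
        None,
    )
    # Nothing changed: keep the history exactly as it was.
    if current is not None and current.attributes == attributes:
        return list(history)

    updated = []
    for row in history:
        if row is current:
            # Close out the old version instead of overwriting it.
            updated.append(replace(row, valid_to=as_of))
        else:
            updated.append(row)
    # Open a new current version effective from `as_of`.
    updated.append(DimRow(business_key, attributes, as_of))
    return updated


# Illustrative data only: a unit whose rent changes mid-year.
history = apply_type2([], "unit-101", {"rent": 1200}, date(2025, 1, 1))
history = apply_type2(history, "unit-101", {"rent": 1300}, date(2025, 6, 1))
```

After the second call, the history holds two rows for the same unit: the closed-out original and the new current version, which is what lets reports choose between "as-was" and "as-is" views.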

Greg Spiegelberg 11/28/25

Rethinking Disaster Recovery for MPP Databases - Part 1

As MPP PostgreSQL platforms grow, traditional logical backup strategies quietly stop meeting business expectations. Part 1 sets the stage by examining why multi-day backups and restores are no longer acceptable, and why disaster recovery needs to be rethought before scale makes change unavoidable.

Louis Mugnano 11/16/25

Greenplum Architecture Assessment Automation

From script sprawl to structured, drill-down diagnostics.

When I assess customer Greenplum environments, I lean on a suite of lightweight shell utilities that make deep inspection fast and repeatable. At the core is gpview.sh, an interactive catalog viewer. Around it are wrappers for scheduled health checks, single-report runs, and persistence for trending over time.

Louis Mugnano 10/11/25

Kickstarter: Automating Backup, Replicate & Restore in Greenplum

Disaster Recovery (DR) isn’t just “take a backup.” At MPP scale, you need a repeatable end-to-end flow with minimal coordination and downtime. This post walks through the DBA Operations Kickstarter framework that wraps gpbackup, gpbackup_manager, and gprestore into a fully automated DR pipeline.

Louis Mugnano 10/2/25

Turn “Ad-Hoc Chaos” into Trusted Self-Service BI

Self-service BI works when the data foundation removes friction for business users. Our reusable dimensional framework gives you trusted definitions, “as-was” and “as-is” choices, foolproof joins, and upstream data quality checks, so teams can explore confidently without hand-coding SQL or exporting to Excel (or, god help us all, MS Access).

Louis Mugnano 9/26/25

Kickstarter: Automating Partition Maintenance in Greenplum

Partitioning is a cornerstone of scalable analytics in Greenplum Database. The hardest part of partitioning isn’t the design; it’s keeping partitions current. The Kickstarter Partition Maintenance toolset is a set of Bash/Python utilities that operationalizes this lifecycle for Greenplum, and this post dives into those tools.
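
To make "keeping partitions current" concrete, here is a minimal sketch of the kind of check such tooling might run: given the month-start dates that already have partitions, work out which upcoming months still need one. This is not the Kickstarter code; the function names, the monthly granularity, and the three-month lookahead are all assumptions chosen for illustration.

```python
from datetime import date


def month_starts(first, count):
    """Yield `count` consecutive month-start dates, beginning with the month of `first`."""
    year, month = first.year, first.month
    for _ in range(count):
        yield date(year, month, 1)
        month += 1
        if month > 12:
            year, month = year + 1, 1


def missing_partitions(existing, today, months_ahead=3):
    """Return month-start dates, from the current month through the lookahead, with no partition yet."""
    wanted = month_starts(today, months_ahead + 1)  # current month plus lookahead
    return [m for m in wanted if m not in existing]


# Illustrative input only: partitions exist for November and December 2025.
existing = {date(2025, 11, 1), date(2025, 12, 1)}
to_create = missing_partitions(existing, today=date(2025, 11, 16))
```

A scheduled job could feed the resulting list into whatever statement adds the partitions, so the table never runs out of room for incoming data.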

Toby Keith Mugnano 8/30/25

My Best Friend Lambchop 🐑🐾

This is what happens when you leave your laptop where your dog can reach it: he blogs about his favorite toy and, of course, Greenplum. Enjoy a lighthearted story from Toby!

Louis Mugnano 8/15/25

Push the Logic to the Data: Using PL/Python and GreenplumPython for Scalable In-Database Processing

This post demonstrates how to leverage PL/Python and the GreenplumPython library to push data transformation logic directly into the Greenplum Database. This approach aligns with a key best practice in big data architecture: move processing to the data, not the other way around. The result is a more scalable, more secure, and more cost-effective solution for your Python dev team.

Louis Mugnano 8/8/25

Predicting Customer Churn Using Greenplum and gpmlbot

This blog summarizes a demonstration delivered to a customer interested in understanding how Greenplum can be used for machine learning, without exporting data to external systems. The objective was to showcase how churn modeling can be prototyped and iterated entirely within Greenplum using the gpmlbot utility, which automates feature preparation, model training, and evaluation.

Louis Mugnano 7/6/25

Automating Agile Data Onboarding with Greenplum Sailfish

Louis Mugnano 7/3/25

Accelerate Greenplum Adoption with the GPRA Onboarding Service

We understand the challenges of provisioning and operationalizing a Greenplum or Cloudberry data warehouse environment. Whether you’re deploying on physical infrastructure, virtual platforms, or the cloud, setting up a high-performance MPP database requires precision, expertise, and automation.

Louis Mugnano 7/2/25

Operational Excellence on Day One: Introducing the DBA Operations Kickstarter

The DBA Operations Kickstarter is a comprehensive toolkit designed by Mugnano Data Consulting to automate and operationalize best-practice DBA workflows for Greenplum and Cloudberry environments. Whether you're onboarding a new system or stabilizing an existing one, this solution delivers an enterprise-grade operational foundation in hours, not months.

Louis Mugnano 5/8/25

DbSchema Dimensional Model

Dive deeper into how we leverage the extensibility features of the DbSchema Data Modeling Tool. Specifically, we’ll show how to design a dimensional model and generate DDL that directly integrates with Mugnano Data Consulting’s dimension utility functions.

Louis Mugnano 5/7/25

Data Modeling Tool for Greenplum

This post introduces DbSchema, a data modeling tool I found effective for Greenplum. I’ll show how I worked with the vendor to make Greenplum a first-class citizen in the platform, supporting logical and physical design as well as DDL generation.