Our Blogs: A Curated Guide to Our Technical Posts
This curated guide follows the journey from installation and onboarding to automation, data modeling, advanced analytics, and machine learning. Each blog post here is organized to reflect the natural progression of a Greenplum deployment.
Kickstarter: Automating Backup, Replicate & Restore in Greenplum
Disaster Recovery (DR) isn’t just “take a backup.” At MPP scale, you need a repeatable end-to-end flow with minimal coordination and downtime. This post walks through the DBA Operations Kickstarter framework, which wraps gpbackup, gpbackup_manager, and gprestore into a fully automated DR pipeline.
Turn “Ad-Hoc Chaos” into Trusted Self-Service BI
Self-service BI works when the data foundation removes friction for business users. Our reusable dimensional framework gives you trusted definitions, “as-was” and “as-is” choices, foolproof joins, and upstream data quality checks, so teams can explore confidently without hand-coding SQL or exporting to Excel (or, god help us all, MS Access).
Kickstarter: Automating Partition Maintenance in Greenplum
Partitioning is a cornerstone of scalable analytics in Greenplum Database. The hardest part of partitioning isn’t the design; it’s keeping partitions current. The Kickstarter Partition Maintenance toolset is a set of Bash/Python utilities that operationalizes this lifecycle for Greenplum, and this post dives into those tools.
My Best Friend Lambchop 🐑🐾
This is what happens when you leave your laptop where your dog can reach it: he blogs about his favorite toy and, of course, Greenplum. Enjoy a lighthearted story from Toby!
Push the Logic to the Data: Using PL/Python and GreenplumPython for Scalable In-Database Processing
This post demonstrates how to leverage PL/Python and the GreenplumPython library to push data transformation logic directly into Greenplum Database. This approach aligns with a key best practice in big data architecture: move processing to the data, not the other way around. The result is a scalable, more secure, more cost-effective solution for your Python dev team.
Predicting Customer Churn Using Greenplum and gpmlbot
This post summarizes a demonstration delivered to a customer interested in understanding how Greenplum can be used for machine learning without exporting data to external systems. The objective was to showcase how churn modeling can be prototyped and iterated entirely within Greenplum using the gpmlbot utility, which automates feature preparation, model training, and evaluation.
Automating Agile Data Onboarding with Greenplum Sailfish
Accelerate Greenplum Adoption with the GPRA Onboarding Service
We understand the challenges of provisioning and operationalizing a Greenplum or Cloudberry data warehouse environment. Whether you’re deploying on physical infrastructure, virtual platforms, or the cloud, setting up a high-performance MPP database requires precision, expertise, and automation.
Operational Excellence on Day One: Introducing the DBA Operations Kickstarter
The DBA Operations Kickstarter is a comprehensive toolkit designed by Mugnano Data Consulting to automate and operationalize best-practice DBA workflows for Greenplum and Cloudberry environments. Whether you're onboarding a new system or stabilizing an existing one, this solution delivers an enterprise-grade operational foundation in hours, not months.

