Our Blogs: A Curated Guide to Our Technical Posts
This curated guide follows the journey from installation and onboarding to automation, data modeling, advanced analytics, and machine learning. Each blog post here is organized to reflect the natural progression of a Greenplum deployment.
My Best Friend Lambchop 🐑🐾
This is what happens when you leave your laptop where your dog can reach it: he blogs about his favorite toy and, of course, Greenplum. Enjoy a lighthearted story from Toby!
Push the Logic to the Data: Using PL/Python and GreenplumPython for Scalable In-Database Processing
This post demonstrates how to leverage PL/Python and the GreenplumPython library to push data transformation logic directly into the Greenplum Database. This approach aligns with a key best practice in big data architecture: move processing to the data, not the other way around. The result is a scalable, more secure, more cost-effective solution for your Python dev team.
Predicting Customer Churn Using Greenplum and gpmlbot
This blog summarizes a demonstration delivered to a customer interested in understanding how Greenplum can be used for machine learning, without exporting data to external systems. The objective was to showcase how churn modeling can be prototyped and iterated entirely within Greenplum using the gpmlbot utility, which automates feature preparation, model training, and evaluation.
Automating Agile Data Onboarding with Greenplum Sailfish
Accelerate Greenplum Adoption with the GPRA Onboarding Service
We understand the challenges of provisioning and operationalizing a Greenplum or Cloudberry data warehouse environment. Whether you’re deploying on physical infrastructure, virtual platforms, or the cloud, setting up a high-performance MPP database requires precision, expertise, and automation.
Operational Excellence on Day One: Introducing the DBA Operations Kickstarter
The DBA Operations Kickstarter is a comprehensive toolkit designed by Mugnano Data Consulting to automate and operationalize best-practice DBA workflows for Greenplum and Cloudberry environments. Whether you're onboarding a new system or stabilizing an existing one, this solution delivers an enterprise-grade operational foundation in hours, not months.
DbSchema Dimensional Model
Dive deeper into how we leverage the extensibility features of the DbSchema Data Modeling Tool. Specifically, we’ll show how to design a dimensional model and generate DDL that directly integrates with Mugnano Data Consulting’s dimension utility functions.
Data Modeling Tool for Greenplum
This post introduces DbSchema, a data modeling tool I found effective for Greenplum. I’ll show how I worked with the vendor to make Greenplum a first-class citizen in the platform, supporting logical and physical design as well as DDL generation.