Horizontal Machine Learning Platforms Need Flexibility and Monitoring

By David Magerman, PhD

TL;DR - Horizontal ML platforms need to be flexible, configurable, and monitorable if they are to be robust and to consistently add value over time. They need to allow data to be weighted flexibly, in user-controlled ways. They need data visualization tools to detect outliers and contributors to noise. And they need automated monitors for model parameters and data drift that alert users to changes in the raw input to the models and/or in the nature of the computed and deployed models.

In my last blog entry, I explored the risks associated with the proliferation of low-code, no-code horizontal machine learning (ML) platforms. Human intelligence is still intrinsic to the development of robust, consistently performing ML-based artificial intelligence (AI) systems. Trying to replace human data scientists with automated systems operated by domain experts is a hit-or-miss proposition that could lead to disaster if applied to mission-critical decision-making systems.

Nonetheless, there is clearly enormous benefit to applying ML-based AI systems to a broad set of problems across many industries, and these horizontal platforms might be useful tools for exploring and realizing those benefits. So, if one is building a horizontal platform for deploying ML-based AI systems, what are the important features and functionalities to implement to avoid the pitfalls and reap the benefits? To answer this question, it helps to understand how human intelligence informs the data science exploration process, and then to figure out how to enable that human intervention efficiently while allowing the automated machine learning to proceed around it.

Human beings can understand data in ways that automated systems still struggle with. They can differentiate between data errors and merely unusual data (e.g., GME trading in February 2021). They can align unusual data patterns with real-world events (e.g., 9/11, COVID, financial crises, elections). They also understand the impact of calendar events (e.g., holidays). Depending on the data used in ML algorithms and the data being predicted, the semantics of the data might be meaningful yet hard for automated learning algorithms to discover. And there is no reason to force the algorithms to discover these relationships in the data if they aren't hidden from the human operator.

There is also the question of how to weight the data, both in time and across data points. For instance, in modeling stock price movements, it is useful to look at a broad universe of stocks when estimating models for a particular stock’s price movement. However, for a given stock, one might want to weight the contribution of other stocks’ data differently. For example, if you are trying to model MSFT’s price movements, you might want to look at data from other NASDAQ stocks, other technology stocks, or other large-cap stocks. Depending on what aspect of MSFT’s price movements you are interested in studying (e.g., short-term movements, overall volatility, or movement relative to an index), you might want to weight data from other stocks differently.
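To make user-controlled weighting concrete, here is a minimal sketch in Python using only NumPy. The data is synthetic and the similarity groups and their weights are illustrative choices, not real market data or a prescribed scheme; the point is that the weight mapping is a knob the platform should expose to the user.

```python
import numpy as np

# Illustrative setup: model one stock's returns from a peer stock's returns,
# giving more weight to observations contributed by "similar" stocks.
# All data and group labels are synthetic.
rng = np.random.default_rng(0)

X = rng.normal(size=(300, 1))                      # peer-stock returns
y = 0.8 * X[:, 0] + rng.normal(scale=0.1, size=300)  # target-stock returns
groups = rng.choice(["same_sector", "same_exchange", "other"], size=300)

# User-controlled weighting scheme: closer peers count more.
group_weight = {"same_sector": 1.0, "same_exchange": 0.5, "other": 0.1}
w = np.array([group_weight[g] for g in groups])

# Weighted least squares: solve (X'WX) beta = X'Wy.
Xd = np.column_stack([np.ones(len(X)), X])  # add intercept column
beta = np.linalg.solve(Xd.T @ (Xd * w[:, None]), Xd.T @ (w * y))
print(beta)  # [intercept, slope]; the slope should recover roughly 0.8
```

Changing `group_weight` is all it takes to re-express a different modeling intent (e.g., emphasizing sector peers for volatility studies), which is exactly the kind of flexibility a horizontal platform should surface rather than hide.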

If you have years of data, you might want to modulate the half-life of the data (i.e. how you downweight historical data relative to recent data). If you are predicting short-term price movements, let’s say for high-frequency trading, you might concentrate your models on very recent data, say the last few months or last year or two. If you are predicting long-term price movements, you might consider decades of data, but you might downweight historical data so more recent data would have more impact on your models than data from the distant past.
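The half-life idea above has a direct translation into code. This sketch (NumPy only; the half-life value is a modeling choice, not a recommendation) converts a chosen half-life into exponential decay weights, so an observation exactly `half_life` days old counts half as much as today's:

```python
import numpy as np

def decay_weights(ages_in_days, half_life):
    """Exponential decay weights: weight halves every `half_life` days."""
    return 0.5 ** (np.asarray(ages_in_days, dtype=float) / half_life)

# Ages of roughly today, ~6 months, ~1 year, ~2 years (in trading days).
ages = np.array([0, 126, 252, 504])
print(decay_weights(ages, half_life=252))  # -> [1.0, ~0.707, 0.5, 0.25]
```

A short half-life concentrates the model on recent data (the high-frequency case), while a long half-life lets decades of history contribute with gently diminishing influence (the long-term case).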

Data scientists also benefit from visualizing data, a largely manual process that is more art than science. Plotting raw data, correlations between data and the quantities being predicted, and time series of coefficients from estimations across time can yield observations that feed back into the model construction process. You might notice a periodicity in the data, perhaps a day-of-week effect, or anomalous behavior around holidays. You might detect extreme moves in coefficients that suggest outlier data is not being handled well by your learning algorithms. You might notice different behavior across subsets of your data, suggesting that separating them out would generate more refined models. Again, self-organizing learning algorithms can be used to try to discover some of these hidden patterns. But a human being might be better equipped to find these patterns and to feed insights from them back into the model construction process.
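The "extreme moves in coefficients" check can be partially automated even when the final judgment stays with a human. The sketch below (NumPy only, with a synthetic coefficient series and an injected jump; the z-score threshold of 4 is an arbitrary illustrative choice) flags suspicious period-over-period changes for a person to then inspect on a plot:

```python
import numpy as np

# Synthetic time series of an estimated coefficient that drifts slowly,
# with one abrupt regime shift injected at index 250.
rng = np.random.default_rng(1)
coefs = np.cumsum(rng.normal(scale=0.01, size=500))
coefs[250:] += 0.5  # injected jump between index 249 and 250

# Z-score each period-over-period change and flag extreme moves.
diffs = np.diff(coefs)
z = (diffs - diffs.mean()) / diffs.std()
flagged = np.flatnonzero(np.abs(z) > 4)
print(flagged)  # the injected jump shows up at diff index 249
```

In practice the flagged dates would be cross-referenced against calendars and news (holidays, market events) by the human operator, which is the feedback loop the surrounding paragraph describes.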

Finally, an important role data scientists play in the deployment of ML-based AI systems is model monitoring. Depending on the kind of model being used, what it is predicting, and how its predictions are used in production, different aspects of the model need to be monitored so that deviations in behavior are tracked and problems can be anticipated before they degrade real-world performance. If models are retrained regularly on more recent data, it is important to track the consistency of the new data entering the training process with the data previously used. If production tools are updated with new models trained on more recent data, it is important to verify that the new models are as similar to the old ones as expected, where expectation is model- and task-dependent.
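One common way to automate the "is the new training data consistent with the old?" check is a distribution comparison such as the population stability index (PSI). The sketch below uses synthetic data; the 0.2 alert threshold is a widely used rule of thumb, not a universal constant, and the binning scheme is one of several reasonable choices:

```python
import numpy as np

def psi(reference, current, bins=10):
    """Population stability index between a reference and a current sample.

    Bin edges come from reference quantiles; out-of-range current values
    are clipped into the end bins.
    """
    edges = np.quantile(reference, np.linspace(0, 1, bins + 1))
    ref = np.histogram(np.clip(reference, edges[0], edges[-1]), edges)[0] / len(reference)
    cur = np.histogram(np.clip(current, edges[0], edges[-1]), edges)[0] / len(current)
    ref = np.clip(ref, 1e-6, None)  # avoid log(0) for empty bins
    cur = np.clip(cur, 1e-6, None)
    return float(np.sum((cur - ref) * np.log(cur / ref)))

rng = np.random.default_rng(2)
old = rng.normal(0.0, 1.0, 10_000)       # previous training batch (synthetic)
same = rng.normal(0.0, 1.0, 10_000)      # new batch, same distribution
shifted = rng.normal(0.7, 1.0, 10_000)   # new batch with simulated drift

print(psi(old, same))     # small value: no alert
print(psi(old, shifted))  # large value: would trigger a drift alert
```

The same pattern extends to the second concern in the paragraph: comparing the coefficient vectors of the newly trained model against the previous one, with thresholds tuned per model and task.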

As with most sciences brought to industry, data science is much more complex in the real world than in the laboratory. Data scientists are engineers and artists more than practitioners of a rote mathematical process. One can automate their behavior to a degree and, in controlled environments, replicate the power and performance of their work with low-code and no-code auto-ML platforms.

But you ignore the nuance of the work of a data scientist at your own peril. If you aren’t going to employ human data scientists at scale in production deployments of ML-based AI systems, and if you are instead going to rely on a horizontal ML platform, you need to make sure you understand these nuances and have a platform that allows you to perform basic functionalities: flexibly weighting data, creatively visualizing data and model statistics in configurable ways, and monitoring training data, model coefficients, and production use of models in ways tailored to the deployment. And if you are building a horizontal ML platform, the more you incorporate these design details in your system, the more likely your customers will get long-term value out of your product.
