Machine Learning and Intelligent Forecasting: Beyond the Black Box

January 6, 2023

The black box. That mysterious widget in which myriad magic tricks reside. For some, it is a magical force that delivers exactly what you need, even if you don’t entirely understand how. For others, it is at best untrustworthy and, at worst, dangerous. And any solution that offers machine learning, artificial intelligence or predictive analytics is likely to fall into that “black box” arena.

What if we take the lid off the intelligent forecasting black box and investigate its contents? In our last blog post we talked about what intelligent forecasting is and some of its benefits over traditional forecasting models. Here, we’ll take a peek at what intelligent forecasting in life sciences looks like. If you’d like to rummage around in the black box a little more, there is much more detail in our white paper: Intelligent Forecasting for Pharmaceutical R&D.

If we break down intelligent forecasting to its component parts, it becomes a transparent methodology that generates accurate and comprehensive outputs that inform strategic decision making.

Near real-time data, coupled with a flexible and dynamic approach to forecasting, means the outputs stay current and relevant to pharmaceutical companies’ strategic direction, e.g. speciality care, multi-indication products, complex modalities and precision medicine.

Intelligent forecasting consists of a series of linked processes:

Curation of a foundational dataset from high-confidence sources: This brings together many elements, including granular historical pipeline coverage, key R&D metrics and historical sales, product and company characteristics, competitive environment, and market news.
Pre-forecast data processing: This includes standardisation and structuring to make data suitable for analysis. Events (clinical milestones, major news and outcomes) are coded, then data are extracted to create training datasets.
Development of training and test datasets: A balanced and unbiased subset of the data is used to train the machine learning model and forms the basis for estimating the predictability of each attribute. The test dataset is used to validate the accuracy of the machine learning model.
Defining product and indication attributes: These are characteristics that define the product and the diseases it is intended to treat. They fall into four major categories:
1. Product characteristics, e.g. MoA or target, clinical milestones
2. Company characteristics, e.g. historical success rates, track record
3. Unmet need within the indication, e.g. regulatory designations, success/failure rates
4. Competition within the indication, e.g. number of products in development, approval order
Attribute selection based on statistical analysis of predictability and attribute correlations: The predictability of model outputs is assessed on the training dataset and correlated with real-world events. This defines which attributes should be included in the machine learning model, and the selection can be flexed depending on product type and indication.
Intelligent forecasting methodology applied to real-time clinical and commercial events to predict product/indication-level outputs: The machine learning methodologies are fed real-time data to generate product/indication-level probabilities of technical and regulatory success and/or forecasts of commercial opportunity.
Forecast outputs are transparently produced and explained, quality assured and tested against industry benchmarks to ensure consistency: An audit of drivers can also be performed to identify which attributes are used to predict success rates or commercial opportunities. Customisation is also possible, to generate real-time predictions based on scenarios.
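
For readers who like to see the moving parts, here is a minimal Python sketch of three of the steps above: building balanced training and test sets, selecting attributes by how strongly they track outcomes, and checking accuracy on the held-out test set. It is an illustration only, not how Evaluate Omnium works. The records are synthetic, the attribute names (`regulatory_designation`, `unrelated_flag`), the `succeeded` label, the correlation measure and the 0.5 threshold are all assumptions made for the example.

```python
import random
from statistics import mean, pstdev

LABEL = "succeeded"  # hypothetical outcome flag: 1 = success, 0 = failure

def stratified_split(records, test_fraction=0.25, seed=0):
    """Split records into train/test sets with a similar success/failure balance."""
    rng = random.Random(seed)
    by_label = {}
    for rec in records:
        by_label.setdefault(rec[LABEL], []).append(rec)
    train, test = [], []
    for group in by_label.values():
        rng.shuffle(group)
        n_test = max(1, round(len(group) * test_fraction))
        test.extend(group[:n_test])
        train.extend(group[n_test:])
    return train, test

def correlation(xs, ys):
    """Pearson correlation; returns 0.0 when either series is constant."""
    sx, sy = pstdev(xs), pstdev(ys)
    if sx == 0 or sy == 0:
        return 0.0
    mx, my = mean(xs), mean(ys)
    cov = mean((x - mx) * (y - my) for x, y in zip(xs, ys))
    return cov / (sx * sy)

def select_attributes(train, attributes, threshold=0.5):
    """Keep attributes whose absolute correlation with the outcome meets the threshold."""
    labels = [rec[LABEL] for rec in train]
    scores = {a: correlation([rec[a] for rec in train], labels) for a in attributes}
    return [a for a in attributes if abs(scores[a]) >= threshold], scores

def holdout_accuracy(train, test, attribute):
    """Predict success when the training success rate for the attribute value exceeds 0.5."""
    overall = mean(rec[LABEL] for rec in train)
    rates = {}
    for value in {rec[attribute] for rec in train}:
        rates[value] = mean(rec[LABEL] for rec in train if rec[attribute] == value)
    hits = sum(
        (rates.get(rec[attribute], overall) > 0.5) == bool(rec[LABEL]) for rec in test
    )
    return hits / len(test)

# Synthetic, made-up records purely for illustration (not real pipeline data).
records = []
for i in range(20):
    designation = i % 2
    records.append({
        "regulatory_designation": designation,
        "unrelated_flag": (i // 2) % 2,
        # Outcome follows the designation, except for two deliberate exceptions.
        LABEL: designation if i % 10 != 0 else 1 - designation,
    })

train, test = stratified_split(records)
selected, scores = select_attributes(train, ["regulatory_designation", "unrelated_flag"])
acc = holdout_accuracy(train, test, selected[0])

print("train/test sizes:", len(train), len(test))
print("attribute scores:", scores)
print("selected attributes:", selected)
print("hold-out accuracy:", acc)
```

The printed scores double as a very small "audit of drivers": they show which attribute the toy model leans on and which it ignores. A production system would use far richer data and models, but the shape of the workflow is the same.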

Our own intelligent forecasting solution, Evaluate Omnium, uses machine learning to analyse historical datasets to identify signals of clinical success for products at all stages of the pipeline. It is capable of delivering commercially valuable insights into sales estimates for development assets often 12 to 15 years in advance of actual peak sales occurring.

That’s a bit of a whistle-stop tour of the black box. There’s loads more information, not only about how it works but also about what it means for your business, in the full white paper. And, of course, we’re always happy to share more if you have questions. Just let us know!

Karthik Subramanian

VP, Product Strategy & Management
