How to choose the right AI model for business forecasting in retail

One of the areas where retailers can effectively apply AI for is in optimizing user behavior forecasting. This, however, raises key questions: Which model should be chosen, and what criteria should guide that choice? When is it worth deploying complex models, and when is traditional ML sufficient?

Let’s examine the commercial use of AI through a case study delivered by the BaOne team. The project involved personalizing product recommendations in the mobile app of a major grocery retailer with a network of over 900 stores.

The primary objective was to increase profit margins without compromising sales volume. A particular focus was also placed driving the growth of strategic product categories, including private labels (PL) and global exporters (GE).
Figure 1: Customer journey through the purchase funnel
In the traditional retail purchase funnel, as the customer moves from Stage 1 (New user) to Stage 4 (Explicit context), the company progressively gains more contextual data about the user. At the same time, the complexity and implementation time of AI models decrease.
  • Stage 1
    The company has no information about the new user—there is no interaction history, behavioral patterns cannot be applied, and context is entirely absent. Cold-start models are the most challenging when it comes to delivering high-conversion recommendations.
  • Stages 2 and 3
    At these stages, the company has access to website behavior and purchase history. This allows for the development of interest-based models, identification of generalized behavioral patterns, cohort analysis, and the delivery of personalized offers and discounts.
  • Stage 4
    The company knows exactly which items are currently in a specific user’s cart. At this point, real-time marketing tools and retargeting (traditional recommendation models) are used. Since 46% of retailers already apply AI in marketing, this stage of the customer journey typically poses no challenges.
In this client case, BaOne experts developed recommendation models to engage users at Stages 2 and 3 of the purchase funnel. When behavioral and historical context is available, two main approaches can be applied:


  • ·Accurately predict the next purchase, effectively forecasting the user’s future behavioral context.
  • ·Expand the customer’s shopping cart size by recommending products based on historical context and predefined business rules.

Which AI model to choose?

To build accurate forecasts based on historical data and behavioral factors, it’s important to use models capable of analyzing complex time-series patterns. Common options include:
  • LSTM

    Classic recurrent neural networks effective at capturing long-term dependencies
  • GRU

    Streamlined version of LSTM with lower computational overhead
  • Transformers
    (attention-based models)
    Cutting-edge but resource-intensive solutions, such as BERT4Rec and SASRec
To predict a customer’s the next purchase, the most frequently applied approaches include sequence modeling, attention mechanisms, and intent modeling.
Examples of models and techniques:
  • Recurrent Neural Networks (RNN / GRU / LSTM)
  • Transformer-based models (e.g., BERT4Rec, SASRec)
  • Seq2Seq with attention

For recommendation systems focused on cart  expansion, approaches often rely on embeddings, multi-task learning, and memory networks.
Examples of models and techniques:
  • User2Vec / Prod2Vec / DeepWalk applied to interest graphs
  • Collaborative filtering

Model selection criteria

When implementing algorithms, BaOne experts following these guiding principles:
  • Proven effectiveness
    Has the model been successfully applied to similar use cases?
  • Rigorous testing
    Have A/B tests been conducted to validate its performance?
  • Alignment with business goals
    If the model underperforms, understanding the underlying reasons is essential.
  • Infrastructure investment
    What are the costs associated with training and maintaining the model?

BaOne’s approach: context expansion over precise prediction

To achieve the retailer’s goals of margin growth, increased turnover, and promotion of PL and GE brands, BaOne experts adopted an approach focused on expanding the customer’s shopping cart by recommending products based on historical context and business rules. This strategy drove quick wins with limited strain on computing resources.

However, as with any AI project, a strong business idea must underpin a successful product recommendation. In this project, BaOne combined the following elements:
  • Recommending items based on similar customers who already purchase target categories in large volumes and drive higher revenue
  • Aligning with the assortment strategy for PL and GE brands without compromising overall turnover
  • Monetizing through retail media
Given that maximizing margin ROI was the primary objective, a proven solution was chosen:
  • Building user embeddings (vector representations of users) to establish context
  • Enhancing data with business analytics and selecting products aligned with user interests
Unlike traditional recommendation systems, the focus was not on precisely predicting the next action, but on expanding the user context. This is particularly important when aiming to increase the average ticket size, where key constraints include:
  • Macroeconomic factors (e.g., purchasing power)
  • Limited product assortment within the specific retail chain
This approach allows for flexible adaptation to shifts in demand and supports the delivery of personalized, commercially effective solutions.
Results

By building on a strong business idea and a hybrid recommendation model, the retailer achieved the following:
  • 2% increase in front margin

    through the promotion of strategic categories (PL and GE)
  • 2.5x growth in add-to-cart conversion rates
    by recommending products based on customers’ behavioral and historical context
  • 10% increase in activity

    across product matrix categories
The project introduced tools to manage personalized offers via tailored communication channels such as email, mobile apps, push notifications, SMS and more. Additional functionality was developed to support flexible segment management and user activation through RFM-based targeting.

Balancing forecast precision and cost in AI model selection

Modern neural network models—such as BERT4Rec, SASRec, or other transformer-based architectures—require significant computing power and complex infrastructure. The decision to adopt them ultimately hinges on striking the right balance between forecast precision, the cost of training and maintenance, and the business impact delivered.

In practice, gains in business performance tend to decrease with each upgrade to more advanced recommendation tools. That’s why the underlying business idea should remain the primary driver in model selection.
  • When are complex models (like BERT4Rec) the right choice?
    • High precision is critical – even a 1–3% improvement in recommendation accuracy can lead to meaningful gains in conversion rates or average ticket.
    • Large-scale data – when working with millions of users and intricate behavioral patterns that simpler models struggle to capture.
    • Long and complex sequences – where historical context matters (e.g., video streaming or e-commerce with extended decision cycles).
    • Willingness to invest in infrastructure – transformer models require GPU/TPU resources and optimized pipelines for training and inference.
  • When to use traditional ML models (LightFM, Matrix Factorization, CatBoost)?
    • Limited resources – no access to powerful servers or large budgets for ML infrastructure.
    • Fast implementation – quick MVP deployment needed to test hypotheses without extensive fine-tuning of complex models.
    • Short and simple sequences – user sessions are brief (e.g., impulse purchases in retail).
    • Interpretability over precision – business requires clear understanding of recommendation logic (e.g., for compliance).

ROI drives model selection

  • If a simpler model (LightFM, CatBoost with features) delivers 80% of BERT4Rec’s performance at just 10% of the cost, it is the smarter investment.
  • If even a modest increase in precision (e.g., +2% conversion) justifies the expense, transformer models are a reasonable choice.
As BaOne’s experience shows, hybrid approaches that combine traditional ML-driven recommendations (cost-effective, fast and stable) and complex scenarios powered by neural networks (e.g., personalization for VIP customers or long sessions) often provide the optimal solution. This approach ensures system scalability without overspending and allows flexible adaptation to evolving business needs.

The best approach depends on data volume and complexity, ML infrastructure budget, precision and speed requirements, as well as the business’s readiness for long-term investment. When unsure, start with simple models, track their performance and progressively enhance the system where it yields the highest ROI.
  • Vitali Baum
    A technology expert with over fifteen years of experience serving international companies.
    Specializes in digital transformation programs across various industries, with a special focus on implementing IT products for retail and e-commerce (both B2B and B2C), oil producers, OFS firms, insurance companies and pensions funds.
    innovations@baone.ae
Discuss your project today

© 2025 BaOne FZCO
Address: Office 304/305, The Offices 2, One Central, Level 3, Dubai, UAE
www.BaOne.ae
Tel.: +971527278388
innovations@baone.ae
This website uses cookies to improve your user experience. If you continue on this website, you will be providing your consent to our use of cookies.
Accept