Difference between revisions of "Journal:Can a byte improve our bite? An analysis of digital twins in the food industry"

From LIMSWiki
Jump to navigationJump to search
(Saving and adding more.)
(Saving and adding more.)
Line 352: Line 352:


==Discussion==
==Discussion==
The survey results revealed large differences in the use of digital twins depending on the stages of the FSC. First, the major proportion of digital twins are applied in production and processing. Further, nearly all applications are used for monitoring, and many applications predict future states of their physical objects. However, only a few digital twins recommend actions or control systems fully autonomously, i.e., refer to prescriptive or autonomous digital twins, respectively. In addition, key elements to implement digital twins were investigated. To better understand the reasons, we primarily discuss the potentials of digital twins in the food industry, as well as the challenges in implementing digital twins. We close this discussion with threats to validity.
===Potentials of digital twins in the food industry===
As shown in the previous sections, we identified in our literature review several potential ways to optimize the FSC with digital twins. From there, we have to address the fifth research question '''RQ5''' — What is the potential of digital twins in the food industry?
In general, digital twins enable data accessibility and advanced analytics in real-time to assist in more informed, efficient, and faster decision-making. [89] Sensor data are fed into a digital twin that runs food process models (i) for providing relevant product process information and operation outputs in real-time for process control, troubleshooting, and supply chain management, as well as (ii) to optimize processes for uniformity, performance, and sustainability, or to develop new designs. [4,90] Furthermore, this results in better risk assessment and mitigation strategies based on "what if" analyses and simulations. [89]
====Improved use of sensor data====
Current approaches in Industry 4.0 focus on the intelligent collection of data with IoT technology and its analysis with ML algorithms. [91] This includes a variety of data sources, including raw material data, machine data, and customer data. Digital twins enable deeper insights due to the use of multi-sensor networks (sensor fusion), where different sensors measure several parameters from different locations. [52,60] As stated before, sensors are required to provide data (e.g., environmental, process, machine, etc.) for the digital twins. With the development of smart sensors, monitoring the states during processes gets easier and faster. [4] Further, sensors become cheaper, need less power, and transfer the data wirelessly, which enables their use in more applications, even in mobile settings.
====Combination with intelligent packaging====
Intelligent packaging can directly share the quality and current condition of a food product on the packaging during the distribution, retail, and consumption stages. [92] Intelligent packaging consists of intelligent materials or objects, which are defined by their behavior of monitoring “the condition of packaged food or the environment surrounding the food.” [93] Sensors are integrated into the packaging [92] to monitor, e.g., the temperature, the pH value, the humidity, the pressure on the food, or vibrations during transportation. [94] Further, gas sensors can measure the concentration of carbon dioxide (CO<sub>2</sub>) or hydrosulfuric acid (H<sub>2</sub>S) to allow concluding the current condition of the food. [92] An example of how to produce near zero-cost gas sensors is given by Barandun ''et al.'' [95] Biosensors are able to detect pathogens or toxins in bacteria-contaminated foods. [96]
====Combination with AI and ML====
Likewise, integrating [[Nuclear magnetic resonance spectroscopy|nuclear magnetic resonance]] (NMR) and other spectroscopy methods, as well as imaging techniques [4] in conjunction with AI and, especially, ML, enables machine or computer vision. Such algorithms can analyze the food and are able to determine its composition, condition, and quality issues as spoilage, contaminants, or defects. [97,98] Furthermore, by placing virtual sensors on the digital twin model, sensor data from locations that would usually not be accessible to sensors can be generated. [8] Virtual sensor data are software-based outputs of fused data from physical sensors. [99] The application of physical sensors is limited by noise, interference, or unfeasibility due to spatial conditions [99] or locations that are difficult to access. [8] Virtual sensors provide data measurements of parameters or locations, which are physically not measurable. [99] This application enables the detailed prediction of food losses and the remaining shelf life of the food products. [60]
====Optimizing production planning====
Further, production planning can be optimized with ML in this context. [100] The industry demands the possibility to adapt to current market demands as unique nutritional standards and packaging sizes and, therefore, require a higher production flexibility. [6] This means not only being able to produce a wide range of products, but also counting with the capacity to reschedule the production dynamically. [81] The analysis and prediction of SC disruptions can be used to assist this [5,101], although the mentioned references focus on more economic aspects of these disruptions. Proactive adaptation improves system performance, as it forecasts adaptation concerns (e.g., through identification of patterns in historical data) and reacts either by preparing an adaptation or adapting. [102] Autonomous systems can respond to changes in the state during ongoing operation, while digital twins can integrate a variety of data like environment data, operational data, and process data. [26,103] This also includes supplying different stakeholders in the FSC with actionable real-time data, such as the remaining shelf life for each shipment (based on the product’s physical, biochemical, microbiological, or physiological states), on which logistics decisions and marketing strategies can be adjusted. [8,89]





Revision as of 22:17, 21 November 2022

Full article title Can a byte improve our bite? An analysis of digital twins in the food industry
Journal Sensors
Author(s) Henrichs, Elia; Noack, Tanja; Piedrahita, Ana María Pinzon; Salem, María Alejandra; Stolz, Johnathan; Krupitzer, Christian
Author affiliation(s) University of Hohenheim
Primary contact christian dot krupitzer at uni-hohenheim dot de
Editors Tiwari, Ashutosh
Year published 2022
Volume and issue 22(1)
Article # 115
DOI 10.3390/s22010115
ISSN 1424-8220
Distribution license Creative Commons Attribution 4.0 International
Website https://www.mdpi.com/1424-8220/22/1/115/htm
Download https://www.mdpi.com/1424-8220/22/1/115/pdf (PDF)

Abstract

The food industry faces many challenges, including the need to feed a growing population, manage food loss and waste, and improve inefficient production systems. To cope with those challenges, digital twins—digital representations of physical entities created by integrating real-time and real-world data—seem to be a promising approach. This paper aims to provide an overview of digital twin applications in the food industry and analyze their challenges and potentials. First, a literature review is executed to examine digital twin applications in the food supply chain. The applications found are classified according to a taxonomy, and key elements to implement digital twins are identified. Further, the challenges and potentials of digital twin applications in the food industry are discussed. This survey reveals that application of digital twins mainly target the production (i.e., agriculture) or food processing stages. Nearly all applications are used for monitoring and many for prediction. However, relatively few focus on the integration of digital twins in systems for developing autonomous control or providing recommendations to humans. The main challenges of implementing digital twins are combining multidisciplinary knowledge and providing enough data. Nevertheless, digital twins provide huge potentials, e.g., in determining food quality, ensuring traceability, or designing personalized foods.

Keywords: digital twins, food industry, food supply chain, cyber–physical systems, sensors, internet of things, survey

Introduction

With the evolution and digitalization towards Industry 4.0, the concept of creating digital copies of physical systems received entry to the industry. [1] In particular, the food industry is of special interest because it requires highly efficient use of available resources. [2] Over time, food production systems have evolved alongside technological innovations, allowing for increased production, greater product variety, more resilient food stocks, and increased international trade. Yet, despite these advances, food systems around the world continue to face unprecedented challenges that pose significant threats to current food systems, such as climate change, pressure to feed a growing global population, and persistent global food waste. In addition, there are growing societal demands for greater food provenance, traceability, and sustainability within the food system. [3]

A key element of Industry 4.0 is the digital twin, a virtual model of a product, process, or system created with data collected by sensors that enables simulations or real-time analyses of that product, process, or system. [1,4] The use of digital twins seems beneficial in food processing for various reasons. The COVID-19 pandemic demonstrated the vulnerability of food supply resilience. [5] To ensure the consistency of a food supply, production processes must allow high flexibility and adaptivity. [6] Furthermore, product quality is influenced by different quality levels of input materials. Especially in the case of seasonal fluctuations impacting raw material quality, the ability to adjustment parameters in the production process is essential. Introduction processes of new products could be simplified by a digital twin of already existing physical processes. The digital twin is able to learn the correct process parameters for production and is used as the knowledge base within a self-adaptive software system. [7] However, a digital twin of food production has additional specific requirements compared to digital twins of the production of material goods. [8] Due to the variability of raw materials, these cannot be based only on the processing steps but must also take into account the chemical, physical, and (micro)biological properties of the food. However, when functioning as intended, the technology can be applied to create a detailed digital model of the supply chain that integrates real-time and real-world data to respond to unexpected events and uncertainty within the supply chain.

This work aims to provide an overview of digital twin applications in the food industry and analyze their challenges and potentials. We first present a taxonomy to differentiate underlying technologies and better understand the intended use of each digital twin. Second, a survey is executed to examine digital twin applications in the food supply chain (FSC). We target the FSC as it provides a link between all the key activities and processes involved in bringing a specific food product to market. [9] To meet these unprecedented challenges, FSCs and corresponding actors are turning to modern technology for assistance. [10] We classify the found applications of digital twins according to our taxonomy. Third, we investigate the key elements to implement digital twins in the FSC. Fourth, since the concept of digital twins is still young, we discuss the potentials of applying them in the food sector. Finally, we discuss the challenges of applying digital twins in the food industry.

In summary, this paper contributes to the body of research by providing the following scopes:

  • Classification of digital twins in the food sector,
  • Overview of the application of digital twins in the food industry,
  • Definition of the key elements for implementing a digital twin,
  • Analysis of the potential of digital twins in the food industry, and
  • Description of challenges of applying digital twins in the food industry.

The remainder of the paper is structured as follows. The next section explains several fundamentals related to the FSC and the digitalization of the food industry, and it provides a definition of digital twins. Then, the methodological approach for the literature review is presented. The subsequent section evaluates the literature review results and summarizes the key elements for implementing digital twins. Then we discuss the potentials and challenges of digital twins and their implementation in the food supply chain, and then we discriminate this work against other publications in the field. We conclude this paper with a summary of our results.

Background

The food supply chain

A supply chain (SC) is a network of actors structured around activities and processes that aim to satisfy given consumer demand by bringing products or services to market. [11] This network includes feedback and circular economy aspects, e.g., for sustainability reasons such as with the recycling of materials. [12] The actors within the SC are linked through upstream or downstream processes and activities that produce value in the form of finished products or services. [11] In the same sense, a FSC encompasses all activities involved in the creation and transformation of raw materials into food products, as well as their retail and consumption. [10] However, FSCs do differ significantly from other SCs due to the complexity of producing, transporting, and managing food products. [13]

While it is important to consider not only the primary flow but also the tangential and secondary flows that are contained within the FSC—as these are opportunities to reduce food loss and waste through reuse and recycling [12]—we focus on a simplified, linear, and straight-forward structure of the FSC. This is sufficient for this survey since the focus is on single activities of the FSC that are present identically in the simplified FSC as well as in a circular view. Figure 1 provides an overview of the FSC and the main actors, to which the digital twin applications will be assigned. Commonly, the FSC begins with production (which usually involves an agricultural farm) and continues with supply, processing, distribution, and retail, finally ending with consumption.


Fig1 Henrichs Sensors22 22-1.png

Figure 1. A simplified structure of the food supply chain (based on the work of Kamilaris et al.[10]), including the actors used to classify the digital twin applications within the scope of this work. The structure does not show any circular flows or side chains of by-, side-, or co-products, which would result in a value network rather than a straight-forward chain.

Worth mentioning is that the stages could be thereby divided into several processing or transportation sub-entities. For instance, Shoji et al. [14] investigate the cold chain of fruits and vegetables from a (farm to) packhouse to distribution to the retailer. The authors divide the transportation steps between the supply from packhouse to distribution center and from distribution center to the retailer. In our understanding, the packhouse would be part of the processing stage, and both the transportation and the distribution would be summarized as distribution.

It is crucial for SCs to be designed with consideration for uncertainty and risk, as mitigation measures and solutions must be developed to prevent disruptions to the SC. [9] Those disruptions impact the SC’s regular flow and affect the other actors directly. [15] In particular, the most frequent FSC disruptions are human error, communication misunderstandings, organizational process errors, and quality problems with goods received. [16] Consequently, disruptions may result in negative effects to the final product [9] regarding sustainability, safety, and quality. [13]

Additionally, several challenges in the FSC occur during different stages. This includes [17]:

  • Production estimation and optimization in the production stage, including crop management and security, as well as livestock control;
  • Production planning in the processing stage, regarding the post-harvest loss, as well as demand prediction;
  • Distribution management concerning route planning, prediction of SC risks and disruptions, and shelf life prediction; and
  • Consumption, which represents consumer behavior, their dietary behavior, food loss and waste, and the prediction of daily demand.

Industry 4.0 and related concepts

The concept of Industry 4.0 is associated with the fourth industrial revolution. It combines technologies such as cyber–physical systems (CPSs), internet of things (IoT), and cloud computing. While the term "Industry 4.0" is primarily used in Europe, a similar concept—the industrial internet of things (IIoT)—mainly used in the US describes advances in big data, cloud computing, and networking of machinery and systems in the industrial sector. [18] Based on CPSs and IoT, manufacturing processes including logistics (i.e., SC management), services, and maintenance are efficiently synchronized within in Industry 4.0. [19] Hence, Industry 4.0 does not focus on a single process or technology but integrates all processes, resulting in a highly flexible and integrated, optimized manufacturing process. The complete implementation of Industry 4.0 or IIoT culminates in the smart factory, an integrated production process that is entirely self-organizing by the connected machines and intelligent software without any human interaction. [20]

Further, modern FSCs increasingly make use of integrated information and communication technology (ICT) systems to assist with mitigating against uncertainty and risk, process optimization, and numerous other applications. [11] In addition, ICT systems are of particular interest for traceability and decision-making functions within the FSC. [21] Traceability is important to identify quality and safety concerns and to ensure food provenance to the consumer and authorities. [13] As described by Zhong et al. [9], traceability systems in FSCs vary greatly depending on region, government regulations, and digitalization of the FSC. ICT systems as digital twins are able to assist with decision-making, collaboration, scheduling and planning, logistics management, and warehouse management within the FSC. [22]

A definition of "digital twin"

The concept of digital twins first came up during NASA’s Apollo 13 mission in 1970, as the ground team used simulators to provide solutions to the spacecraft crew for landing them safely. [23] The term “digital twin” was first used and defined in 2003. [24] This concept contains a physical object and a virtual object, connecting data and information from both. NASA formalized the description of digital twins in 2012 and forecasted its potential in the aerospace sector. [25] Here, the digital twin is defined as a multi-physics and multi-scale simulation of a vehicle or system, which uses the best physical models, sensor data, and history, resulting in a mirror of the physical counterpart. The discovery that digital twins might be used in a variety of industries other than aerospace accelerated its development [26]; in particular, it is an important concept for Industry 4.0 and IIoT.

For purposes of this work, we follow the definition of a digital twin provided by the CIRP Encyclopedia of Production Engineering [27]:

A digital twin is a digital representation of an active unique product (real device, object, machine, service, or intangible asset) or unique product-service system (a system consisting of a product and a related service) that comprises its selected characteristics, properties, conditions, and behaviors by means of models, information, and data within a single or even across multiple life cycle phases.

As such, a digital twin virtually represents its real-world counterpart, containing all its essential properties. [8] It is based on real-world comprehensive data measurements, which form the digital profile of the physical object or process. Consequently, a digital twin is connected to the real-world object or system through a continuously updated data flow. [28] Further, the digital twin is able to simulate the relevant processes and kinetics accurately. [8] In this sense, a digital twin may be seen as an ever-evolving digital profile of the past, current, and even future behavior of a process or a physical object and allow the prediction of uncertainty in the process steps. [28]

Defraeye et al. [8] define three common principles to a digital twin. Firstly, it must contain all the necessary components and material properties of what it is representing. Secondly, it can reliably and accurately simulate all relevant processes through the product life cycle. Finally, the digital twin should be connected with its real-world counterpart, as this differs a digital twin from simpler models. Communication is preferred to be realized in real-time, but the data could also flow offline.

These principles are in accordance with Jones et al. [29], who define twinning as the synchronization of the states of the physical and virtual entities. Additionally, the virtual model consists of high fidelity. Bottani et al. [30] expand upon this, explaining that a digital twin is more than the representation of the physical counterpart, since the goal of a digital twin is to replicate all behaviors and relationships of a system and its environment.

Further, five technological components enable digital twins [28]: sensors, integration capabilities, real-world aggregated data, analytical techniques, and actuators. Those technologies are required to aggregate the different available data sources (mainly related to the product and the process) into one comprehensible model of the digital twin, as well as support the prediction or analysis with a digital twin. Figure 2 presents the diversity of potential data resources for a food-based digital twin.


Fig2 Henrichs Sensors22 22-1.png

Figure 2. A food-based digital twin basic framework, within the context of dairy processing.

A concept closely related to digital twins are CPSs. [31,32] While a digital twin is a digital copy of a product or physical system with the intention of performing real-time optimization, a CPS merges computational and physical processes to seamlessly support humans with intelligence when using machines. [33] CPSs often include digital twins as a base for their decision-making processes. [34,35,36]

Methodology

The methodology for the survey integrates methods from the guidelines of Webster and Watson [37] for a structured literature review, and Petersen et al. [38] for systematic mapping studies. The research is based on the steps shown in Figure 3. In the beginning, we framed our aim in the form of research questions. We defined exclusion and inclusion criteria and performed keyword-based searches for filtering the articles based on their titles and abstracts. The search method was adapted from Webster and Watson [37] to cover a wide range of publications with regards to regions, fields, and publishers. After identifying the set of possible relevant publications, a relevance analysis based on a full paper screening was performed. Subsequently, descriptions and properties of the digital twin applications, as well as bibliography data, have been extracted and classified as proposed by Petersen et al. [38] In the following, we describe these steps in detail.


Fig3 Henrichs Sensors22 22-1.png

Figure 3. Overview of the methodology for the classification in this survey. Publications found through a keyword-based search were first selected based on the title and abstract. Afterward, the publications were analyzed, and relevant publications were categorized in a previously defined taxonomy.

Definition of research questions

The primary aim of this work is (i) to provide an overview of digital twins applied in the food sector regarding their intended use and (ii) to identify future research areas. According to this goal, we derived our research questions. First, we searched for taxonomies (RQ1) that enable classifying the digital twin concepts that we found according to their application purpose. As we are primarily interested in which area of the food industry (i.e., stages of the FSC) digital twins are applied, we decided to assign the applications found to the stages of the FSC as presented in Figure 1 (RQ2). To better understand the application’s reason for use, we classified the applications found according to our taxonomy (see RQ1) to answer the research question of how a digital twin can support the activities in the FSC (RQ3). Further, we analyzed the different types of digital twins that we identified for providing an overview of the different key elements of a digital twin in the FSC (RQ4). Aiming to show the applicability and the benefits of implementing digital twins in different stages of the FSC, we conducted the final research questions. At first, we discussed the potential of digital twins to improve the food industry (RQ5). Then, we discussed the challenges of implementing digital twins in the food sector (RQ6). These considerations led to the following research questions:

  • RQ1 — How can digital twins be classified?
  • RQ2 — In which areas of the food industry are digital twins applied?
  • RQ3 — Which types of digital twins are applied in the food industry?
  • RQ4 — What are the key elements in implementing a digital twin?
  • RQ5 — What is the potential of digital twins in the food industry?
  • RQ6 — What are the challenges in applying digital twins in the food industry?

Selection method

To find digital twin applications in the FSC (answering RQ2 and RQ3), we conducted a literature review and included publications available between May and the end of September 2021. We searched the following databases: Google Scholar, Scopus, ScienceDirect, and Academic Search Complete by EBSCO Publishing. We created two groups of keywords. The first group concerned digital twins, including the keywords “digital twin,” “digital twin application,” and “cyber–physical systems,” while the second group provided the relation to the food sector, i.e., consisted of the keywords “food,” “food supply chain,” “food production,” “food industry,” and “food sector.” The search was performed by combining each of the keywords of both groups.

While we see agricultural plants or farms as part of the FSC (production stage), we did not search directly for “digital twin” AND (“agriculture” OR “agrifood”) since our main focus is on food quality as it relates to food processing. Therefore, the food processing and the transportation stages (supply and distribution), as well as the retail stage, are from special interest rather than the primary food production since the food quality parameters are ultimately adjusted during processing. After the processing stage, all actions, e.g., cooling, serve to maintain and guarantee food quality until consumption. Still, we did not discard works related to the agriculture sector when found with our set of keywords.

Additionally, we added publications to our list—which we did not find directly but rather were referred to by other publications and possibly relevant—for this research (backward search). In the literature search process, we also identified applicable literature reviews. [29,39,40,41] However, as we wanted to avoid the misinterpretation or incorrect reproduction of information, we rather included the original publications or sources of such reviews. This ensured that we did not include different points of view for the same application.

Furthermore, we performed a free web search with Google and DuckDuckGo to find examples for digital twins related to the FSC applied in the industry. While this search provided many results, we only included a few of them [42,43,44,45,46,47], as the found information was often not precise enough to analyze in the detail required for a classification within our taxonomy.

Analysis method

The authors selected the publications based on the title and abstract. Additionally, the entire paper was searched to overcome the disadvantages of a keyword-based search. Each publication was reviewed, and applications found were classified according to the taxonomy by one of the authors by screening the complete paper. Afterward, each publication, as well as the classification, was reviewed by another of the authors. If an application was classified differently, a third author also reviewed the classification, and the classification was discussed by all authors.

The focus of this work is on digital twin applications related to foods, food products, and their quality. Therefore, the publications needed to contain a specified description of a digital twin application and terms related to “food” (see Section 3.2). Further, we included publications with regards to food products or their quality, meaning we included digital twins of field monitoring applications, animal monitoring applications, and processing machines as well, which we found through the search. We investigated applications that were already realized and implemented, as well as concepts for digital twins if the provided description was sufficient enough for the analysis.

A few publications found were located in the periphery of foods, food products, and their quality. For instance, Linz et al. [48] and Tsolakis et al. [49] describe digital twin applications of agricultural machines and robots, whereas the digital twins are used for route planning. Furthermore, Jo et al. [50,51] propose a digital twin for a pigsty to control energy demand while adjusting the ventilation and temperature. Other publications provided too little information about the digital twin, although they were strongly related to our research. [31,52] Since we were not able to classify them, we did not include those in our evaluation.

For some works, we found subsequent publications extending the originally presented digital twin. We added such follow-up publications as dedicated digital twin applications, as they develop within the projects, or the available information concerning the applications differed in the papers. Further, the originally published digital twin might be sufficient for some applications. In particular, those publications were from Skobelev et al. [36,53,54], concerning (wheat) plants; from Defraeye et al. [8,55], Shoji et al. [14], and Tagliavini et al. [56], regarding fruits; and from Bottani et al. [30] and Vignali and Bottani [57], relating to a pasteurizer.

Selected studies

In total, we studied 84 publications, from which we included 38 publications after the application of the inclusion and exclusion criteria. The publication range spanned works from 2007 to 2022. Worth mentioning is that the publication from Shoji et al. [14] is assigned to 2022 since this is an online-first available publication. Figure 4 reveals that the number of publications increased during the last years. In 2019, we observed a peak of 24 publications (12 included). In the years 2020 and 2021, the number of publications is slightly decreasing, counting 22 (nine included) and 13 publications (eight included), respectively. A reason for this decrease could be the COVID-19 pandemic and the inclusion of publications available until the end of September 2021.


Fig4 Henrichs Sensors22 22-1.png

Figure 4. Overview of included (in blue) and excluded (in gray) publications per year. In total, we included 38 of 84 identified publications regarding the analysis of digital twin applications in the food industry.

From the selected publications, the major proportion was originally published at conferences and journals, 47.4% and 34.2%, respectively (see Figure 5). Further, we included non peer-reviewed publications (18.4%) from press releases (two publications), books, white papers, websites, reports, and project announcements (all one publication each). The inclusion of non-scientific publication types is appropriate for several reasons. First, digital twins are still a rather young research topic, particularly in the food sector. In addition, the research is highly driven by the industry since the implementation of digital twins is strongly practice-oriented. However, non-scientific publications often do not provide sufficient details for a classification; hence, this number of included works is limited.


Fig5 Henrichs Sensors22 22-1.png

Figure 5. Share of the publication type of 38 included publications.

Results

This section answers the research questions on how to classify digital twins (RQ1), in which areas of the food industry digital twins can be found (RQ2), what types of digital twins are applied (RQ3), and which key elements are required to implement digital twins (RQ4). First, we examine different classification schemes and derived the best fitting taxonomy for our research by combining different existing classification schemes. Second, we analyze in which activities of the FSC digital twins are applied. We then investigate which types of digital twins are applied in the FSC based on our results of RQ1. The classification of all applications included in this section can be found in Appendix A (see Table A1). We close out this section by summarizing the key elements for the implementation of digital twins.

Classification of digital twins

Since digital twins have no unique and standardized taxonomy, this section provides an overview of different classification approaches and classifies their relevance for our work. This answers the first research question RQ1 — How can digital twins be classified?

The classification approaches differ among the various authors’ focus on digital twins. Kritzinger et al. [58] differentiate the terms "digital model," "digital shadow," and "digital twin" based on the data flow between the physical and digital object. A digital model is defined by a manual data flow between both objects, where the data flows automatically from the physical to the digital object and manually from the digital to the physical object in a digital shadow. The data flow in a digital twin is automated between both objects, which may serve as the controller of the physical object.

Pylianidis et al. [39] differentiate among various service categories, meaning the use case of a digital twin. These categories are real-time monitoring, energy consumption analysis, system failure analysis and prediction, optimization/update, behavior analysis/user operation guide, technology integration, and virtual maintenance. They further distinguish the technology readiness level (TRL) among the levels concept, prototype, and deployed. Jones et al. [29] classify digital twins according to the product’s life-cycle phases, namely imagination, definition, realization, support/usage, and retirement/disposal.

However, we use a combination of the following two schemes as taxonomy since we are interested in the techniques behind the digital twins and the intended use of the digital twins. According to Defraeye et al. [8], a digital twin can be statistical, data-driven (i.e., intelligent), or physics-based (i.e., mechanistic). The first type is based on statistics, where an analytical model is solved with an ordinary differential equation or a simpler analytical equation. The intelligent digital twin is a data-driven model that relies on artificial intelligence (AI) techniques, e.g., machine learning (ML), for model development, calibration, verification, and validation. Mechanistic digital twins are based on physics; hence, they are also called physics-based digital twins. These models concern all relevant physical, biochemical, microbiological, and physiological processes using multi-physics modeling and simulation. Several authors Verboven et al. [4] and Onwude et al. [60] mention that only a mechanistic digital twin is able to mimic the behavior of the real-world counterpart realistically and comprehensively. Therefore, a mechanistic digital twin is preferable for predictions. Worth mentioning is that intelligent digital twins also consider statistical methods. Further, the model parameters used in mechanistic digital twins can be quantified, verified, and validated with statistical and ML methods.

Defraeye et al. [8] presented the types in a triangular structure, placing the twin types "statistical," "intelligent," and "mechanistic" in the corners. As such, the type of a digital twin could be assigned to corners as well as edges, or in-between. However, we decided to classify the digital twin applications according to their prevailing type, i.e., "there are not any mixed types."

The second classification scheme is similar to that of Jones et al. [29] since it represents the product’s life-cycle phases. Following the approach by Verdouw et al. [61], digital twins can be used to characterize and simulate the states and behavior of their real-life twins, which do not exist at a specific point in time. Further, digital twins may be used to monitor the current state of items, prescribe desired states, forecast future states, and automatically react to conditions of their real-world counterparts and, therefore, control systems without human interaction. Finally, digital twins are also able to outlast real-world objects, and they can be used to recollect their historical conditions. Worth mentioning is that these categories can coexist within the same digital twin application. Table 1 provides a detailed description of the different categories we used to classify the digital twin applications within the context of this work.

Table 1. Digital twin taxonomy (based on work by Defraeye et al. [8] and Verdouw et al.[61]).
Type Description Citation
Statistical Solving a simple analytical equation or an ordinary differential equation (ODE) for calculations with the generated data [8]
Intelligent Use of AI techniques, e.g., ML, for model development, calibration, verification, and validation
Mechanistic Performance of multi-physics modeling and simulation to capture the relevant physical, biochemical, microbiological, and physiological processes
Imaginary Simulates objects that do not physically exist in the real world at the given time [61]
Monitoring Monitors the current state and behavior of a real-life, physically existing counterpart
Predictive Projects future states and behavior of a physical object based on real-time data
Prescriptive Are able to intelligently recommend corrective and preventive actions while using the results of monitoring and predictions
Autonomous Control autonomously the behavior of the real-world counterparts without human intervention
Recollection Maintains the complete history of physical objects, which no longer exist in real life

It is notable that the definition of a digital model [58] corresponds to the definition of an imaginary digital twin. [61] Additionally, the categories of Pylianidis et al. [39] and Verdouw et al. [61] are similar, but since Pylianidis et al. focus on more technical approaches, the approach by Verdouw et al. is used in this work.

Applications of digital twins in the food supply chain

In our prior discussion of selected studies in our methodology, we observed that the number of publications increased in recent years. Accordingly, the number of digital twin applications has increased as well. This section answers the second research question RQ2 — In which areas of the food industry are digital twins applied?

Figure 6 provides an overview of the absolute frequency of applications per stage in the FSC. The major proportion of digital twin applications could be found in the production stage, often referred to as agricultural applications (54.90%). Many applications focus on the growth of plants [36,42,53,54,61,62,63,64] or monitoring the condition of animals. [23,61,64,65,66] Further, entire production systems as greenhouses or fields are twinned. [34,43,44,61,62,67,68,69,70,71] Several applications could be described as supportive, e.g., to monitor and control pests. [35,65]


Fig6 Henrichs Sensors22 22-1.png

Figure 6. Results of the literature review–Absolute frequency of digital twin applications assigned to stages in the food supply chain. As there are applications [47,56] assigned to several stages, the total number of counts is 53, although 51 applications were found in 38 publications.

The second most frequently assigned stage is the processing stage (31.37%). In this stage, the digital twins mainly concern processing machines, e.g., pasteurizer [30,57] or packaging machines [45,72], or entire processing systems. [6,45,73,74,75,76] A few use cases focus on the optimal product composition or quality. [45,46,77]

Applications during transportation—in particular, the stages of supply and distribution (7.84% and 5.88%, respectively)—determine the quality of fruits and vegetables, with a focus on measuring the temperature. [8,14,47,55,56,78] Only one application could be assigned to the retail stage (1.96%), where it is used to determine the quality of fruits and vegetables, as well as the remaining shelf-life. [47] Furthermore, one application is assigned to the consumption stage (1.96%). This application aims to twin a consumer in order to design food products, which are personalized to adapt foods in case of genetic disorders, such as diabetes mellitus. [79]

It should be noted that two applications were assigned to multiple stages. While the digital twin of a mango fruit to determine the quality during transportation was assigned to supply and distribution stages [56], the digital twin concept for the determination of the quality of fruits and vegetables was assigned to the distribution and retail stages. [47]

Types and categories of digital twins in the food supply chain

In addition to the stages in the FSC, where a digital twin is applied, the applications’ intentions of use are of special interest. Earlier, for question one, we specified a taxonomy regarding both the digital twin techniques and the intended use. It is necessary to note that in the case of the taxonomy regarding the intended use, the applications could be classified into several categories. Regarding the digital twin technique, applications could only be assigned to one type. In contrast to the previous question, applications were not counted twice if they were assigned to multiple stages of the FSC. Hence, this section answers the third research question RQ3 — Which types of digital twins are applied in the food industry?

Figure 7 shows the classification of the digital twin applications regarding their different types. Most of the digital twin applications are classified as intelligent or data-driven (39.22%). These applications are used for monitoring and controlling plant growth environments, in particular greenhouses or fields [34,42,68,71]; the twinning of plants during growing itself [35,65]; the detection of pests and actions to tackle them [65]; the monitoring of animals [23]; or the determination of shocks and the adaptation of process parameters during potato harvesting. [61,78,80] In addition, applications concern the monitoring of cattle with regards to their health, dairy productivity, or growth (weight gain for meat production) [61,65,66] and the control of food processing parameters. [75] The applications use clustering methods to determine the states and conditions of animals and plants and to classify pests, and further ML techniques to improve the system continuously.


Fig7 Henrichs Sensors22 22-1.png

Figure 7. Results of the literature review, in regards to the share and absolute frequency of digital twin types found in 38 publications.

Almost the same proportion of applications are used for simulation, based on mechanistic or physics-based models (31.37%). Many use cases regard either the plant and animal growth in the production stage [36,53,61,62,64,69] or the monitoring of food processing, e.g., from a pasteurizer, an ice cream machine, pudding production, malting, or the packaging design concerning special product properties. [30,45,57,73,74,76] More digital twins focus on fruit and vegetable quality during supply by measuring the surface temperature and calculating the pulp temperature based on that. [14,56] All the applications mentioned in this category could be described well with known models.

Further, some applications are based on statistics (13.73%). In this category, many use cases focus on the control of food processing. [6,45,46,72] Other applications regard the design and personalization of food [77,79], or the twinning of a wheat plant. [54] All digital twins are based on statistical methods using means and standard deviations for conclusions and predictions.

It should be noted that there are some applications (15.69% in total), which are not classified to any type [45,65,67] or the classification was not possible due to a lack of information. [43,44,45]

Figure 8 shows the categorization results of the digital twin applications with regards to their intended use. We observed that nearly all applications (94.12%) are used for monitoring their real-life counterparts. Only three use cases have not been classified in this category; those target applications for the design of new food products and food packaging [45] and the weight gain of cattle for the meat and livestock value chain. [66] We conclude that this observation makes sense since monitoring the physical objects is often the base for further predictions or decision-making. However, only 32 applications (62.75%) are working with real-time data.


Fig8 Henrichs Sensors22 22-1.png

Figure 8. Results of the literature review, in regards to the absolute frequency of digital twin categories found in 38 publications. It should be noted that the total number of counts is not equal to the number of applications since they are not restricted to one category.

Additionally, many applications are used for predictions (72.55%). Use cases, which are not predicting, are mainly used for real-time monitoring and decision-making. These cases concern the detection of pests and the control of plant growth environments based on current growing conditions, e.g., the temperature or humidity [34,44,63,67,68], the monitoring of animals [23,65], the control of food processing [74], and the design of products. [45]

The predictions could be used to suggest corrective or preventive actions (39.22%). Since most of the applications found are assigned to the production stage, many prescriptive digital twin applications belong to applications only able to assist in agricultural plants to enhance the quality during growth and harvest processes. [42,43,61,62,64,69,70,78,80] Another prescriptive digital twin is applied in a pudding production system to assist in production planning. [73] Further use cases only recommend actions rather than fully automatizing the system. [75] Examples are the personalized design of foods regarding genetically caused diseases [79] or the design of food packaging. [45]

A minor amount of digital twins (15.96%) are integrated into systems working autonomously. The applications automatically control greenhouses by adjusting parameters like temperature or light [34,42,44] or processing plants by controlling, among others, the workflow or specified processing parameters as temperature. [45,75]

Some digital twins found were used for forecasting and simulating objects that were presently non-existent (23.53%). This category includes applications for the design of food products and raw materials [53,77] as well as food packaging and production plants [45,67]; applications to predict shelf-life and the food quality [47,56,66]; and applications to control the process flow. [6,72,74,75] The application of imaginary digital twins enables the avoidance of expensive mistakes [75] and detailed planning. [67]

Recollective digital twins, that maintain the complete history of physical objects (even if those do not longer exist), can be found in all stages of the FSC (31.37%). Some applications use the stored information for learning and improving the system. [34,53,54,61,62,63,68,78,80] Other applications were implemented to better document the processes and quality parameters of the physical objects. [6,30,47,57,74,76,79] It should be noted that due to a lack of information, many applications could not be classified in this category. [23,42,43,44,45,46,56,65,66,67,69,70,71,73,75]

Key elements for digital twin implementation

In the previous sections, we describe our observations that the implementation of digital twins varies in the different stages of the FSC, as well as the intention of use within a specified stage. The major proportion of digital twins are applied in the (primary) production and the processing stage. Especially in the distribution, retail, and consumption stages only a few applications have been found. In addition, different types of digital twins have been found. To investigate how to improve the food quality in the FSC using digital twins, necessary components to apply digital twins need to be identified. Hence, this section answers the research question RQ4 — What are the key elements in implementing a digital twin?

First of all, there must be a motivation to implement a digital twin. Some digital twins are motivated by production and market reasons, e.g., to cope with a higher demand for more flexibility in the production to adapt to new market demands, such as clients requesting more products that meet unique nutritional standards and packaging sizes. [6] Moreover, the constant increase in business competition challenges companies to look beyond cost reductions and improve quality and productivity. [81] In particular, food processing industries are battling with low-profit margins while being challenged to reduce time-to-market and develop new, flexible processes for a wide range of goods. [6] Another motivation arises out of the demand for more transparency to stakeholders, trust, and ownership of the processes. [4] Finally, some drivers are employee-related, such as offering training based on virtual reality applications that benefit from the data of the models in digital twins [81] and improving employee safety by detecting potential workplace hazards with digital twins. [30,81]

Second, the underlying technical infrastructure must be able to support a digital twin. [82] The definition of such a digital model is only possible if a sufficient amount of data is available. Even taking into consideration that a digital twin is able to generate some data of the physical entities (products and processes) through ML, AI, or simulation, the collection of data from the process through sensors is still a critical requirement. Another requirement is related to the communication infrastructure: if real-time analysis is demanded, the infrastructure must support fast exchange of data. [23] Real-time analysis requires a sufficient processing capacity; hence, if real-time analysis is targeted in a project with digital twins, this processing capacity must be available. [4] Thus, a detailed analysis of the available data sources, communication infrastructure, and processing capacity when starting the implementation of a digital twin is required to determine how a digital twin can be enabled.

Third, a digital twin for food production has additional specific requirements compared to digital twins of the production of material goods. Due to the variability of raw materials, these cannot be based only on the processing steps, but must also take into account the chemical, physical or (micro)biological properties of the food. [8] Consequently, either (i) sensors must be available to capture those changes or (ii) simulations based on scientific models of food aging must support information about those intermediate states of the food. Accordingly, the typical, retrospective analysis of machine and process data must be combined with short-term (detection of potential problems), and medium-term data analysis approaches (simulation of product changes) to improve decision making in food production and tracking of the current state of production at any time with digital twins.

Every digital twin implementation starts with a process design in which all processes and interaction points are mapped that a digital twin will be modeling. [28] Improvements with regard to cost, time, or asset efficiency are augmented in this design process.

However, up to now, there is no consensus regarding a generic method in the realization of digital twins that can describe its implementation and the data acquisition from the physical to the virtual object. [26] Given this lack of consensus, Tao et al. [83] proposed a digital twin model based on five dimensions (see Figure 9):

  • Physical entity: The physical world is the basis. The physical entity can be a device or product, a system, a process, or even an organization. [84] It carries out actions following physical regulations and deals with environmental uncertainty.
  • Virtual entity: The digital model is generated to replicate the physical geometries, properties, behaviors, and rules of the physical entity. Therefore, multiple models can be considered. [85]
  • Service platform: Decision-support analyses support the monitoring and optimization of the physical entity with simulations, verification, diagnosis, and prognosis, as well as prognostic. [81,84] Further, the virtual entity must be served with data, knowledge, and algorithms, and the platform itself needs to be served, e.g., with customized software development and model building.
  • Data model: The data is stored in the data model. [81] Since the digital twin considers multi-temporal scale, multi-dimension, multi-source, and heterogeneous data [84], the data model includes and merges data from the physical entity, the virtual entity, services, and knowledge. [85]
  • Information connections: All dimensions need to be connected to ensure communication and update the information immediately. [81] This enables advanced simulation, operation, and analysis. [84]


Fig9 Henrichs Sensors22 22-1.png

Figure 9. The five-dimensional digital twin concept (adapted from the work of Steindl et al. [85]). Digital twins consist of a physical as well as a virtual entity, which are supported by several services. Data are fused and stored centralized. These four dimensions must be connected with each other, creating the fifth dimension.

Barni et al. [86] describe four best practices for the implementation of a digital twin. First, the entire product value chain should be included to ensure data exchange and consistency. Second, the virtual models should be kept dynamic through the development of well-documented methods for model generation and modification. Third, it should be ensured that data from several sources are included to measure the different variables and all essential properties of the physical product and the system (process, actuators, inputs, outputs, and environment). [4] The exact combination of relevant data is often unknown a priori when the first model is developed; accordingly, the design of the digital twin must offer modularity and scalability. [86] Fourth, long access life cycles should be ensured in order to address a long-term convergence within the physical and the virtual world. The approach by Werner et al. [73] reinforces this through a knowledge-acquiring digital twin.

Thus, the accessibility and continuous flow of near real-time data are important. [23] The data generation can be achieved with sensors [8] and the use of IIoT technology. [87] In addition, data processing and data evaluation or interpretation are of high relevance [23], leading to the requirement of sufficient computational performance to handle big data volumes. [4] As such, data transfer technologies are required to provide high-speed data gathering from huge amounts of remote sensor data and transfer it in real-time within a network, e.g., Bluetooth, LoRaWAN, or 5G. [8]

The core of a digital twin is based on modeling. [87] Therefore, physics modeling (geometrical, mechanical, material, hydrodynamic, and discrete event models), semantic modeling (ML models, deep learning, data mining expert system, and ontology modeling), and model integration (flexible modeling, standard interface, black-box, gray-box, and multi-physics modeling) are used. AI and ML techniques support data analysis and data fusion, enabling efficient processing and interpretation of a large amount of data. [81,88] Further, those techniques can continuously improve the performance of the system. [23] A key element in this regard is a human-machine interface, where users can easily interact with and understand the digital twin’s information. [23] This is particularly important if the digital twin recommends corrective and preventive actions.

In conclusion, the implementation of digital twins requires multidisciplinary knowledge [31], especially from food science. For instance, this includes microbiological, physical, chemical, and engineering disciplines, as well as knowledge for efficient process management. Further, ICT is required. Commonly, ICT today is used in the FSC to connect the various stakeholders in the different stages through data exchange. In the future, the support of automated data collection with IoT technology and efficient data analysis, mainly using ML, will have increased importance.

Discussion

The survey results revealed large differences in the use of digital twins depending on the stages of the FSC. First, the major proportion of digital twins are applied in production and processing. Further, nearly all applications are used for monitoring, and many applications predict future states of their physical objects. However, only a few digital twins recommend actions or control systems fully autonomously, i.e., refer to prescriptive or autonomous digital twins, respectively. In addition, key elements to implement digital twins were investigated. To better understand the reasons, we primarily discuss the potentials of digital twins in the food industry, as well as the challenges in implementing digital twins. We close this discussion with threats to validity.

Potentials of digital twins in the food industry

As shown in the previous sections, we identified in our literature review several potential ways to optimize the FSC with digital twins. From there, we have to address the fifth research question RQ5 — What is the potential of digital twins in the food industry?

In general, digital twins enable data accessibility and advanced analytics in real-time to assist in more informed, efficient, and faster decision-making. [89] Sensor data are fed into a digital twin that runs food process models (i) for providing relevant product process information and operation outputs in real-time for process control, troubleshooting, and supply chain management, as well as (ii) to optimize processes for uniformity, performance, and sustainability, or to develop new designs. [4,90] Furthermore, this results in better risk assessment and mitigation strategies based on "what if" analyses and simulations. [89]

Improved use of sensor data

Current approaches in Industry 4.0 focus on the intelligent collection of data with IoT technology and its analysis with ML algorithms. [91] This includes a variety of data sources, including raw material data, machine data, and customer data. Digital twins enable deeper insights due to the use of multi-sensor networks (sensor fusion), where different sensors measure several parameters from different locations. [52,60] As stated before, sensors are required to provide data (e.g., environmental, process, machine, etc.) for the digital twins. With the development of smart sensors, monitoring the states during processes gets easier and faster. [4] Further, sensors become cheaper, need less power, and transfer the data wirelessly, which enables their use in more applications, even in mobile settings.

Combination with intelligent packaging

Intelligent packaging can directly share the quality and current condition of a food product on the packaging during the distribution, retail, and consumption stages. [92] Intelligent packaging consists of intelligent materials or objects, which are defined by their behavior of monitoring “the condition of packaged food or the environment surrounding the food.” [93] Sensors are integrated into the packaging [92] to monitor, e.g., the temperature, the pH value, the humidity, the pressure on the food, or vibrations during transportation. [94] Further, gas sensors can measure the concentration of carbon dioxide (CO2) or hydrosulfuric acid (H2S) to allow concluding the current condition of the food. [92] An example of how to produce near zero-cost gas sensors is given by Barandun et al. [95] Biosensors are able to detect pathogens or toxins in bacteria-contaminated foods. [96]

Combination with AI and ML

Likewise, integrating nuclear magnetic resonance (NMR) and other spectroscopy methods, as well as imaging techniques [4] in conjunction with AI and, especially, ML, enables machine or computer vision. Such algorithms can analyze the food and are able to determine its composition, condition, and quality issues as spoilage, contaminants, or defects. [97,98] Furthermore, by placing virtual sensors on the digital twin model, sensor data from locations that would usually not be accessible to sensors can be generated. [8] Virtual sensor data are software-based outputs of fused data from physical sensors. [99] The application of physical sensors is limited by noise, interference, or unfeasibility due to spatial conditions [99] or locations that are difficult to access. [8] Virtual sensors provide data measurements of parameters or locations, which are physically not measurable. [99] This application enables the detailed prediction of food losses and the remaining shelf life of the food products. [60]

Optimizing production planning

Further, production planning can be optimized with ML in this context. [100] The industry demands the possibility to adapt to current market demands as unique nutritional standards and packaging sizes and, therefore, require a higher production flexibility. [6] This means not only being able to produce a wide range of products, but also counting with the capacity to reschedule the production dynamically. [81] The analysis and prediction of SC disruptions can be used to assist this [5,101], although the mentioned references focus on more economic aspects of these disruptions. Proactive adaptation improves system performance, as it forecasts adaptation concerns (e.g., through identification of patterns in historical data) and reacts either by preparing an adaptation or adapting. [102] Autonomous systems can respond to changes in the state during ongoing operation, while digital twins can integrate a variety of data like environment data, operational data, and process data. [26,103] This also includes supplying different stakeholders in the FSC with actionable real-time data, such as the remaining shelf life for each shipment (based on the product’s physical, biochemical, microbiological, or physiological states), on which logistics decisions and marketing strategies can be adjusted. [8,89]



References

Notes

This presentation is faithful to the original, with only a few minor changes to presentation and updates to spelling and grammar. In some cases important information was missing from the references, and that information was added.