Difference between revisions of "LII:Organizational Memory and Laboratory Knowledge Management: Its Impact on Laboratory Information Flow and Electronic Notebooks"

From LIMSWiki
Jump to navigationJump to search
(Saving and adding more.)
(Saving and adding more.)
Line 19: Line 19:
The efforts began with work at Perkin-Elmer, Nelson Analytical, Spectra Physics, Digital Equipment Corporation, and many others on the computer controlled recording and processing of instrument data. Once we learned how to acquire the data, [[Laboratory automation|robotic tools]] were introduced to help process [[Sample (material)|samples]] and make them ready for introduction into instruments, that with the connection to a computer for data acquisition further increased productivity. That was followed by an emphasis on the storage, [[Information management|management]], and [[Data analysis|analysis]] of that data through the application of [[laboratory information management system]]s (LIMS) and other software. With the recent development of [[artificial intelligence]] (AI) systems and [[large language model]]s (LLMs), we are ready to consider the next stage in automation and system’s application: organizational memory and laboratory knowledge management.  
The efforts began with work at Perkin-Elmer, Nelson Analytical, Spectra Physics, Digital Equipment Corporation, and many others on the computer controlled recording and processing of instrument data. Once we learned how to acquire the data, [[Laboratory automation|robotic tools]] were introduced to help process [[Sample (material)|samples]] and make them ready for introduction into instruments, that with the connection to a computer for data acquisition further increased productivity. That was followed by an emphasis on the storage, [[Information management|management]], and [[Data analysis|analysis]] of that data through the application of [[laboratory information management system]]s (LIMS) and other software. With the recent development of [[artificial intelligence]] (AI) systems and [[large language model]]s (LLMs), we are ready to consider the next stage in automation and system’s application: organizational memory and laboratory knowledge management.  


This piece discusses the convergence of a set of technologies and their application to scientific work. The development of software systems like ChatGPT, Gemini, and others<ref name="MalhotraThisAI24">{{cite web |url=https://www.marktechpost.com/2024/01/30/this-ai-paper-unveils-the-future-of-multimodal-large-language-models-mm-llms-understanding-their-evolution-capabilities-and-impact-on-ai-research/ |title=This AI Paper Unveils the Future of MultiModal Large Language Models (MM-LLMs) – Understanding Their Evolution, Capabilities, and Impact on AI Research |author=Malhotra, T. |work=Marktechpost |publisher=Marketechpost Media, LLC |date=30 January 2024 |accessdate=10 April 2024}}</ref> means that with a bit of effort the ROI in research and testing can be greatly improved.  
This piece discusses the convergence of a set of technologies and their application to scientific work. The development of software systems like [[ChatGPT]], Gemini, and others<ref name="MalhotraThisAI24">{{cite web |url=https://www.marktechpost.com/2024/01/30/this-ai-paper-unveils-the-future-of-multimodal-large-language-models-mm-llms-understanding-their-evolution-capabilities-and-impact-on-ai-research/ |title=This AI Paper Unveils the Future of MultiModal Large Language Models (MM-LLMs) – Understanding Their Evolution, Capabilities, and Impact on AI Research |author=Malhotra, T. |work=Marktechpost |publisher=Marketechpost Media, LLC |date=30 January 2024 |accessdate=10 April 2024}}</ref> means that with a bit of effort the ROI in research and testing can be greatly improved.  


The initial interest discussed herein is on the topic of using LLMs to create an effective organizational memory (OM) and how that OM can benefit scientific organizations. Following that, we'll then examine how that potential technology impacts information flow, integration, and productivity, as well as what it could mean for developing [[electronic laboratory notebook]]s (ELNs). We’ll also have to extend that discussion to having AI and OM systems work with LIMS, [[scientific data management system]]s (SDMS), instrument data systems (IDSs), engineering tools, and field work found in various industries.
The initial interest discussed herein is on the topic of using LLMs to create an effective organizational memory (OM) and how that OM can benefit scientific organizations. Following that, we'll then examine how that potential technology impacts information flow, integration, and productivity, as well as what it could mean for developing [[electronic laboratory notebook]]s (ELNs). We’ll also have to extend that discussion to having AI and OM systems work with LIMS, [[scientific data management system]]s (SDMS), instrument data systems (IDSs), engineering tools, and field work found in various industries.
Line 31: Line 31:


==Organizational memory==
==Organizational memory==
A researcher came into our analytical lab and asked about some results reported a few years earlier. One chemist recalled the project as well as the person in charge of that work, who had since left the company. The researcher thought he had a better approach to the problem being studied in the original work and was asked to investigate it. The bad news is that all the work, both analytical and previous research notes, was written into paper [[laboratory notebook]]s (1960s). Because of their age, they had left the library and were stored in banker’s boxes in a trailer in the parking lot. There, they were subject to water damage and rodents. Most of that material was unusable, and the investigation was dropped.
Many laboratories have similar stories to the above, lamenting the loss of knowledge within the overall organization due to poor knowledge management practices. Knowledge management has been a human activity for thousands of years since the first pictographs were placed on cave walls. The technology being used, the amount of knowledge generated, and our ability to work with it has changed over many centuries. Today, the subject of organizational knowledge management has seen evolving interest as organizations have moved from disparate archives and libraries of physical documents to more organized "computer-based organizational memories"<ref>{{Cite journal |last=Walsh |first=James P. |last2=Ungson |first2=Gerardo Rivera |date=1991-01 |title=Organizational Memory |url=http://www.jstor.org/stable/258607?origin=crossref |journal=The Academy of Management Review |volume=16 |issue=1 |pages=57 |doi=10.2307/258607}}</ref> where a higher level of productivity can be had.
Until recently, many approaches to organization memory development have relied on [[document management system]]s (DMSs) with keyword indexing and local search engines for retrieval. While those are a start, we need more; search engines still rely too heavily on people to sort through their output to find and organize relevant material.
Recently—particularly in 2023—AI systems like the notable ChatGPT 3 have offered a means of searching, organizing, and presenting material in a form that requires little additional human effort. Initial versions have had several issues (e.g., "[[Hallucination (artificial intelligence)|hallucinations]],” a tame way of saying the AI fabricates and falsifies data and information<ref>{{Cite journal |last=Emsley |first=Robin |date=2023-08-19 |title=ChatGPT: these are not hallucinations – they’re fabrications and falsifications |url=https://www.nature.com/articles/s41537-023-00379-4 |journal=Schizophrenia |language=en |volume=9 |issue=1 |pages=52, s41537–023–00379-4 |doi=10.1038/s41537-023-00379-4 |issn=2754-6993 |pmc=PMC10439949 |pmid=37598184}}</ref>), but as new models and tools are developed to better address these issues<ref name="FabbroMicrosoft24">{{cite web |url=https://qz.com/microsoft-azure-ai-hallucinations-chatbots-1851374390 |title=Microsoft is apprehending AI hallucinations — and not just its own |author=Fabbro, R. |work=Quartz |date=29 March 2024 |accessdate=10 April 2024}}</ref><ref name="MaurinTheBank24">{{cite web |url=https://www.risk.net/risk-management/7959062/the-bank-quant-who-wants-to-stop-gen-ai-hallucinating |title=The bank quant who wants to stop gen AI hallucinating |author=Maurin, N. |work=Risk.net |date=15 March 2024 |accessdate=10 April 2024}}</ref>, sufficient improvement may be shown so that those AI systems eventually may deliver on their potential. Outside of ChatGPT, there are similar systems available (e.g., Microsoft CoPilot and Google Gemini), and more are likely under development. Our intent is not to make a comparison since any effort will quickly become outdated. 





Revision as of 17:48, 10 April 2024

Title: Organizational Memory and Laboratory Knowledge Management: Its Impact on Laboratory Information Flow and Electronic Notebooks

Author for citation: Joe Liscouski, with minor editorial modifications by Shawn Douglas

License for content: Creative Commons Attribution-ShareAlike 4.0 International

Publication date: April 2024

Introduction

Beginning in the 1960s, the application of computing to laboratory work was focused on productivity: the reduction of the amount of work and cost needed to generate results, along with an improvement in the return on investment (ROI). This was very much a bottom-up approach, addressing the most labor-intensive issues and moving progressively to higher levels of data and information processing and productivity.

The efforts began with work at Perkin-Elmer, Nelson Analytical, Spectra Physics, Digital Equipment Corporation, and many others on the computer controlled recording and processing of instrument data. Once we learned how to acquire the data, robotic tools were introduced to help process samples and make them ready for introduction into instruments, that with the connection to a computer for data acquisition further increased productivity. That was followed by an emphasis on the storage, management, and analysis of that data through the application of laboratory information management systems (LIMS) and other software. With the recent development of artificial intelligence (AI) systems and large language models (LLMs), we are ready to consider the next stage in automation and system’s application: organizational memory and laboratory knowledge management.

This piece discusses the convergence of a set of technologies and their application to scientific work. The development of software systems like ChatGPT, Gemini, and others[1] means that with a bit of effort the ROI in research and testing can be greatly improved.

The initial interest discussed herein is on the topic of using LLMs to create an effective organizational memory (OM) and how that OM can benefit scientific organizations. Following that, we'll then examine how that potential technology impacts information flow, integration, and productivity, as well as what it could mean for developing electronic laboratory notebooks (ELNs). We’ll also have to extend that discussion to having AI and OM systems work with LIMS, scientific data management systems (SDMS), instrument data systems (IDSs), engineering tools, and field work found in various industries.

This work is not a "how to acquire and implement" article but rather a prompt for "something to think about and pursue" if makes sense within your organization. The idea is the creation of an effective OM (i.e., an extensive document and information database) that fills a gap in scientific and laboratory informatics, one that can be used effectively with an AI tool to search, organize, synthesize, and present material in an immediately applicable way. We need to seriously think about what we want from these systems and what our requirements are for them before the rapid pace of development produces products that need extensive modifications to be useful in scientific, laboratory, field, and engineering work.

Why should you read this?

Most of the products used in scientific work (whether in the lab, field, office, etc.) are designed for a specific application (working with instruments, for example) or adapted from general-purpose tools used in various industries and settings. The ideas discussed here need further development, as do the tools specifically for the needs of the scientific community. Still, that work needs to begin as a community effort to gain possible benefits. We need to guide the development of technologies so that they meet the needs of the scientific community rather than try to adapt them once they are delivered to the general marketplace.

LLM systems have shown rapid development and deployment in almost every facet of industries throughout 2023. Unless something drastic happens, development will only accelerate, given the potential impact on business operations and interest of technology-driven companies. The scientific community needs to not only ensure its unique needs (once they’ve been defined) are included in LLM development and are met, but also that the resultant output reflects empirical rigor.

Organizational memory

A researcher came into our analytical lab and asked about some results reported a few years earlier. One chemist recalled the project as well as the person in charge of that work, who had since left the company. The researcher thought he had a better approach to the problem being studied in the original work and was asked to investigate it. The bad news is that all the work, both analytical and previous research notes, was written into paper laboratory notebooks (1960s). Because of their age, they had left the library and were stored in banker’s boxes in a trailer in the parking lot. There, they were subject to water damage and rodents. Most of that material was unusable, and the investigation was dropped.

Many laboratories have similar stories to the above, lamenting the loss of knowledge within the overall organization due to poor knowledge management practices. Knowledge management has been a human activity for thousands of years since the first pictographs were placed on cave walls. The technology being used, the amount of knowledge generated, and our ability to work with it has changed over many centuries. Today, the subject of organizational knowledge management has seen evolving interest as organizations have moved from disparate archives and libraries of physical documents to more organized "computer-based organizational memories"[2] where a higher level of productivity can be had.

Until recently, many approaches to organization memory development have relied on document management systems (DMSs) with keyword indexing and local search engines for retrieval. While those are a start, we need more; search engines still rely too heavily on people to sort through their output to find and organize relevant material.

Recently—particularly in 2023—AI systems like the notable ChatGPT 3 have offered a means of searching, organizing, and presenting material in a form that requires little additional human effort. Initial versions have had several issues (e.g., "hallucinations,” a tame way of saying the AI fabricates and falsifies data and information[3]), but as new models and tools are developed to better address these issues[4][5], sufficient improvement may be shown so that those AI systems eventually may deliver on their potential. Outside of ChatGPT, there are similar systems available (e.g., Microsoft CoPilot and Google Gemini), and more are likely under development. Our intent is not to make a comparison since any effort will quickly become outdated.


Acknowledgements

I’d like to thank Gretchen Boria for her help in improving this article and her contributions to it.

Footnotes

About the author

Initially educated as a chemist, author Joe Liscouski (joe dot liscouski at gmail dot com) is an experienced laboratory automation/computing professional with over forty years of experience in the field, including the design and development of automation systems (both custom and commercial systems), LIMS, robotics and data interchange standards. He also consults on the use of computing in laboratory work. He has held symposia on validation and presented technical material and short courses on laboratory automation and computing in the U.S., Europe, and Japan. He has worked/consulted in pharmaceutical, biotech, polymer, medical, and government laboratories. His current work centers on working with companies to establish planning programs for lab systems, developing effective support groups, and helping people with the application of automation and information technologies in research and quality control environments.

References