Difference between revisions of "Chemical informatics"

From LIMSWiki
Jump to navigationJump to search
(Creation of Article)
 
(Updating some of the links and fixed the category title)
Line 1: Line 1:
''(This article was taken from Wikipedia)''
''(This article was taken from Wikipedia)''


'''Cheminformatics''' (also known as '''chemoinformatics''' and '''chemical informatics''')  is the use of computer and [[Information science|informational]] techniques, applied to a range of problems in the field of [[chemistry]]. These ''[[in silico]]'' techniques are used in [[pharmaceutical]] companies in the process of [[drug discovery]]. These methods can also be used in chemical and allied industries in various other forms.
'''Cheminformatics''' (also known as '''chemoinformatics''' and '''chemical informatics''')  is the use of computer and informational techniques, applied to a range of problems in the field of chemistry. These ''in silico'' techniques are used in pharmaceutical companies in the process of [[drug design|drug discovery]]. These methods can also be used in chemical and allied industries in various other forms.


== History ==
== History ==


The term chemoinformatics was defined by F.K. Brown <ref name="Brown_1998">{{cite journal | author = F.K. Brown | title = Chapter 35. Chemoinformatics: What is it and How does it Impact Drug Discovery | journal = Annual Reports in Med. Chem. | year = 1998 | volume = 33 | pages = 375 | doi = 10.1016/S0065-7743(08)61100-8}}</ref><ref>{{cite journal | author = Brown, Frank | title = Editorial Opinion: Chemoinformatics – a ten year update | journal = [[Current Opinion in Drug Discovery & Development]]| year = 2005 | volume = 8 | issue = 3 | pages = 296–302}}</ref> in 1998:  
The term chemoinformatics was defined by F.K. Brown <ref name="Brown_1998">{{cite journal | author = F.K. Brown | title = Chapter 35. Chemoinformatics: What is it and How does it Impact Drug Discovery | journal = Annual Reports in Med. Chem. | year = 1998 | volume = 33 | pages = 375 | doi = 10.1016/S0065-7743(08)61100-8}}</ref><ref>{{cite journal | author = Brown, Frank | title = Editorial Opinion: Chemoinformatics – a ten year update | journal = Current Opinion in Drug Discovery & Development| year = 2005 | volume = 8 | issue = 3 | pages = 296–302}}</ref> in 1998:  


<blockquote>
<blockquote>
Line 11: Line 11:
</blockquote>
</blockquote>


Since then, both spellings have been used, and some have evolved to be established as Cheminformatics,<ref>[http://www.molinspiration.com/chemoinformatics.html Cheminformatics or Chemoinformatics ?<!-- Bot generated title -->]</ref> while European Academia settled in 2006 for Chemoinformatics.<ref name="Obernai">[http://infochim.u-strasbg.fr/chemoinformatics/Obernai%20Declaration.pdf Obernai Declaration]</ref> The recent establishment of the [[Journal of Cheminformatics]] is a strong push towards the shorter variant.
Since then, both spellings have been used, and some have evolved to be established as Cheminformatics,<ref>[http://www.molinspiration.com/chemoinformatics.html Cheminformatics or Chemoinformatics ?<!-- Bot generated title -->]</ref> while European Academia settled in 2006 for Chemoinformatics.<ref name="Obernai">[http://infochim.u-strasbg.fr/chemoinformatics/Obernai%20Declaration.pdf Obernai Declaration]</ref> The recent establishment of the Journal of Cheminformatics is a strong push towards the shorter variant.


== Basics ==
== Basics ==
Cheminformatics combines the scientific working fields of [[chemistry]] and [[computer science]] for example in the area of [[topology (chemistry)| topology]] and [[chemical graph theory]] and mining the [[chemical space]].<ref name="Gasteiger_2004">Gasteiger J.(Editor), Engel T.(Editor): ''Chemoinformatics : A Textbook''. John Wiley & Sons, 2004, ISBN 3-527-30681-1</ref><ref>A.R. Leach, V.J. Gillet: ''An Introduction to Chemoinformatics''.  Springer, 2003, ISBN 1-4020-1347-7</ref>
Cheminformatics combines the scientific working fields of chemistry and computer science for example in the area of topology and chemical graph theory and mining the chemical space.<ref name="Gasteiger_2004">Gasteiger J.(Editor), Engel T.(Editor): ''Chemoinformatics : A Textbook''. John Wiley & Sons, 2004, ISBN 3-527-30681-1</ref><ref>A.R. Leach, V.J. Gillet: ''An Introduction to Chemoinformatics''.  Springer, 2003, ISBN 1-4020-1347-7</ref>
Cheminformatics can also be applied to data analysis for various industries like [[paper industry|paper]] and [[pulp industry|pulp]], [[dye industry|dyes]] and such allied industries.
Cheminformatics can also be applied to data analysis for various industries like [[paper industry|paper]] and [[pulp industry|pulp]], [[dye industry|dyes]] and such allied industries.


== Applications ==
== Applications ==
===Storage and retrieval===
===Storage and retrieval===
{{main|Chemical database}}
The primary application of cheminformatics is in the storage of information relating to compounds. The efficient search of such stored information includes topics that are dealt with in computer science as [[data mining]] and machine learning. Related research topics include:
The primary application of cheminformatics is in the storage of information relating to compounds. The efficient search of such stored information includes topics that are dealt with in computer science as [[data mining]] and [[machine learning]]. Related research topics include:
* Unstructured data
* [[Unstructured data]]
* Structured Data Mining and mining of Structured data
* [[Structured Data Mining]] and mining of [[Structured data]]
** [[Database mining]]
**[[Database mining]]
** Graph mining
**[[Graph mining]]
** Molecule mining
**[[Molecule mining]]
** Sequence mining
**[[Sequence mining]]
** Tree mining
**[[Tree mining]]


==== File formats ====
==== File formats ====
{{main|Chemical file format}}
The ''in silico'' representation of chemical structures uses specialized formats such as the XML-based Chemical Markup Language or SMILES.  These representations are often used for storage in large chemical databases. While some formats are suited for visual representations in 2 or 3 dimensions, others are more suited for studying physical interactions, modeling and docking studies.
The ''in silico'' representation of chemical structures uses specialized formats such as the [[XML]]-based [[Chemical Markup Language]] or [[Simplified molecular input line entry specification|SMILES]].  These representations are often used for storage in large [[chemical database]]s. While some formats are suited for visual representations in 2 or 3 dimensions, others are more suited for studying physical interactions, modeling and docking studies.


=== Virtual libraries ===
=== Virtual libraries ===
Line 39: Line 37:


Virtual libraries of classes of compounds (drugs, natural products, diversity-oriented synthetic products) were recently generated using the FOG (fragment optimized growth) algorithm.
Virtual libraries of classes of compounds (drugs, natural products, diversity-oriented synthetic products) were recently generated using the FOG (fragment optimized growth) algorithm.
<ref>{{cite journal|title=FOG: Fragment Optimized Growth Algorithm for the de Novo Generation of Molecules occupying Druglike Chemical | last=Kutchukian | first=Peter  | coauthors=Lou, David; Shakhnovich, Eugene |journal=Journal of Chemical Information and Modeling | year=2009 |volume=49 | pages=1630–1642|doi=10.1021/ci9000458|pmid=19527020|issue=7 }}</ref>  This was done by using cheminformatic tools to train transition probabilities of a [[Markov chain]] on authentic classes of compounds, and then using the Markov chain to generate novel compounds that were similar to the training database.
<ref>{{cite journal|title=FOG: Fragment Optimized Growth Algorithm for the de Novo Generation of Molecules occupying Druglike Chemical | last=Kutchukian | first=Peter  | coauthors=Lou, David; Shakhnovich, Eugene |journal=Journal of Chemical Information and Modeling | year=2009 |volume=49 | pages=1630–1642|doi=10.1021/ci9000458|pmid=19527020|issue=7 }}</ref>  This was done by using cheminformatic tools to train transition probabilities of a Markov chain on authentic classes of compounds, and then using the Markov chain to generate novel compounds that were similar to the training database.


=== Virtual screening ===
=== Virtual screening ===
{{main|Virtual screening}}
In contrast to high-throughput screening, virtual screening involves computationally
In contrast to [[high-throughput screening]], virtual screening involves computationally
screening ''in silico'' libraries of compounds, by means of various methods such as
screening ''[[in silico]]'' libraries of compounds, by means of various methods such as
docking, to identify members likely to possess desired properties
[[docking (molecular)|docking]], to identify members likely to possess desired properties
such as biological activity against a given target. In some cases, combinatorial chemistry is used in the development of the library to increase the efficiency in mining the chemical space. More commonly, a diverse library of small molecules or natural products is screened.
such as biological activity against a given target. In some cases, [[combinatorial chemistry]] is used in the development of the library to increase the efficiency in mining the chemical space. More commonly, a diverse library of small molecules or [[natural product]]s is screened.


===Quantitative structure-activity relationship (QSAR) ===
===Quantitative structure-activity relationship (QSAR) ===
{{main|Quantitative structure-activity relationship}}
This is the calculation of quantitative structure-activity relationship and quantitative structure property relationship values, used to predict the activity of compounds from their structures. In this context there is also a strong relationship to [[Chemometrics]]. Chemical expert systems are also relevant, since they represent parts of chemical knowledge as an ''in silico'' representation.
This is the calculation of [[quantitative structure-activity relationship]] and [[QSPR|quantitative structure property relationship]] values, used to predict the activity of compounds from their structures. In this context there is also a strong relationship to [[Chemometrics]]. Chemical [[expert system]]s are also relevant, since they represent parts of chemical knowledge as an ''[[in silico]]'' representation.


== See also ==
== See also ==
{{columns-list|2|
* [[Bioinformatics]]
* [[Bioinformatics]]
* [[Chemical file format]]
* [[Chemogenomics]]
* [[Computational chemistry]]
* [[Combinatorial chemistry]]
* [[Data analysis]]
* [[Data analysis]]
* [[Chemometrics]]
* [[Journal of Chemical Information and Modeling]]
* [[List of chemistry topics]]
* [[Docking (molecular)]]
* [[Mathematical chemistry]]
* [[Molecular design software]]
* [[Molecular graphics]]
* [[Molecular modelling]]
* [[Pharmaceutical company]]
* [[Scientific visualization]]
* [[List of software for molecular mechanics modeling|Software for molecular modeling]]
* [[Statistics]]
* [[WorldWide Molecular Matrix]]
* [[Molecular Conceptor]]
* [[Chemicalize.org]]}}


==References==
==References==
Line 102: Line 78:
* [http://chemoinformatician.co.uk Cheminformatics studies from Unilever Centre for Molecular Informatics to OpenEye]
* [http://chemoinformatician.co.uk Cheminformatics studies from Unilever Centre for Molecular Informatics to OpenEye]


[[Category:Cheminformatics]]
[[Category:Chemical informatics]]

Revision as of 19:50, 20 July 2011

(This article was taken from Wikipedia)

Cheminformatics (also known as chemoinformatics and chemical informatics) is the use of computer and informational techniques, applied to a range of problems in the field of chemistry. These in silico techniques are used in pharmaceutical companies in the process of drug discovery. These methods can also be used in chemical and allied industries in various other forms.

History

The term chemoinformatics was defined by F.K. Brown [1][2] in 1998:

Chemoinformatics is the mixing of those information resources to transform data into information and information into knowledge for the intended purpose of making better decisions faster in the area of drug lead identification and optimization.

Since then, both spellings have been used, and some have evolved to be established as Cheminformatics,[3] while European Academia settled in 2006 for Chemoinformatics.[4] The recent establishment of the Journal of Cheminformatics is a strong push towards the shorter variant.

Basics

Cheminformatics combines the scientific working fields of chemistry and computer science for example in the area of topology and chemical graph theory and mining the chemical space.[5][6] Cheminformatics can also be applied to data analysis for various industries like paper and pulp, dyes and such allied industries.

Applications

Storage and retrieval

The primary application of cheminformatics is in the storage of information relating to compounds. The efficient search of such stored information includes topics that are dealt with in computer science as data mining and machine learning. Related research topics include:

  • Unstructured data
  • Structured Data Mining and mining of Structured data

File formats

The in silico representation of chemical structures uses specialized formats such as the XML-based Chemical Markup Language or SMILES. These representations are often used for storage in large chemical databases. While some formats are suited for visual representations in 2 or 3 dimensions, others are more suited for studying physical interactions, modeling and docking studies.

Virtual libraries

Chemical data can pertain to real or virtual molecules. Virtual libraries of compounds may be generated in various ways to explore chemical space and hypothesize novel compounds with desired properties.

Virtual libraries of classes of compounds (drugs, natural products, diversity-oriented synthetic products) were recently generated using the FOG (fragment optimized growth) algorithm. [7] This was done by using cheminformatic tools to train transition probabilities of a Markov chain on authentic classes of compounds, and then using the Markov chain to generate novel compounds that were similar to the training database.

Virtual screening

In contrast to high-throughput screening, virtual screening involves computationally screening in silico libraries of compounds, by means of various methods such as docking, to identify members likely to possess desired properties such as biological activity against a given target. In some cases, combinatorial chemistry is used in the development of the library to increase the efficiency in mining the chemical space. More commonly, a diverse library of small molecules or natural products is screened.

Quantitative structure-activity relationship (QSAR)

This is the calculation of quantitative structure-activity relationship and quantitative structure property relationship values, used to predict the activity of compounds from their structures. In this context there is also a strong relationship to Chemometrics. Chemical expert systems are also relevant, since they represent parts of chemical knowledge as an in silico representation.

See also

References

  1. F.K. Brown (1998). "Chapter 35. Chemoinformatics: What is it and How does it Impact Drug Discovery". Annual Reports in Med. Chem. 33: 375. doi:10.1016/S0065-7743(08)61100-8. 
  2. Brown, Frank (2005). "Editorial Opinion: Chemoinformatics – a ten year update". Current Opinion in Drug Discovery & Development 8 (3): 296–302. 
  3. Cheminformatics or Chemoinformatics ?
  4. Obernai Declaration
  5. Gasteiger J.(Editor), Engel T.(Editor): Chemoinformatics : A Textbook. John Wiley & Sons, 2004, ISBN 3-527-30681-1
  6. A.R. Leach, V.J. Gillet: An Introduction to Chemoinformatics. Springer, 2003, ISBN 1-4020-1347-7
  7. Kutchukian, Peter; Lou, David; Shakhnovich, Eugene (2009). "FOG: Fragment Optimized Growth Algorithm for the de Novo Generation of Molecules occupying Druglike Chemical". Journal of Chemical Information and Modeling 49 (7): 1630–1642. doi:10.1021/ci9000458. PMID 19527020. 

External links