<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>cc3507</ui>
	<ji>CCJ</ji>
	<fm>
		<dochead>Commentary</dochead>
		<bibl>
			<title>
				<p>Artificial neural networks as prediction tools in the critically ill</p>
			</title>
			<aug>
				<au id="A1" ca="yes">
					<snm>Clermont</snm>
					<fnm>Gilles</fnm>
					<insr iid="I1"/>
					<email> clermontg@upmc.edu</email>
				</au>
			</aug>
			<insg>
				<ins id="I1">
					<p>Co-Director, The CRISMA (Clinical Research, Investigation, and Systems Modeling of Acute Illness) Laboratory, Department of Critical Care Medicine, and Medical Director of The Center for Inflammatory and Regenerative Modeling, University of Pittsburgh, Pittsburgh, Pennsylvania, USA</p>
				</ins>
			</insg>
			<source>Critical Care</source>
			<issn>1364-8535</issn>
			<pubdate>2005</pubdate>
			<volume>9</volume>
			<issue>2</issue>
			<fpage>153</fpage>
			<lpage>154</lpage>
			<url>http://ccforum.com/content/9/2/153</url>
			<note>See related research article <url>http://ccforum.com/content/9/2/R150</url></note>
			<xrefbib>
				<pubidlist><pubid idtype="pmpid">15774070</pubid><pubid idtype="doi">10.1186/cc3507</pubid>
				</pubidlist></xrefbib>
		</bibl>
		<history>
			<pub>
				<date>
					<day>3</day>
					<month>3</month>
					<year>2005</year>
				</date>
			</pub>
		</history>
		<cpyrt>
			<year>2005</year>
			<collab>BioMed Central Ltd</collab>
		</cpyrt>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<p>The past 25 years have witnessed the development of improved tools with which to predict short-term and long-term outcomes after critical illness. The general paradigm for constructing the best known tools has been the logistic regression model. Recently, a variety of alternative tools, such as artificial neural networks, have been proposed, with claims of improved performance over more traditional models in particular settings. However, these newer methods have yet to demonstrate their practicality and usefulness within the context of predicting outcomes in the critically ill.</p>
			</sec>
		</abs>
	</fm>
	<bdy>
		<sec>
			<st>
				<p>Introduction</p>
			</st>
			<p>The science of outcome prediction is particularly useful in the setting of the emergency room &#8211; the entry point of many to acute care. In this issue of <it>Critical Care </it>Jaimes and coworkers <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> evaluate the usefulness of artificial neural networks (ANNs) in predicting hospital mortality in patients presenting to the emergency room with suspected sepsis. The construction of a prediction tool is a difficult undertaking; it requires careful methodological consideration and validation before the predictions can be deemed valid and reliable in na&#239;ve patients <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. These tools identify the presence of associations between the outcome of interest and empiric risk factors that contribute to this outcome. A well designed tool will typically possess three qualities: discrimination (the ability to identify accurately those patients who will reach the outcome from those who will not), goodness of fit (the ability to match accurately predicted and actual outcomes, such as mortality rate, in subgroups of patients), and the ability to achieve these predictions in cohorts of patients similar to those in which the tool was developed <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>.</p>
		</sec>
		<sec>
			<st>
				<p>Artificial neural networks as prediction tools</p>
			</st>
			<p>Most predictive tool use logistic regression &#8211; a well vetted statistical technique that is applicable to situations in which the outcome is binary (e.g. survival/death), measured at a predetermined time in the future <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. The technique can precisely quantify the relative contribution of each risk factor to outcome, typically crystallized as the odds ratio (i.e. the relative odds that a patient with the risk factor has of reaching the outcome as compared with a patient without the risk factor). ANNs represent an alternative technique for achieving predictions. The key difference between the two techniques is that the contribution of each risk factor is not as rigidly dictated with ANNs as it is in a logistic regression model. ANNs can improve predictions by extracting information drawn from unforeseen interactions between predictors. Arguably, if a modeler had foresight of the important interactions present, then they could design an accurate standard logistic regression model without the heavy price associated with the use of an ANN.</p>
			<p>Often considered as 'black boxes', ANNs suffer from several shortcomings. There is no clear prescription to construct an ANN. Off the shelf software is readily available, but there is little guidance as to how ANN parameters, such as number of intermediate (hidden) layers of neurons, number of neurons in those layers, learning rate, activation functions, and several other tuning parameters, should be chosen and tuned for optimal performance by a novice user. ANNs, by virtue of the presence of numerous weights linking the neurons, can accommodate nonlinearity but they also include very large numbers of parameters. There are no clear techniques that provide limits of confidence for those parameters. Consequently, the relative contribution of each input risk factor to the outcome is difficult to quantify. Because of the large number of parameters, it is very easy for an ANN to overfit the development dataset. In other words, the predictions offered by the ANN will be overly optimistic in the original population of patients, and this good performance will not generalize well to populations to which the ANN is na&#239;ve.</p>
			<p>Jaimes and coworkers <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> do not offer a prediction in a na&#239;ve population, nor do those investigators clearly indicate whether techniques were applied to minimize the risk of overfitting. Thus, their conclusion may be overstated. Limited measures guarding against overfitting can be applied within a development set, without specific recourse to an independent validation set. Finally, preparing input data for an ANN requires some amount of <it>a priori </it>knowledge. For example, white blood cell count or fever typically have a 'U'-shaped relationship to outcome. Both low and high values portend a bad outcome, whereas intermediate values are normal. The predictions of an ANN will be significantly improved if this unintuitive relationship between risk factor and outcome is already 'known' to the modeler ahead of time.</p>
			<p>ANNs have been used by many investigators to predict outcomes in the critically ill <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. Clearly, ANNs &#8211; as prediction tools &#8211; require significant expertise to build, have significant shortcomings, and should be developed and validated as rigorously as logistic regression models <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. ANNs do offer greater flexibility and may allow the identification of unforeseen interactions. ANNs can predict continuous outcomes, and thus offer an alternative to multivariable regression. Similarly, ANNs are easily scalable to ordinal or categorical outcomes, and to survival analysis <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, which is much less familiar territory for the critical care physician. However, ANNs can easily be misused, typically because their limitations are not well understood <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>.</p>
		</sec>
		<sec>
			<st>
				<p>Conclusion</p>
			</st>
			<p>Therefore, in settings in which familiar tools such as logistic regression can be applied, ANNs should be reserved for those situations where standard models do not perform well; where one suspects the presence of intense but poorly characterized interactions between risk factors; where one does not particularly care about quantifying the relative contributions of risk factors; where appropriate validation is possible; and, most importantly, where proper expertise is readily available.</p>
			<p>The science of developing prediction models is best left to the expert, but the clinician can contribute invaluable content knowledge to the process.</p>
		</sec>
		<sec>
			<st>
				<p>Abbreviations</p>
			</st>
			<p>ANN = artificial neural network.</p>
		</sec>
		<sec>
			<st>
				<p>Competing interests</p>
			</st>
			<p>Dr Clermont is a paid consultant to Amgen, Altana, and Immunetrics, LLC. The current manuscript was realized without any financial or other material support from commercial entities.</p>
		</sec>
	</bdy>
	<bm>
		<refgrp>
			<bibl id="B1">
				<title>
					<p>Comparison between logistic regression and neural networks to predict death in patients with suspected sepsis in the emergency room</p>
				</title>
				<aug>
					<au>
						<snm>Jaimes</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Farbiarz</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Alvarez</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Martinez</snm>
						<fnm>C</fnm>
					</au>
				</aug>
				<source>Crit Care</source>
				<pubdate>2005</pubdate>
				<volume>9</volume>
				<fpage>R150</fpage>
				<lpage>R156</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1186/cc3054</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B2">
				<title>
					<p>Outcome prediction for individual intensive care patients: useful, misused, or abused?</p>
				</title>
				<aug>
					<au>
						<snm>Lemeshow</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Klar</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Teres</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Intensive Care Med</source>
				<pubdate>1995</pubdate>
				<volume>21</volume>
				<fpage>770</fpage>
				<lpage>776</lpage>
				<xrefbib>
					<pubid idtype="pmpid">8847434</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B3">
				<title>
					<p>Severity scoring systems in the modern intensive care unit</p>
				</title>
				<aug>
					<au>
						<snm>Clermont</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Angus</snm>
						<fnm>DC</fnm>
					</au>
				</aug>
				<source>Ann Acad Med Singapore</source>
				<pubdate>1998</pubdate>
				<volume>27</volume>
				<fpage>397</fpage>
				<lpage>403</lpage>
				<xrefbib>
					<pubid idtype="pmpid">9777087</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B4">
				<title>
					<p>Predicting the outcome of intensive care unit patients</p>
				</title>
				<aug>
					<au>
						<snm>Lemeshow</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Teres</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Avrunin</snm>
						<fnm>JS</fnm>
					</au>
					<au>
						<snm>Pastides</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>J Am Stat Assoc</source>
				<pubdate>1988</pubdate>
				<volume>83</volume>
				<fpage>348</fpage>
				<lpage>356</lpage>
			</bibl>
			<bibl id="B5">
				<title>
					<p>Predicting outcome after ICU admission. The art and science of assessing risk</p>
				</title>
				<aug>
					<au>
						<snm>Schuster</snm>
						<fnm>DP</fnm>
					</au>
				</aug>
				<source>Chest</source>
				<pubdate>1992</pubdate>
				<volume>102</volume>
				<fpage>1861</fpage>
				<lpage>1870</lpage>
				<xrefbib>
					<pubid idtype="pmpid">1446503</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B6">
				<title>
					<p>A review of goodness of fit statistics for use in the development of logistic regression models</p>
				</title>
				<aug>
					<au>
						<snm>Lemeshow</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Hosmer</snm>
						<fnm>DW</fnm>
						<suf>Jr</suf>
					</au>
				</aug>
				<source>Am J Epidemiol</source>
				<pubdate>1982</pubdate>
				<volume>115</volume>
				<fpage>92</fpage>
				<lpage>106</lpage>
				<xrefbib>
					<pubid idtype="pmpid">7055134</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B7">
				<title>
					<p>A comparison of statistical and connectionist models for the prediction of chronicity in a surgical intensive care unit</p>
				</title>
				<aug>
					<au>
						<snm>Buchman</snm>
						<fnm>TG</fnm>
					</au>
					<au>
						<snm>Kubos</snm>
						<fnm>KL</fnm>
					</au>
					<au>
						<snm>Seidler</snm>
						<fnm>AJ</fnm>
					</au>
					<au>
						<snm>Siegforth</snm>
						<fnm>MJ</fnm>
					</au>
				</aug>
				<source>Crit Care Med</source>
				<pubdate>1994</pubdate>
				<volume>22</volume>
				<fpage>750</fpage>
				<lpage>762</lpage>
				<xrefbib>
					<pubid idtype="pmpid">8181282</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B8">
				<title>
					<p>Predicting outcomes after liver transplantation. A connectionist approach</p>
				</title>
				<aug>
					<au>
						<snm>Doyle</snm>
						<fnm>HR</fnm>
					</au>
					<au>
						<snm>Dvorchik</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Mitchell</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Marino</snm>
						<fnm>IR</fnm>
					</au>
					<au>
						<snm>Ebert</snm>
						<fnm>FH</fnm>
					</au>
					<au>
						<snm>McMichael</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Fung</snm>
						<fnm>JJ</fnm>
					</au>
				</aug>
				<source>Ann Surg</source>
				<pubdate>1994</pubdate>
				<volume>219</volume>
				<fpage>408</fpage>
				<lpage>415</lpage>
				<xrefbib>
					<pubid idtype="pmpid">8161267</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B9">
				<title>
					<p>Modeling mortality in the intensive care unit: comparing the performance of a back-propagation, associative-learning neural network with multivariate logistic regression</p>
				</title>
				<aug>
					<au>
						<snm>Doig</snm>
						<fnm>GS</fnm>
					</au>
					<au>
						<snm>Inman</snm>
						<fnm>KJ</fnm>
					</au>
					<au>
						<snm>Sibbald</snm>
						<fnm>WJ</fnm>
					</au>
					<au>
						<snm>Martin</snm>
						<fnm>CM</fnm>
					</au>
					<au>
						<snm>Robertson</snm>
						<fnm>JM</fnm>
					</au>
				</aug>
				<source>Proc Annu Symp Comput Appl Med Care</source>
				<pubdate>1993</pubdate>
				<fpage>361</fpage>
				<lpage>365</lpage>
				<xrefbib>
					<pubid idtype="pmpid">8130495</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B10">
				<title>
					<p>Artificial neural network prediction of ventilator-induced lung edema formation</p>
				</title>
				<aug>
					<au>
						<snm>Hotchkiss</snm>
						<fnm>JR</fnm>
					</au>
					<au>
						<snm>Broccard</snm>
						<fnm>AF</fnm>
					</au>
					<au>
						<snm>Crooke</snm>
						<fnm>PS</fnm>
					</au>
				</aug>
				<source>Crit Care Med</source>
				<pubdate>2003</pubdate>
				<volume>31</volume>
				<fpage>2250</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1097/01.CCM.0000087328.59341.FC</pubid>
						<pubid idtype="pmpid" link="fulltext">12973193</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B11">
				<title>
					<p>Predicting hospital mortality for patients in the intensive care unit: a comparison of artificial neural networks with logistic regression models</p>
				</title>
				<aug>
					<au>
						<snm>Clermont</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Angus</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>DiRusso</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Griffin</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Linde-Zwirble</snm>
						<fnm>W</fnm>
					</au>
				</aug>
				<source>Crit Care Med</source>
				<pubdate>2001</pubdate>
				<volume>29</volume>
				<fpage>291</fpage>
				<lpage>296</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1097/00003246-200102000-00012</pubid>
						<pubid idtype="pmpid" link="fulltext">11246308</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B12">
				<title>
					<p>Understanding articles describing clinical prediction tools</p>
				</title>
				<aug>
					<au>
						<snm>Randolph</snm>
						<fnm>AG</fnm>
					</au>
					<au>
						<snm>Gordon</snm>
						<fnm>HG</fnm>
					</au>
					<au>
						<snm>Calvin</snm>
						<fnm>JE</fnm>
					</au>
					<au>
						<snm>Doig</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Richardson</snm>
						<fnm>WS</fnm>
					</au>
				</aug>
				<source>Crit Care Med</source>
				<pubdate>1998</pubdate>
				<volume>26</volume>
				<fpage>1603</fpage>
				<lpage>1612</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1097/00003246-199809000-00036</pubid>
						<pubid idtype="pmpid" link="fulltext">9751601</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B13">
				<title>
					<p>A technique for using neural network analysis to perform survival analysis of censored data</p>
				</title>
				<aug>
					<au>
						<snm>de Laurentiis</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Ravdin</snm>
						<fnm>PM</fnm>
					</au>
				</aug>
				<source>Cancer Lett</source>
				<pubdate>1994</pubdate>
				<volume>77</volume>
				<fpage>127</fpage>
				<lpage>138</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/0304-3835(94)90095-7</pubid>
						<pubid idtype="pmpid">8168059</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B14">
				<title>
					<p>Nervous about artificial neural networks?</p>
				</title>
				<aug>
					<au>
						<snm>Wyatt</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Lancet</source>
				<pubdate>1995</pubdate>
				<volume>346</volume>
				<fpage>1175</fpage>
				<lpage>1177</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0140-6736(95)92893-6</pubid>
						<pubid idtype="pmpid">7475653</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B15">
				<title>
					<p>On the misuses of artificial neural networks for prognostic and diagnostic classification in oncology</p>
				</title>
				<aug>
					<au>
						<snm>Schwarzer</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Vach</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Schumacher</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Stat Med</source>
				<pubdate>2000</pubdate>
				<volume>19</volume>
				<fpage>541</fpage>
				<lpage>561</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">10694735</pubid>
				</xrefbib>
			</bibl>
		</refgrp>
	</bm>
</art>

