Tuesday, March 20, 2012
Using Decision Tree Analysis to Identify Risk Factors for Relapse to Smoking
Probabilistic Part-of-Speech Tagging Using Decision Trees
In Probabilistic Part-of-Speech Tagging Using Decision Trees, author Helmut Schmid describes a new method for tagging words and their appropriate parts-of-speech in a tex by using a decision tree. In languages such as English, where a word can have several different meanings depending on the context (try “store”), highly accurate tagging can improve the ability of programming like spell-check to properly recognize and correct for errors. Earlier methods for tagging often degraded as the size of the text decreased, creating problems in accuracy. Schmid’s new method using a decision tree was intended to greatly increase the accuracy of probabilistic tagging regardless of the size of the text.
Summary:
In an effort to improve on the accuracy of probabilistic part-of-speech tagging, Helmut Schmid developed a method using a binary decision tree to develop paths that accurately predict the probability of a word by using the context around it. Each estimation of a word’s part-of-speech begins with a root node. If looking for a specific type of word, such as a noun preceded by an adjective, the root node asks if the preceding word is an adjective or not. The decision tree follows a path until it comes to an ending node (leaf) that determines the appropriate probabilities of what the word is.
In the design of the decision tree, each root node asks a simple question of the context surrounding a word. In the example above, this is whether the preceding word is an adjective or not. If the answer is “yes”, then the program follows to the next node. This node refines the search by asking more detail of preceding and following words to improve context. This path
continues until it comes to a node that leads only to leafs, in which case the word is then weighted and probabilities are given for the word’s part of speech.
The decision tree also incorporated probabilities and parts-of-speech from “fullform” and “suffix” lexicons. The “fullform” lexicon was built on a large tagged text to present probabilities in other texts. The lexicon allows for searches of words and their potential parts-of-speech. As interesting is the “suffix” lexicon, which is built on a decision tree. Starting at a root node (labeled “#”), the tree follows a suffix backward from the last letter to the first of the suffix. For example, when searching for the word “tagging” in the suffix lexicon, the tree starts at the root “#” then proceeds to the “g” node, then “n” node and finally the “i” node. Once there, the lexicon gives the probability of the word and its part-of-speech.
In Schmid’s testing, the TreeTagger was able to tag 10,000 words a second and over 2 million words in 6 minutes. After testing, TreeTagger turned out to be more accurate than other methods for tagging a large training text (corpus). In addition, the decision tree method allowed TreeTagger to tag smaller texts significantly more accurately. While other methods saw significant degradation in accuracy as the number of words dropped below 10,000, TreeTagger’s use of decision trees allowed it to maintain a respectable 84 percent accuracy.
Conclusion:
The TreeTagger method uses a decision tree and probabilities for accurate part-of-speech tagging in texts. The decision trees allow the method to work efficiently and accurately with texts regardless of the size while other methods saw significant reductions in accuracy as texts decreased in size and offered fewer examples for them to draw on. TreeTagger instead uses prebuilt lexicons and decision trees to track context and words.
This is an interesting way to employ decision trees. While it steers away from the intelligence field in application, its use of probabilities provides a different perspective on decision trees and probability. The application could be changed by switching out words for conditions or facts and linguistic contexts for cultural, social, and political contexts. While this is easier said than done, it may be an avenue of research worth pursuing in the intelligence field, especially since the end goal of both is to produce an accurate probability or estimate.
Source:
Schmid, H. (1994). Probabilistic part-of-speech tagging using decision trees. Proceedings of International Conference on New Methods in Language Processing. 12(4) pp 44-49
Retrieved from: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.28.1139
Decision Trees in Intelligence Analysis
Introduction:
Written during the Cold War, Edwin Sapp’s article on decision trees begins by noting the large amount of information available to intelligence analysts, the sophistication and seriousness of the weapons available, and ever-shortening time span in which effective decisions must be made. Sapp advocates the decision tree as a competent method for processing large amounts of data, while at the same time communicating degrees of certainty to an extent which language cannot.
Edwin Greenlaw Sapp begins his article entitled Decision Trees by discussing the nature of intelligence analysis. He identifies four major categories that intelligence requirements fall into:
1) places (geographic locations, physical resources);
2) people (strengths and attitudes);
3) organizations (what people form and belong to); and
4) objects (things people make and possess; cities and weapons systems
Critique:
Can decision trees really handle the large amounts of data that Sapp claims?
How accurate can assigning probabilities be when such probabilities are related to intelligence problems?
Sapp, E. G. (1974, Winter). Decision Trees. Intelligence Studies, pp. 45-57 (declassified).
Using Decision Trees for Predicting Student Academic Performance
Introduction:
In this study, authors Anupama Kumar and Dr. Vijayalakshmi use decision trees to predict student academic performance. Data will be used to help tutors make better decisions regarding student success and academic needs.
Summary:
The authors look to find a way use a combined method of decision trees and rule mining to predict student performance in upcoming tests, quizzes, etc for the use of tutors to create a better way to teach these students. In this study, the authors use a decision tree algorithm called C4.5 to predict whether students will pass or fail said exam. The outcome of the decision tree analysis was then used in a comparative analysis which stated the prediction helped students who were struggling, and challenged excelling students to reach higher levels of success.
Conclusion:
Decision trees are a successful way to predict student outcome and improve their results based on prior educational data. Decision tree algorithms can be better rated for efficiency based on their accuracy and time taken to derive the tree.
Source:
Kumar, S. Anupama, Vijayalakshmi, Dr., (2009). Efficiency of decision trees in predicting student’s academic performance. Retrieved from http://airccj.org/CSCP/vol1/csit1230.pdf
Decision Trees and their Nodes
Summary
In the paper, Probabilistic Approaches: Scenario Analysis, Decision Trees and Simulations, the author looks at decision trees as an assessment of risk in a sequence. Therefore, the subject in question must pass a series of tests, failure at any point leads to a complete loss of value. The example given is pharmaceutical drugs.
Technique
The decision tree is broken down into distinct categories. Root nodes, decision nodes, event nodes and end nodes. A root node is at the beginning and where the decision maker has a decision choice or an uncertain outcome. An event node represents the possible outcomes on a given decision. You have to figure out the possible outcomes and their likelihood for this node. The decision nodes represent choices that can be made by a decision maker. The end nodes represent the final outcomes of the decision tree.

Strengths
- By linking actions and choices – decision trees give decision makers a framework and make them think about the consequences.
- Value of information – having to think through this process gives the decision maker insight on how valuable this information is.
- Decision trees act as a form of risk management. If the decision doesn’t pass each test – it may be too risky to undertake.
- They are easy to construct and give definitive answers.
Challenges
- There is no wiggle room or room to maneuver.
- Multiple risks are hard to assess at the same time. This is a linear process at each stage.
- Event nodes require estimates of outcomes. This is subjective in a lot of cases.
- The use of the decision tree depends entirely on the decision makers willingness to stick to it strictly.
Conclusions
Decision tree analysis is particularly useful when there are discrete outcomes. However, as a process becomes more complicated and the number of outcomes increases, it is harder to use this tool effectively. Its ease of use and ability to make decision makers think about the choices and consequences makes this a tool that can be applied to many different situations.
Bibliography
Probabilistic Approaches: Scenario Analysis, Decision Trees and Simulations. Stern. NYU. Retrieved from; http://people.stern.nyu.edu/adamodar/pdfiles/papers/probabilistic.pd
Constructing a decision tree from data with hierarchical class labels
| Interesting? |
Monday, March 19, 2012
A Decision Tree System For Finding Genes In DNA
|
Sample decision tree for classifying human DNA
|



