summaryrefslogtreecommitdiff
path: root/paper/method.tex
diff options
context:
space:
mode:
authorTimotej Lazar <timotej.lazar@fri.uni-lj.si>2017-04-16 12:39:44 +0200
committerTimotej Lazar <timotej.lazar@fri.uni-lj.si>2017-04-16 12:39:44 +0200
commitc7a948f57d1bea6c58e4451b948d3c400edf0007 (patch)
treeeb6e3b752155498dfd2d8f22a44a365b6356571a /paper/method.tex
parent5e8682d06866e6d670df2967daa118983def1208 (diff)
Shorten some references
Diffstat (limited to 'paper/method.tex')
-rw-r--r--paper/method.tex2
1 files changed, 1 insertions, 1 deletions
diff --git a/paper/method.tex b/paper/method.tex
index b52b74f..1cd3497 100644
--- a/paper/method.tex
+++ b/paper/method.tex
@@ -28,7 +28,7 @@ Since programs can be validated with appropriate unit tests, our goal is not cla
Before explaining the algorithm, let us discuss the reasons why a program can be incorrect. Our experience indicates that bugs in student programs can often be described by 1) some incorrect or \emph{buggy} pattern, which needs to be removed, or 2) some missing relation (pattern) that should be included before the program can be correct. We shall now explain how both types of errors can be identified with rules.
-To discover buggy patterns, the algorithm first learns rules that describe incorrect programs (I-rules). We use a variant of the CN2 algorithm~\cite{cn21991} implemented within the Orange data-mining toolbox~\cite{demsar2013orange}. Since we use rules to generate hints, and since hints should not be presented to students unless they are likely to be correct, we impose additional constraints on the rule learner:
+To discover buggy patterns, the algorithm first learns rules that describe incorrect programs (I-rules). We use a variant of the CN2 algorithm~\cite{clark1991rule} implemented within the Orange data-mining toolbox~\cite{demsar2013orange}. Since we use rules to generate hints, and since hints should not be presented to students unless they are likely to be correct, we impose additional constraints on the rule learner:
\begin{enumerate}
\item The classification accuracy of each learned rule must exceed a threshold (we selected 90\%, as 10\% error seems acceptable for our application).
\item Each conjunct in a condition must be significant with respect to the likelihood-ratio test (in our experiments significance threshold was set to $p=0.05$).