Search

Scholarly Works (3 results)

Sort By:

Article
Peer Reviewed

Automated Essay Scoring in Innovative Assessments of Writing from Sources

Journal of Writing Assessment, Volume 6, Issue 1 (2013)

This study examined automated essay scoring for experimental tests of writing from sources. These tests (part of the CBAL research initiative at ETS) embed writing tasks within a scenario in which students read and respond to sources. Two large-scale pilots are reported: One was administered in 2009, in which four writing assessments were piloted, and one was administered in 2011, in which two writing assessments and two reading assessments were administered. Two different rubrics were applied by human raters to each prompt: a general rubric intended to measure only those skills for which automated essay scoring provides relatively direct measurement, and a genre-specific rubric focusing on specific skills such as argumentation and literary analysis. An automated scoring engine (e-rater) was trained on part of the 2009 dataset, and cross-validated against the remaining 2009 dataset and all the 2011 data. The results indicated that automated scoring can achieve operationally acceptable levels of accuracy in this context. However, differentiation between the general rubric and the genre-specific rubric reinforces the need to achieve full construct coverage by supplementing automated scoring with additional sources of evidence.

Cover page: Automated Essay Scoring in Innovative Assessments of Writing from Sources

Creative Commons 'BY-NC-ND' version 4.0 license

Article
Peer Reviewed

Predictive value of hippocampal internal architecture asymmetry in temporal lobe epilepsy

UC San Francisco Previously Published Works (2013)

Background

Asymmetry of hippocampal internal architecture (HIA) clarity has been suggested to be a sign of hippocampal sclerosis (HS) and is frequently associated with other MRI findings of HS. The goal of this work is to use a previously developed HIA visual scoring system (Ver Hoef et al., 2013) to quantify HIA asymmetry in a retrospective series of consecutive temporal lobe epilepsy (TLE) patients and evaluate its value in predicting laterality of seizure onset both in patients with other signs of HS (HS+) and those without (HS-).

Methods

The HIA scoring system was used to rate hippocampal asymmetry and to assess the agreement between HIA and seizure lateralization. The median values of the average HIA scores for each hippocampus were compared for HS+ epileptogenic hippocampi, HS- epileptogenic hippocampi, and non-epileptogenic hippocampi with a Kruskal-Wallis one-way analysis of variance by ranks. Pair-wise differences between groups were evaluated with the two-tailed Mann-Whitney U test. A logistic regression model examined the utility of average HIA asymmetry score in predicting the true laterality of seizure onset as determined by video-EEG. Sensitivity and specificity are calculated using various asymmetry thresholds in each patient group.

Results

Fifty-five patients were identified who met inclusion criteria. Thirteen patients (24%) were found to have hippocampal atrophy and/or signal abnormality indicative of HS (HS+) and 42 did not (HS-). Significant differences were observed in the distribution of individual and average HIA scores between each of the groups of hippocampi, with HS+ hippocampi having the lowest HIA scores and non-epileptogenic hippocampi having the highest. Logistic regression analysis showed that the average HIA asymmetry score was a strong predictor of the laterality of seizure onset (β=3.93508, p<0.001). HIA asymmetry remained significant even after adjustment for HS+/HS- status (β=3.8854, p<0.001). Among HS- patients, when the average HIA asymmetry score was equal to or exceeded a threshold value of 0.5, the specificity for correctly predicting the side of seizure onset was between 95% and 100% with a sensitivity of 40-45%. Among HS+ patients, a threshold of 0.3 yielded a sensitivity of 85% and specificity of 100%.

Conclusions

In this report we show for the first time that HIA asymmetry is a significant predictor of the laterality of seizure onset in TLE patients with otherwise normal MRI findings, and that the proposed HIA scoring system has high specificity and moderate sensitivity for lateralizing seizure onset in patients with TLE.

Cover page: Predictive value of hippocampal internal architecture asymmetry in temporal lobe epilepsy

Article
Peer Reviewed

Evaluating hippocampal internal architecture on MRI: Inter-rater reliability of a proposed scoring system

UC San Francisco Previously Published Works (2013)

Background

Asymmetry of hippocampal internal architecture (HIA) has been reported to be a frequent imaging finding in epilepsy patients with temporal lobe epilepsy (TLE) who exhibit other signs of hippocampal sclerosis. HIA asymmetry may also be an independent predictor of the side of seizure onset in patients with otherwise normal MRI scans. The study of HIA asymmetry and its relationship to the laterality of TLE would benefit from a reliable method of assessing the clarity of HIA in MRI scans. We propose a visual scoring system that rates HIA clarity from 1 (imperceptible) to 4 (excellent) and report the inter-rater reliability (IRR) of this system.

Methods

In the initial preliminary phase of this study we examined IRR using a kappa statistic (κ) among a mixed group of expert and non-expert reviewers using only a brief description of the scoring system to score single images from a series of patients. In the second phase we explored the effect of training on the use of our HIA scoring system by assessing IRR among neuroimaging experts before and after a brief interactive training session. In this phase, multiple slices from each patient were scored. Separate κ values and intraclass correlation coefficients (ICC) were calculated from the scores given to each hippocampal image and from the asymmetry of scores between left and right for each slice. In the third phase the effect of training on non-expert reviewers was explored using a similar approach as with the expert reviewers.

Results

In the preliminary phase of the study, HIA scoring of single images showed substantial agreement among expert reviewers (κHIA=0.65), fair agreement among non-expert reviewers (κHIA=0.27), and a fair to moderate degree of agreement among all the reviewers as a whole (κHIA=0.40). In the second phase, prior to training there was substantial agreement among expert reviewers in regard to the individual HIA scores (κHIA=0.62; ICCHIA=0.81) but only moderate agreement on the degree of asymmetry (κAsym=0.47; ICCAsym=0.71). Training improved agreement on the individual HIA scores (κHIA=0.58-0.72; ICCHIA=0.76-0.84) and on the degree of asymmetry (κAsym=0.61-0.67; ICCAsym=0.81-0.85). Among non-expert reviewers, scores improved from only a fair degree of agreement pre-training (κHIA=0.25, κAsym=0.25; ICCHIA=0.68, ICCAsym=0.66) to a moderate level of agreement after training (κHIA=0.54, κAsym=0.52; ICCHIA=0.78, ICCAsym=0.81).

Conclusions

The proposed HIA scoring system has a substantial degree of inter-rater reliability among experienced neuroimaging reviewers. Training improves the detection of asymmetries in HIA score in particular. Non-expert reviewers can employ the system with a moderate degree of reliability, and training has an even greater impact on the improvement of scoring reliability.

Cover page: Evaluating hippocampal internal architecture on MRI: Inter-rater reliability of a proposed scoring system