EvallA Framework for Information Systems Evaluation
- Gonzalo Arroyo, Julio
- Verdejo Maillo, María Felisa
- Amigó Cabrera, Enrique
- Carrillo-de-Albornoz, Jorge
ISSN: 1135-5948
Year of publication: 2016
Issue: 57
Pages: 189-192
Type: Article
More publications in: Procesamiento del lenguaje natural
Abstract
In this paper, the Evall framework for the automatic evaluation of information systems task is presented. With just one click and providing the system outputs of the algorithms, Evall allows researchers to automatically generate a Latex report including the results of their algorithms, statistical significance tests, measures descriptions, and references.
Bibliographic References
- Amigó, E., J. Gonzalo, and F. Verdejo. 2013. A general evaluation measure for document organization tasks. In Proceedings of the 36th International ACM SIGIR Conference, pages 643{652, New York, USA.
- Cohen, J. 1960. A Coefficient of Agreement for Nominal Scales. Educational and Psychological Measurement, 20(1):37.
- Cormack, G. V. and T. R. Lynam. 2005. Trec 2005 spam track overview. In TREC.
- Ghosh, J. 2003. Scalable clustering methods for data mining. In N. Ye, editor, Handbook of Data Mining. Lawrence Erlbaum.
- Halkidi, M., Y. Batistakis, and M. Vazirgiannis. 2001. On Clustering Validation Techniques. J. of Int. Inf. Syst., 17(2-3):107-145.
- Hull, D. A. 1998. The TREC-7 filtering track: description and analysis. In Proc. of TREC-7, 7th Text Retrieval Conf., pages 33-56.
- Järvelin, K. and J. Kekäläinen. 2002. Cumulated gain-based evaluation of ir techniques. ACM Trans. Inf. Syst., 20:422-446, October.
- Meila, M. 2003. Comparing clusterings. In Proceedings of COLT 03.
- Moffat, A. and J. Zobel. 2008. Rankbiased precision for measurement of retrieval effectiveness. ACM Trans. Inf. Syst., 27(1):2:1-2:27, December.
- Steinbach, M., G. Karypis, and V. Kumar. 2000. A comparison of document clustering techniques.
- Xu, W., X. Liu, and Y. Gong. 2003. Document clustering based on non-negative matrix factorization. In Proc. of the 26th annual Int. ACM SIGIR Conf., pages 267-273.