This course is about scalable approaches to processing large amounts of information (terabytes and even petabytes). We focus mostly on MapReduce, which is presently the most accessible and practical means of computing at this scale, but will discuss other approaches as well.
The main task of the GenIELex project is the development of a biochemistry specific lexicon as well as of an annotated corpus for the evaluation of the system. The need for the construction of such a lexicon is illustrated by the following figures, based
The main task of the GenIELex project is the development of a biochemistry specific lexicon as well as of an annotated corpus for the evaluation of the system. The need for the construction of such a lexicon is illustrated by the following figures, based
A. Bogoni, W. Xiaoxia, I. Fazal, and A. Willner. Optical Fiber Communication - incudes post deadline papers, 2009.
OFC 2009. Conference on, page 1-3--. (2009)
C. Paris, N. Colineau, and R. Wilkinson. HT '09: Proceedings of the Twentieth ACM Conference on Hypertext and Hypermedia, New York, NY, USA, ACM, (July 2009)
G. Neumann, and B. Sacaleanu. Evaluating Systems for Multilingual and Multimodal Information Access: 4th Workshop of the Cross-Language Evaluation Forum, CLEF 2003, Trondheim, Norway, volume 3237 of Lecture Notes in Computer Science, Springer, Berlin, (2004)
D. Ferrucci, and A. Lally. Proceedings of the HLT-NAACL 2003 Workshop on Software Engineering and Architecture of Language Technology Systems (SEALTS), Edmonton, Canada, page 67-74. (2003)