context label split
Thus , over the past few years , along with advances in the use of learning and statistical methods for acquisition of full parsers ( <CITED HERE> ; Charniak , 1997a ; Charniak , 1997b ; Ratnaparkhi , 1997 ) , significant progress has been made on the use of statistical learning methods to recognize shallow parsing patterns -- syntactic phrases or words that participate in a syntactic relationship ( Church , 1988 ; Ramshaw and Marcus , 1995 ; Argamon et al. , 1998 ; Cardie and Pierce , 1998 ; Munoz et al. , 1999 ; Punyakanok and Roth , 2001 ; Buchholz et al. , 1999 ; Tjong Kim Sang and Buchholz , 2000 ) . Background train
This was done by MERT optimization ( <CITED HERE> ) towards post-edits under the TER target metric . Uses train
She evaluates 3,000 German verbs with a token frequency between 10 and 2,000 against the Duden ( <CITED HERE> ) . Background train
The following four components have been identified as the key elements of a question related to patient care ( <CITED HERE> ) : Background train
<CITED HERE> report on manually analyzing an open-class vocabulary of 35,000 head words for predicate subcategorization information and comparing the results against the subcategorization details in COMLEX . CompareOrContrast train
This is roughly an 11 % relative reduction in error rate over <CITED HERE> and Bod 's PCFG-reduction reported in Table 1 . CompareOrContrast train
Finally , feedback expressions ( head nods and shakes ) are successfully predicted from speech , prosody and eye gaze in interaction with Embodied Communication Agents as well as human communication ( Fujie et al. , 2004 ; <CITED HERE> ; Morency et al. , 2007 ; Morency et al. , 2009 ) . Background train
We use the agreement checker code developed by <CITED HERE> and evaluate our baseline ( MaltParser using only CORE12 ) , best performing model ( Easy-First Parser using CORE12 + DET+LMM+PERSON+FN * NGR g + p ) , and the gold reference . Uses train
The diagnoser , based on <CITED HERE>b ) , outputs a diagnosis which consists of lists of correct , contradictory and non-mentioned objects and relations from the student 's answer . Extends train
The formalization of DLRs provided by <CITED HERE> defines a formal lexical rule specification language and provides a semantics for that language in two steps : A rewrite system enriches the lexical rule specification into a fully explicit description of the kind shown in Figure 1 . Background train
Similar to our previous work ( Chan and Ng , 2005b ) , we used the supervised WSD approach described in ( <CITED HERE> ) for our experiments , using the naive Bayes algorithm as our classifier . Uses train
Similar to ( <CITED HERE>a ) , our summarization system is , which consists of three key components : an initial sentence pre-selection module to select some important sentence candidates ; the above compression model to generate n-best compressions for each sentence ; and then an ILP summarization method to select the best summary sentences from the multiple compressed sentences . CompareOrContrast train
The extraction procedure consists of three steps : First , the bracketing of the trees in the Penn Treebank is corrected and extended based on the approaches of <CITED HERE> and Collins ( 1997 ) . Background train
transition-based dependency parsing framework ( <CITED HERE> ) using an arc-eager transition strategy and are trained using the perceptron algorithm as in Zhang and Clark ( 2008 ) with a beam size of 8 . Uses train
But their importance has grown far beyond machine translation : for instance , transferring annotations between languages ( Yarowsky and Ngai 2001 ; Hwa et al. 2005 ; Ganchev , Gillenwater , and Taskar 2009 ) ; discovery of paraphrases ( <CITED HERE> ) ; and joint unsupervised POS and parser induction across languages ( Snyder and Barzilay 2008 ) . Motivation train
each relevant document is retrieved ( <CITED HERE> ) . Uses train
<CITED HERE> have previously examined the task of categorizing sentences in medical abstracts using supervised discriminative machine learning techniques . Background train
This recognizer incrementally outputs word hypotheses as soon as they are found in the best-scored path in the forward search ( <CITED HERE> ) using the ISTAR ( Incremental Structure Transmitter And Receiver ) protocol , which conveys word graph information as well as word hypotheses . Background train
The recent great advances in speech and language technologies have made it possible to build fully implemented spoken dialogue systems ( <CITED HERE> ; Allen et al. , 1996 ; Zue et al. , 2000 ; Walker et al. , 2000 ) . Background train
We posit that this would not have a significant effect on the results , in particular for MML-based classification techniques , such as Decision Graphs ( <CITED HERE> ) . Background val
<CITED HERE> showed how the perceptron algorithm can be used to efficiently compute the best parse with DOP1 's subtrees , reporting a 5.1 % relative reduction in error rate over the model in Collins ( 1999 ) on the WSJ . Background train
Tateisi et al. also translated LTAG into HPSG ( <CITED HERE> ) . CompareOrContrast train
The system utilizes several large size biological databases including three NCBI databases ( GenPept [ 11 ] , RefSeq <CITED HERE> , and Entrez GENE [ 13 ] ) , PSD database from Protein Information Resources ( PIR ) [ 14 ] , and Uses train
The LM uses the monolingual data and is trained as a five-gram using the SRILM-Toolkit ( <CITED HERE> ) . Uses train
There has also been work focused upon determining the political leaning ( e.g. , `` liberal '' vs. `` conservative '' ) of a document or author , where most previously-proposed methods make no direct use of relationships between the documents to be classified ( the `` unlabeled '' texts ) ( <CITED HERE> ; Efron , 2004 ; Mullen and Malouf , 2006 ) . Background train
A number of speech understanding systems have been developed during the past fifteen years ( Barnett et al. 1980 , Dixon and Martin 1979 , Erman et al. 1980 , Haton and Pierrel 1976 , Lea 1980 , <CITED HERE> , Medress 1980 , Reddy 1976 , Walker 1978 , and Wolf and Woods 1980 ) . CompareOrContrast train
Secondly , the cooperative principle of <CITED HERE> , 1978 ) , under the assumption that referential levels of a writer and a reader are quite similar , implies that the writer should structure the text in a way that makes the construction of his intended model easy for the reader ; and this seems to imply that he should appeal only to the most direct knowledge of the reader . Motivation train
In modern syntactic theories ( e.g. , lexical-functional grammar [ LFG ] [ Kaplan and Bresnan 1982 ; Bresnan 2001 ; Dalrymple 2001 ] , head-driven phrase structure grammar [ HPSG ] [ Pollard and Sag 1994 ] , tree-adjoining grammar [ TAG ] [ <CITED HERE> ] , and combinatory categorial grammar [ CCG ] [ Ades and Steedman 1982 ] ) , the lexicon is the central repository for much morphological , syntactic , and semantic information . Background train
In this paper , inspired by KNN-SVM ( <CITED HERE> ) , we propose a local training method , which trains sentence-wise weights instead of a single weight , to address the above two problems . Motivation train
1990 ) , on linguistic acquisition ( by the use of Part-of-Speech filters hand-crafted by a linguist ) ( Oueslati , 1999 ) or , more frequently , on a combination of the two ( Smadja , 1993 ; <CITED HERE> , for example ) . CompareOrContrast train
<CITED HERE> report excellent part-of-speech tagging results using a handcrafted approach that is close to OT . More speculatively , imagine an OT grammar for stylistic revision of parsed sentences . Background train
Thus for instance , ( <CITED HERE> ; Copestake et al. , 2001 ) describes a Head Driven Phrase Structure Grammar ( HPSG ) which supports the parallel construction of a phrase structure ( or derived ) tree and of a semantic representation and ( Dalrymple , 1999 ) show how to equip Lexical Functional grammar ( LFG ) with a glue semantics . Background train
While many linguistic theories state subcategorization requirements in terms of phrase structure ( CFG categories ) , <CITED HERE> questions the viability and universality of such an approach because of the variety of ways in which grammatical functions may be realized at the language-specific constituent structure level . Background train
We are going to make such a comparison with the theories proposed by J. <CITED HERE> , 1982 ) that represent a more computationally oriented approach to coherence , and those of T.A. van Dijk and W. Kintsch ( 1983 ) , who are more interested in addressing psychological and cognitive aspects of discourse coherence . CompareOrContrast train
From ( <CITED HERE> ) , we find that the performance of the SAMT system is similar to that of the method of labeling SCFG rules with POS tags . Motivation train
In this paper we focus on the exploitation of the LDOCE grammar coding system ; Alshawi et al. ( 1985 ) and <CITED HERE> describe further research in Cambridge utilising different types of information available in LDOCE . Background train
<CITED HERE> observed that some annotators were not familiar with the exact definition of semantic relatedness . Motivation train
All current approaches to monolingual TE , either syntactically oriented ( Rus et al. , 2005 ) , or applying logical inference ( Tatu and Moldovan , 2005 ) , or adopting transformation-based techniques ( <CITED HERE> ; Bar-Haim et al. , 2008 ) , incorporate different types of lexical knowledge to support textual inference . Background train
Although originally developed as a tool to assist in query formulation , <CITED HERE> pointed out that PICO frames can be employed to structure IR results for improving precision . Background train
There has been some controversy , at least for simple stemmers ( Lovins , 1968 ; Porter , 1980 ) , about the effectiveness of morphological analysis for document retrieval ( <CITED HERE> ; Krovetz , 1993 ; Hull , 1996 ) . Background train
in history-based models ( <CITED HERE> ) , the probability estimate for each derivation decision d_i is conditioned on the previous derivation decisions d_1 , ... , d_{i-1} , which is called the derivation history at step i . Uses train
The reader is referred to <CITED HERE> for a more detailed discussion of our use of constraint propagation . We illustrate the result of constraint propagation with our example grammar . Background train
Such tools make it easy to run most current approaches to statistical markup , chunking , normalization , segmentation , alignment , and noisy-channel decoding , including classic models for speech recognition ( Pereira and Riley , 1997 ) and machine translation ( <CITED HERE> ) . Background train
Such technologies require significant human input , and are difficult to create and maintain ( <CITED HERE> ) . Background train
We can define PCAT using a probabilistic grammar ( <CITED HERE> ) . Background val
It is these orthographic variations and complex morphological structure that make Arabic language processing challenging ( <CITED HERE> ; Xu et al. , 2002 ) . Background train
The first is the one used in the chunking competition in CoNLL-2000 ( Tjong Kim <CITED HERE> ) . Uses train
There have been many studies on parsing techniques ( Poller and Becker , 1998 ; Flickinger et al. , 2000 ) , ones on disambiguation models ( Chiang , 2000 ; <CITED HERE> ) , and ones on programming/grammar-development environ - Background train
For example , <CITED HERE> experimented with abstracts and full article texts in the task of automatically generating index term recommendations and discovered that using full article texts yields at most a 7.4 % improvement in F-score . Background train
The ten most specific nouns have been produced by comparing our corpus of computing to the French corpus Le Monde , composed of newspaper articles ( <CITED HERE> ) . Uses train
Secondly , we need to investigate techniques for identifying identical documents , virtually identical documents and highly repetitive documents , such as those pioneered by <CITED HERE>b ) and shingling techniques described by Chakrabarti ( 2002 ) . Future train
Following the work of <CITED HERE> , we implement a linear-chain CRF merging system using the following features : stemmed ( separated ) surface form , part-of-speech and frequencies from the training corpus for bigrams/merging of word and word +1 , word as true prefix , word +1 as true suffix , plus frequency comparisons of these . Uses train
<CITED HERE> attempts to improve on the approach of Brent ( 1993 ) by passing raw text through a stochastic tagger and a finite-state parser ( which includes a set of simple rules for subcategorization frame recognition ) in order to extract verbs and the constituents with which they co-occur . Background train
From an IR view , a lot of specialized research has already been carried out for medical applications , with emphasis on the lexico-semantic aspects of dederivation and decomposition ( Pacak et al. , 1980 ; Norton and Pacak , 1983 ; Wolff , 1984 ; Wingert , 1985 ; Dujols et al. , 1991 ; <CITED HERE> ) . Background val
Rather than producing a complete analysis of sentences , the alternative is to perform only partial analysis of the syntactic structures in a text ( <CITED HERE> ; Abney , 1991 ; Greffenstette , 1993 ) . Background train
Thus , over the past few years , along with advances in the use of learning and statistical methods for acquisition of full parsers ( Collins , 1997 ; <CITED HERE>a ; Charniak , 1997b ; Ratnaparkhi , 1997 ) , significant progress has been made on the use of statistical learning methods to recognize shallow parsing patterns -- syntactic phrases or words that participate in a syntactic relationship ( Church , 1988 ; Ramshaw and Marcus , 1995 ; Argamon et al. , 1998 ; Cardie and Pierce , 1998 ; Munoz et al. , 1999 ; Punyakanok and Roth , 2001 ; Buchholz et al. , 1999 ; Tjong Kim Sang and Buchholz , 2000 ) . Background train
Problems such as these have motivated research on more abstract , dependency-based parser evaluation ( e.g. , Lin 1995 ; Carroll , Briscoe , and Sanfilippo 1998 ; Carroll et al. 2002 ; Clark and Hockenmaier 2002 ; King et al. 2003 ; Preiss 2003 ; <CITED HERE> ; Miyao and Tsujii 2004 ) . Motivation train
They are widely used in MT as a way to figure out how to translate input in one language into output in another language ( <CITED HERE> ) . Background train
Further details about the properties of entropy can be found in textbooks on information theory ( e.g. , <CITED HERE> ) . Background train
<CITED HERE> , for example , discussed the transcripts of a dialogue between people who assemble a piece of garden furniture ( originally recorded by Candy Sidner ) . Background train
Here , PV ( A ) represents an ascent direction chosen as follows : For inequality constraints , it is the projected gradient ( <CITED HERE> ) ; for equality constraints with slack , we use conjugate gradient ( Nocedal and Wright 1999 ) , noting that when A = 0 , the objective is not differentiable . Uses val
Some methods are based on likelihood ( Och and Ney , 2002 ; Blunsom et al. , 2008 ) , error rate ( <CITED HERE> ; Zhao and Chen , 2009 ; Pauls et al. , 2009 ; Galley and Quirk , 2011 ) , margin ( Watanabe et al. , 2007 ; Chiang et al. , 2008 ) and ranking ( Hopkins and May , 2011 ) , and among which minimum error rate training ( MERT ) ( Och , 2003 ) is the most popular one . Motivation train
This idea was inspired by <CITED HERE> , who used a list of arguments surrounding the main verb together with the verb 's subcategorization information and previously processed examples to analyse semantic roles ( case relations ) . Motivation train
, `` domain circumscription '' ( cf. <CITED HERE> ) , and their kin . CompareOrContrast train
While these approaches have been reasonably successful ( see Mitkov ( 2002 ) ) , <CITED HERE> speculate that deeper linguistic knowledge needs to be made available to resolvers in order to reach the next level of performance . Background train
A parsing experiment shows that an efficient HPSG parser with the obtained grammar achieved a significant speed-up against an existing LTAG parser ( <CITED HERE> ) . CompareOrContrast train
Features were discovered from the actual sentence plan trees that the SPG generated through the feature derivation process described below , in a manner similar to that used by <CITED HERE> . CompareOrContrast train
Such questions are typically answered by designing appropriate priming experiments ( <CITED HERE> ) or other lexical decision tasks . Background train
The system is in the form of an agenda-driven chart-based parser whose foundation is similar to previous formalizations of Chomsky 's Minimalist Program ( <CITED HERE> ; Harkema , 2000 ; Niyogi , 2001 ) . CompareOrContrast train
<CITED HERE> and Akkerman et al. ( 1985 ) provide a more detailed analysis of the information encoded by the LDOCE grammar codes and discuss their efficacy as a system of linguistic description . Background train
The research described below is taking place in the context of three collaborative projects ( Boguraev , 1987 ; Russell et al. , 1986 ; <CITED HERE> ) to develop a general-purpose , wide coverage morphological and syntactic analyser for English . Background train
The EDR has close ties to the named entity recognition ( NER ) and coreference resolution tasks , which have been the focus of several recent investigations ( Bikel et al. , 1997 ; Miller et al. , 1998 ; Borthwick , 1999 ; Mikheev et al. , 1999 ; Soon et al. , 2001 ; <CITED HERE> ; Florian et al. , 2004 ) , and have been at the center of evaluations such as : MUC-6 , MUC-7 , and the CoNLL '02 and CoNLL '03 shared tasks . Background train
Viewed in this way , gradable adjectives are an extreme example of the `` efficiency of language '' ( <CITED HERE> ) : Far from meaning something concrete like `` larger than 8 cm '' -- a concept that would have very limited applicability -- or even something more general like `` larger than the average N , '' a word like large is applicable across a wide range of different situations . CompareOrContrast train
However , since work in this direction started , significant progress has also been made in the research on statistical learning of full parsers , both in terms of accuracy and processing time ( Charniak , 1997b ; Charniak , 1997a ; <CITED HERE> ; Ratnaparkhi , 1997 ) . Background train
It is defined on different kinds of textual units , e.g. documents , parts of a document ( e.g. words and their surrounding context ) , words or concepts ( <CITED HERE> ) . Linguistic distance between words is inverse to their semantic similarity or relatedness . Background train
However , the greatest increase is in the amount of raw text available to be processed , e.g. the English Gigaword Corpus ( Linguistic Data <CITED HERE> ) . Background val
For example , such a schema can serve as a means to represent translation examples , or find structural correspondences for the purpose of transfer grammar learning ( Menezes & Richardson , 2001 ) , ( Aramaki et al. , 2001 ) , ( <CITED HERE> ) , ( Meyers et al. , 2000 ) , ( Matsumoto et al. , 1993 ) , ( Kaji et al. , 1992 ) , and example-based machine translation ( EBMT ) ( Sato & Nagao , 1990 ) , ( Sato , 1991 ) , ( Richardson et al. , 2001 ) , ( Al-Adhaileh & Tang , 1999 ) . Background val
<CITED HERE> introduced the log-linear model for statistical machine translation ( SMT ) , in which translation is considered as the following optimization problem : Background train
Representative systems are described in Boisen et al. ( 1989 ) , De Mattia and Giachin ( 1989 ) , Niedermair ( 1989 ) , <CITED HERE> , and Young ( 1989 ) . Background train
For compound splitting , we follow <CITED HERE> , using linguistic knowledge encoded in a rule-based morphological analyser and then selecting the best analysis based on the geometric mean of word part frequencies . Uses train
<CITED HERE> showed that the use of a POS factor only results in negligible BLEU improvements , but we need access to the POS in our inflection prediction models . Background train
Over the last decade there has been a lot of interest in developing tutorial dialogue systems that understand student explanations ( Jordan et al. , 2006 ; Graesser et al. , 1999 ; Aleven et al. , 2001 ; Buckley and Wolska , 2007 ; Nielsen et al. , 2008 ; <CITED HERE> ) , because high percentages of self-explanation and student contentful talk are known to be correlated with better learning in human-human tutoring ( Chi et al. , 1994 ; Litman et al. , 2009 ; Purandare and Litman , 2008 ; Steinhauser et al. , 2007 ) . Background train
We first identified the most informative unigrams and bigrams using the information gain measure ( <CITED HERE> ) , and then selected only the positive outcome predictors using odds ratio ( Mladenic and Grobelnik 1999 ) . Uses train
Recently , several alternative , often quite sophisticated approaches to collective classification have been proposed ( Neville and Jensen , 2000 ; Lafferty et al. , 2001 ; Getoor et al. , 2002 ; Taskar et al. , 2002 ; <CITED HERE> ; Taskar et al. , 2004 ; McCallum and Wellner , 2004 ) . Background val
It compares favorably to other stemming or root extraction algorithms ( <CITED HERE> ; Al-Shalabi and Evens , 1998 ; and Houmame , 1999 ) , with a performance of over 97 % for extracting the correct root in web documents , and it addresses the challenge of the Arabic broken plural and hollow verbs . Motivation train
We perceive that these results can be extended to other language models that properly embed bilexical context-free grammars , as for instance the more general history-based models used in ( <CITED HERE> ) and ( Chelba and Jelinek , 1998 ) . Future train
Clearly , what it takes for the adjective to be applicable has not been cast in stone , but is open to fiat : the speaker may decide that 8 cm is enough , or the speaker may set the standards higher ( cf. <CITED HERE> ) . Background train
Many investigators ( e.g. <CITED HERE> ; Elowitz et al. 1976 ; Luce et al. 1983 ; Cahn 1988 ) have suggested that the poor prosody of synthetic speech , in comparison with natural speech , is the primary factor leading to difficulties in the comprehension of fluent synthetic speech . Motivation train
The PERSIVAL project , the most comprehensive study of such techniques applied on medical texts to date , leverages patient records to generate personalized summaries in response to physicians ' queries ( McKeown , Elhadad , and Hatzivassiloglou 2003 ; <CITED HERE> ) . CompareOrContrast train
See <CITED HERE> for further discussion . Background train
We gather similar words using <CITED HERE>a ) , mining similar verbs from a comparable-sized parsed corpus , and collecting similar nouns from a broader 10 GB corpus of English text . We also use Keller and Lapata ( 2003 ) 's approach to obtaining web-counts . Uses train
Thus , for example , it can acquire a `` script '' such as the one for going to a restaurant as defined in <CITED HERE> . Background train
ones , DIRT ( <CITED HERE> ) , VerbOcean ( Chklovski and Pantel , 2004 ) , FrameNet ( Baker et al. , 1998 ) , and Wikipedia ( Mehdad et al. , 2010 ; Kouylekov et al. , 2009 ) . Background train
There is some literature on procedure acquisition such as the LISP synthesis work described in Biermann et al. ( 1984 ) and the PROLOG synthesis method of <CITED HERE> . CompareOrContrast train
The system uses a domain-specific content planner to produce input to the surface realizer based on the strategy decision , and a FUF/SURGE ( <CITED HERE> ) generation system to produce the appropriate text . Uses train
However , each of these fields requires further decoding and restructuring to provide client programs with easy access to the information they require ( see <CITED HERE> for further discussion ) . Background train
Our results also confirm the insights gained by <CITED HERE> , who observed that in cross-domain polarity analysis adding more training data is not always beneficial . CompareOrContrast train
We consider the Creative Commons model as the most suitable one to let each author choose the rights to reserve ( <CITED HERE> ) . Uses val
It compares favorably to other stemming or root extraction algorithms ( Yates and Neto , 1999 ; Al-Shalabi and Evens , 1998 ; and <CITED HERE> ) , with a performance of over 97 % for extracting the correct root in web documents , and it addresses the challenge of the Arabic broken plural and hollow verbs . Motivation train
In a similar vein , <CITED HERE> showed that a different feature-topic model improved predictions on a fill-in-the-blank task . Background val
As ( <CITED HERE> ) show , lexical information improves on NP and VP chunking as well . Future val
But while Bod 's estimator obtains state-of-the-art results on the WSJ , comparable to Charniak ( 2000 ) and Collins ( 2000 ) , Bonnema et al. 's estimator performs worse and is comparable to <CITED HERE> . Background train
In multi-party discussion people usually mention each other 's name for the purpose of disentanglement ( <CITED HERE> ) . Background train
Fortunately , indirect associations are usually not difficult to identify , because they tend to be weaker than the direct associations on which they are based ( <CITED HERE>c ) . Background train
The work of <CITED HERE> and Steedman , Osborne , et al. ( 2003 ) suggests that co-training can be helpful for statistical parsing . Background val
TNT refers to the HPSG parser ( <CITED HERE> ) , C++ implementation of the two-phase parsing algorithm that performs filtering with a compiled CFG ( phase 1 ) and then executes feature unification ( phase 2 ) . CompareOrContrast train
1 ° The body of a plan can be an action or sequence of actions , a goal or sequence 9 Moore and Paris also note that `` a generation system must maintain the kinds of information outlined by Grosz and Sidner '' ( <CITED HERE> , 203 ) . Background val
Surveys and articles on the topic include Lamarche and Retoré ( 1996 ) , de Groote and Retoré ( 1996 ) , and <CITED HERE> . Background train
Another line of research that is correlated with ours is recognition of agreement/disagreement ( Misra and Walker , 2013 ; Yin et al. , 2012 ; Abbott et al. , 2011 ; Andreas et al. , 2012 ; <CITED HERE> ; Hillard et al. , 2003 ) and classification of stances ( Walker et al. , 2012 ; Somasundaran and Wiebe , 2010 ) in online forums . CompareOrContrast val
The standard way to handle this problem is to handcraft a finite set of features which provides a sufficient summary of the unbounded history ( Ratnaparkhi , 1999 ; Collins , 1999 ; <CITED HERE> ) . CompareOrContrast train
Since the arguments can provide useful semantic information , the SRL is crucial to many natural language processing tasks , such as Question and Answering ( Narayanan and Harabagiu 2004 ) , Information Extraction ( <CITED HERE> ) , and Machine Translation ( Boas 2002 ) . Background train
Most approaches rely on VerbNet ( <CITED HERE> ) and FrameNet ( Baker et al. , 1998 ) to provide associations between verbs and semantic roles , that are then mapped onto the current instance , as shown by the systems competing in semantic role labelling competitions ( Carreras and Marquez , 2004 ; Carreras and Marquez , 2005 ) and also ( Gildea and Jurafsky , 2002 ; Pradhan et al. , 2005 ; Shi and Mihalcea , 2005 ) . Background train
In previous work ( Bachenko et al. 1986 ) , we described an experimental text-to-speech system that determined prosodic phrasing for the Olive -- Liberman synthesizer ( <CITED HERE> ) . Background train
Note that this ensures that greater importance is attributed to longer chunks , as is usual in most EBMT systems ( cf. Sato and Nagao 1990 ; Veale and Way 1997 ; <CITED HERE> ) . As an example , consider the translation into French of the house collapsed . Background train
This approach is taken , for example , in LKB ( Copestake 1992 ) where lexical rules are introduced on a par with phrase structure rules and the parser makes no distinction between lexical and nonlexical rules ( <CITED HERE> , 31 ) . CompareOrContrast train
<CITED HERE> argue that semi-productivity of lexical rules , which can be understood as a generalization of exceptions to lexical rules , can be integrated with our approach by assigning probabilities to the automaton associated with a particular lexical entry . Background train
Results from other systems show that measures of semantic coherence between a student and a system were positively associated with higher learning gain ( <CITED HERE> ) . Future train
These automatic transformations are based on linguistic rules ( <CITED HERE> ) . Uses train
• Only an automatic evaluation was performed , which relied on having model responses ( Berger and Mittal 2000 ; <CITED HERE> ) . CompareOrContrast train
In comparison , the tag set of the Buckwalter Morphological Analyzer ( <CITED HERE> ) used in the PATB has a core POS set of 44 tags ( CORE44 ) before morphological extension . Cross-linguistically , a core set containing around 12 tags is often CompareOrContrast train
• use of low level knowledge from the speech recognition phase , • use of high level knowledge about the domain in particular and the dialogue task in general , • a `` continue '' facility and an `` auto-loop '' facility as described by Biermann and Krishnaswamy ( 1976 ) , • a `` conditioning '' facility as described by <CITED HERE> , • implementation of new types of paraphrasing , • checking a larger environment in the expectation acquisition algorithm when deciding if an incoming sentence is the same or similar to one already seen , and • examining inter-speaker dialogue patterns . Future train
<CITED HERE> present detailed studies on the task of named entity recognition , which discusses and compares different methods on multiple aspects including chunk representation , inference method , utility of non-local features , and integration of external knowledge . Background train
mers ( <CITED HERE> ; Porter , 1980 ) demonstrably improve retrieval performance . Background train
Cases like this would be covered if the decision-theoretic property of Pareto optimality ( e.g. , <CITED HERE> ) was used as the sole criterion : Formally , an object r ∈ C has a Pareto-optimal combination of Values V iff there is no other x ∈ C such that Background train
A formula for the test set perplexity ( <CITED HERE> ) is : Background train
This method follows a traditional Information Retrieval paradigm ( <CITED HERE> ) , where a query is represented by the content terms it contains , and the system retrieves from the corpus a set of documents that best match this query . Uses train
Table 1 gives the interpretations of eight adjective-noun combinations discussed in <CITED HERE> and Vendler ( 1968 ) . Uses train
The Penn Treebank results reported here for the Markov model approach are at least equivalent to those reported for the Maximum Entropy approach in ( <CITED HERE> ) . CompareOrContrast train
<CITED HERE> , by comparison , employ 163 distinct predefined frames . Background train
Brockmann and Lapata ( 2003 ) have shown that WordNet-based approaches do not always outperform simple frequency-based models , and a number of techniques have been recently proposed which may offer ideas for refining our current unsupervised approach ( Erk , 2007 ; <CITED HERE> ) . Future train
fθ on demand ( <CITED HERE> ) can pay off here , since only part of fθ may be needed subsequently . ) Background val
An approach ( also based on regulation of the succession of rule application ) to the associated problem of spurious ambiguity is given in <CITED HERE> but again , to our knowledge , there is no predictive relation between incremental combinatory processing and the kind of processing phenomena cited in the introduction . Background train
This equivalence is doing essentially the same job as Pereira 's pronoun abstraction schema in <CITED HERE> . CompareOrContrast train
Finally , we experiment with a method for combining phrase tables proposed in ( <CITED HERE> ; Nakov and Ng , 2012 ) . Uses train
Our work is more similar to NLG work that concentrates on structural constraints such as generative poetry ( Greene et al. , 2010 ) ( Colton et al. , 2012 ) ( Jiang and Zhou , 2008 ) or song lyrics ( <CITED HERE> ) ( Ramakrishnan A et al. , 2009 ) , where specified meter or rhyme schemes are enforced . CompareOrContrast train
For the cases where retrieval took place , we used F-score ( van Rijsbergen 1979 ; <CITED HERE> ) to determine the similarity between the response from the top-ranked document and the real response ( the formulas for F-score and its contributing factors , recall and precision , appear in Section 4.2 ) . Uses val
These translations gave rise to a number of automatically constructed linguistic resources : ( 1 ) the original ( source , target ) phrasal translation pairs , ( 2 ) the marker lexicon , ( 3 ) the gen11 Thanks are due to one of the anonymous reviewers for pointing out that our wEBMT system , seeded with input from multiple translation systems , with a postvalidation process via the Web ( amounting to an n-gram target language model ) , in effect forms a multiengine MT system as described by <CITED HERE> , Frederking et al. ( 1994 ) , and Hogan and Frederking ( 1998 ) . CompareOrContrast val
This includes work on generalized expectation ( Mann and McCallum , 2010 ) , posterior regularization ( Ganchev et al. , 2010 ) and constraint driven learning ( Chang et al. , 2007 ; <CITED HERE> ) . Background val
The list , a synthesis of a number of relation lists cited in the literature , has been designed to be general , domain-independent ( <CITED HERE>a ) . Motivation val
Unlike our approach , those of <CITED HERE> and Hockenmaier , Bierner , and Baldridge ( 2004 ) include a substantial initial correction and clean-up of the Penn-II trees . CompareOrContrast train
2 The algorithm was implemented by the authors , following the description in <CITED HERE> . Uses train
Our motivation for generation of material for language education exists in work such as Sumita et al. ( 2005 ) and <CITED HERE> , which deal with automatic generation of classic fill in the blank questions . Motivation train
In most recent research , NEs ( person , location and organisations ) are extracted from the text and used as a source of evidence to calculate the similarity between documents - see for instance ( <CITED HERE> ; Chen and Martin , 2007 ; Popescu and Magnini , 2007 ; Kalashnikov et al. , 2007 ) . Background train
Many researchers use the GIZA + + software package ( <CITED HERE> ) as a black box , selecting IBM Model 4 as a compromise between alignment quality and efficiency . Background train
Some works abstract perception via the usage of symbolic logic representations ( <CITED HERE> ; Chen and Mooney , 2011 ; Matuszek et al. , 2012 ; Artzi and Zettlemoyer , 2013 ) , while others choose to employ concepts elicited from psycholinguistic and cognition studies . Background train
Over the past decade , researchers at IBM have developed a series of increasingly sophisticated statistical models for machine translation ( Brown et al. , 1988 ; <CITED HERE> ; Brown et al. , 1993a ) . Background train
One area of current interest concerns the left-to-right arrangement of premodifying adjectives within an NP ( e.g. , <CITED HERE> ; Malouf 2000 ) . Background train
Models of translational equivalence that are ignorant of indirect associations have `` a tendency ... to be confused by collocates '' ( <CITED HERE> ) . Background train
Note that although our feature set was drawn primarily from our prior uncertainty detection experiments ( Forbes-Riley and Litman , 2011a ; <CITED HERE> ) , we have also experimented with other features , including state-of-the-art acoustic-prosodic features used in the last Interspeech Challenges ( Schuller et al. , 2010 ; Schuller et al. , 2009b ) and made freely available in the openSMILE Toolkit ( Florian et al. , 2010 ) . Extends train
For this mention-pair coreference model φ ( u , v ) , we use the same set of features used in <CITED HERE> . Uses train
Compared to the reranking technique in <CITED HERE> , who obtained an LP of 89.9 % and an LR of 89.6 % , our results show a 9 % relative error rate reduction . CompareOrContrast train
The paper compares and contrasts the training time needed and performance achieved by our modified learner with two other systems : a standard transformation-based learner , and the ICA system ( <CITED HERE> ) . CompareOrContrast train
There has also been work focused upon determining the political leaning ( e.g. , `` liberal '' vs. `` conservative '' ) of a document or author , where most previously-proposed methods make no direct use of relationships between the documents to be classified ( the `` unlabeled '' texts ) ( Laver et al. , 2003 ; <CITED HERE> ; Mullen and Malouf , 2006 ) . Background train
It projects a functional head , voice ( <CITED HERE> ) , whose specifier is the external argument . Background train
Finite state transducers , which can be learned from bilingual corpora , have been proposed for automatic translation ( Amengual et al. , 2000 ) , as have been bilingual stochastic grammars ( <CITED HERE> ) . Background train
This is noticeable for German ( Brants et al. , 2002 ) and Portuguese ( <CITED HERE> ) , which still have high overall accuracy thanks to very high attachment scores , but much more conspicuous for Czech ( Böhmová et al. , 2003 ) , Dutch ( van der Beek et al. , 2002 ) and Slovene ( Džeroski et al. , 2006 ) , where root precision drops more drastically to about 69 % , 71 % and 41 % , respectively , and root recall is also affected negatively . CompareOrContrast train
3 The degree of precision of the measurement ( <CITED HERE> , Section 1.5 ) determines which objects can be described by the GRE algorithm , since it determines which objects count as having the same size . Background val
There have already been several attempts to develop distributed NLP systems for dialogue systems ( <CITED HERE> ) and speech recognition ( Hacioglu and Pellom , 2003 ) . Background train
Table look-up using an explicit translation lexicon is sufficient and preferable for many multilingual NLP applications , including `` crummy '' MT on the World Wide Web ( Church & Hovy , 1993 ) , certain machine-assisted translation tools ( e.g. ( <CITED HERE> ; Melamed , 1996b ) ) , concordancing for bilingual lexicography ( Catizone et al. , 1993 ; Gale & Church , 1991 ) , computer-assisted language learning , corpus linguistics ( Melby . Background val
SWIZZLE is a multilingual enhancement of COCKTAIL ( <CITED HERE> ) , a coreference resolution system that operates on a mixture of heuristics that combine semantic and textual cohesive information . Extends train
( Davis and Ogden , 1997 ; <CITED HERE> ; Hull and Grefenstette , 1996 ) . CompareOrContrast train
1 This is , for example , the case for all proposals working with verbal lexical entries that raise the arguments of a verbal complement ( Hinrichs and Nakazawa 1989 ) that also use lexical rules such as the Complement Extraction Lexical Rule ( <CITED HERE> ) or the Complement Cliticization Lexical Rule ( Miller and Sag 1993 ) to operate on those raised elements . Background train
<CITED HERE> attempt to translate technical terms using word relation matrices , although the resource from which such relations are derived is a pair of nonparallel corpora . Background train
The system is in the form of an agenda-driven chart-based parser whose foundation is similar to previous formalizations of Chomsky 's Minimalist Program ( Stabler , 1997 ; <CITED HERE> ; Niyogi , 2001 ) . CompareOrContrast val
Each component will return a confidence measure of the reliability of its prediction , cf. ( <CITED HERE> ) . Motivation train
In fact , most of the features3 implemented in existing coreference resolution systems rely solely on mention heads ( <CITED HERE> ) . Background val
The problem of handling ill-formed input has been studied by Carbonell and Hayes ( 1983 ) , <CITED HERE> , Jensen et al. ( 1983 ) , Kwasny and Sondheimer ( 1981 ) , Riesbeck and Schank ( 1976 ) , Thompson ( 1980 ) , Weischedel and Black ( 1980 ) , and Weischedel and Sondheimer ( 1983 ) . CompareOrContrast train
In most recent research , NEs ( person , location and organisations ) are extracted from the text and used as a source of evidence to calculate the similarity between documents - see for instance ( Blume , 2005 ; <CITED HERE> ; Popescu and Magnini , 2007 ; Kalashnikov et al. , 2007 ) . Background train
This conception of lexical rules thus can be understood as underlying the computational approach that treats lexical rules as unary phrase structure rules as , for example , adopted in the LKB system ( <CITED HERE> ) . Background train
Typical letter-to-sound rule sets are those described by Ainsworth ( 1973 ) , McIlroy ( 1973 ) , Elovitz et al. ( 1976 ) , Hunnicutt ( 1976 ) , and <CITED HERE> . Background train
<CITED HERE> describe an efficient algorithm ( of linear complexity in the number of training sentences ) for computing the LDA transform matrix , which entails computing the within- and between-covariance matrices of the classes , and using Singular Value Decomposition ( SVD ) to compute the eigenvectors of the new space . Uses val
For better comparison with work of others , we adopt the suggestion made by <CITED HERE> to evaluate the parsing quality on sentences up to 70 tokens long . Uses val
Some examples include text categorization ( <CITED HERE> ) , base noun phrase chunking ( Ngai and Yarowsky 2000 ) , part-of-speech tagging ( Engelson and Dagan 1996 ) , spelling confusion set disambiguation ( Banko and Brill 2001 ) , and word sense disambiguation ( Fujii et al. 1998 ) . Background val
Differently , <CITED HERE> designed a sampler to infer an STSG by fixing the tree structure and exploring the space of alignment . Motivation val
They use a Bag of Visual Words ( BoVW ) model ( <CITED HERE> ) to create a bimodal vocabulary describing documents . Background train
W. <CITED HERE> discussed sentences of the form * This is a chair but you can sit on it . Background train
This imbalance foils thresholding strategies , clever as they might be ( <CITED HERE> ; Wu & Xia , 1994 ; Chen , 1996 ) . Background train
The problem of handling ill-formed input has been studied by Carbonell and Hayes ( 1983 ) , Granger ( 1983 ) , Jensen et al. ( 1983 ) , Kwasny and Sondheimer ( 1981 ) , Riesbeck and Schank ( 1976 ) , <CITED HERE> , Weischedel and Black ( 1980 ) , and Weischedel and Sondheimer ( 1983 ) . CompareOrContrast train
Prototypes of Internet search engines for linguists , corpus linguists and lexicographers have been proposed : WebCorp ( Kehoe and Renouf , 2002 ) , KWiCFinder ( Fletcher , 2004a ) and the Linguist 's Search Engine ( <CITED HERE> ; Resnik and Elkiss , 2003 ) . Background val
In most recent research , NEs ( person , location and organisations ) are extracted from the text and used as a source of evidence to calculate the similarity between documents - see for instance ( Blume , 2005 ; Chen and Martin , 2007 ; Popescu and Magnini , 2007 ; <CITED HERE> ) . Background train
More recently , Silberer et al. ( 2013 ) show that visual attribute classifiers , which have been immensely successful in object recognition ( <CITED HERE> ) , act as excellent substitutes for feature Background train
In other methods , lexical resources are specifically tailored to meet the requirements of the domain ( <CITED HERE> ) or the system ( Gomez , 1998 ) . Background val
Such systems extract information from some types of syntactic units ( clauses in ( Fillmore and Atkins , 1998 ; <CITED HERE> ; Hull and Gomez , 1996 ) ; noun phrases in ( Hull and Gomez , 1996 ; Rosario et al. , 2002 ) ) . Background val
The third version ( VOYAGER ) serves as an interface both with a recognizer and with a functioning database back-end ( <CITED HERE> ) . Uses train
We use the non-projective k-best MST algorithm to generate k-best lists ( <CITED HERE> ) , where k = 8 for the experiments in this paper . Uses train
As for work on Arabic ( MSA ) , results have been reported on the PATB ( Kulick , Gabbard , and Marcus 2006 ; Diab 2007 ; Green and Manning 2010 ) , the Prague Dependency Treebank ( PADT ) ( Buchholz and Marsi 2006 ; <CITED HERE> ) and the CATiB ( Habash and Roth 2009 ) . Background train
For example , McKnight and Srinivasan ( 2003 ) describe a machine learning approach to automatically label sentences as belonging to introduction , methods , results , or conclusion using structured abstracts as training data ( see also <CITED HERE> ) . Background train
• Graph transformations for recovering nonprojective structures ( <CITED HERE> ) . Uses train
<CITED HERE> 's CCM is an unlabeled bracketing model that generates the span of part-of-speech tags that make up each constituent and the pair of tags surrounding each constituent span ( as well as the spans and contexts of each non-constituent ) . Background train
This process produces a hierarchical clustering of the word types in the corpus , and these clusterings have been found useful in many applications ( <CITED HERE> ; Koo et al. , 2008 ; Miller et al. , 2004 ) . Motivation train
Opposition ( called `` adversative '' or `` contrary-to-expectation '' by Halliday and Hasan 1976 ; cf. also <CITED HERE> , p. 672 ) . Background train
Aside from the extraction of theory-neutral subcategorization lexicons , there has also been work in the automatic construction of lexical resources which comply with the principles of particular linguistic theories such as LTAG , CCG , and HPSG ( <CITED HERE> ; Xia 1999 ; Hockenmaier , Bierner , and Baldridge 2004 ; Nakanishi , Miyao , and Tsujii 2004 ) . Background train
1 The representation in <CITED HERE> is even more compact than ours for grammars that are not self-embedding . CompareOrContrast train
29 This improvement of the covariation encoding can also be viewed as an instance of the program transformation technique referred to as deletion of clauses with a finitely failed body ( <CITED HERE> ) . CompareOrContrast train
The reordering models we describe follow our previous work using function word models for translation ( <CITED HERE> ; Setiawan et al. , 2009 ) . Extends train
Although there are other discussions of the paragraph as a central element of discourse ( e.g. Chafe 1979 , <CITED HERE> , Longacre 1979 , Haberlandt et al. 1980 ) , all of them share a certain limitation in their formal techniques for analyzing paragraph structure . CompareOrContrast train
For example , such schema can serve as a means to represent translation examples , or find structural correspondences for the purpose of transfer grammar learning ( Menezes & Richardson , 2001 ) , ( Aramaki et al. , 2001 ) , ( Watanabe et al. , 2000 ) , ( Meyers et al. , 2000 ) , ( Matsumoto et al. , 1993 ) , ( Kaji et al. , 1992 ) , and example-based machine translation EBMT3 ( Sato & Nagao , 1990 ) , ( Sato , 1991 ) , ( Richardson et al. , 2001 ) , ( <CITED HERE> ) . Background train
In corpus linguistics building such megacorpora is beyond the scope of individual researchers , and they are not easily accessible ( Kennedy , 1998 : 56 ) unless the web is used as a corpus ( <CITED HERE> ) . Background train
Here 11 is an optimization precision , oc is a step size chosen with the strong Wolfe 's rule ( <CITED HERE> ) . Uses train
See ( <CITED HERE> ) for a discussion . Background train
Figure 2 ( a ) shows the frame-based semantic representation for the utterance `` What time is Analyze This playing 2 See ( <CITED HERE> ) for how MIMIC 's dialogue-level knowledge is used to override default prosodic assignments for concept-to-speech generation . Background train
For the Berkeley system , we use the reported results from <CITED HERE> . CompareOrContrast train
Instead , we will adopt the nomenclature of the Automatic Content Extraction program ( <CITED HERE> ) : we will call the instances of textual references to objects/abstractions mentions , which can be either named ( e.g. John Mayor ) , nominal ( the president ) or pronominal ( she , it ) . Uses train
Such a component would serve as the first stage of a clinical question answering system ( <CITED HERE> ) or summarization system ( McKeown et al. , 2003 ) . Future train
In the areas of Natural Language Processing ( NLP ) and computational linguistics , proposals have been made for using the computational Grid for data-intensive NLP and text-mining for eScience ( <CITED HERE> ; Hughes et al. , 2004 ) . Background train
<CITED HERE> comprehensively compares different approaches to complementation within grammatical theory providing a touchstone against which the LDOCE scheme can be evaluated . CompareOrContrast val
To prove that our method is effective , we also make a comparison between the performances of our system and <CITED HERE> , Xue ( 2008 ) . CompareOrContrast val
The obtained SCFG is further used in a phrase-based and hierarchical phrase-based system ( <CITED HERE> ) . CompareOrContrast train
32 In certain cases an extension of the constraint language with named disjunctions or contexted constraints ( Maxwell and Kaplan 1989 ; Eisele and Dorre 1990 ; <CITED HERE> ) can be used to circumvent constraint propagation . Background train
<CITED HERE> , 2009 , 2010 ) utilized Bayesian methods to learn synchronous context free grammars ( SCFG ) from a parallel corpus . CompareOrContrast val
Since the arguments can provide useful semantic information , the SRL is crucial to many natural language processing tasks , such as Question and Answering ( <CITED HERE> ) , Information Extraction ( Surdeanu et al. 2003 ) , and Machine Translation ( Boas 2002 ) . Background train
In addition to the model based upon a dictionary of stems and words , we also experimented with models based upon character n-grams , similar to those used for Chinese segmentation ( <CITED HERE> ) . CompareOrContrast train
Notable early papers on graph-based semisupervised learning include Blum and Chawla ( 2001 ) , Bansal et al. ( 2002 ) , Kondor and Lafferty ( 2002 ) , and <CITED HERE> . Background train
Due to using a global model like CRFs , our previous work in ( Zhao et al. , 2006 ; <CITED HERE>c ) reported the best results over the evaluated corpora of Bakeoff-2 until now7 . CompareOrContrast val
As already mentioned in the literature , see for example ( <CITED HERE> ) , knowledge about implicit predicates could be potentially useful for a variety of NLP tasks such as language generation , information extraction , question answering or machine translation . Background train
We use the Columbia Arabic Treebank ( CATiB ) ( <CITED HERE> ) . Uses train
This problem may be similar to the situation in which current formal grammars allow nonsensical but parsable collections of words ( e.g. , `` colorless green ideas ... '' ) , while before the advent of Chomskyan formalisms , a sentence was defined as the smallest meaningful collection of words ; <CITED HERE> , p. 546 ) gives 10 definitions of a sentence . Background train
Another technique for making better use of unlabeled data is cotraining ( <CITED HERE> ) , in which two sufficiently different learners help each other learn by labeling training data for one another . Background val
For example , modeling CASE in Czech improves Czech parsing ( <CITED HERE> ) : CASE is relevant , not redundant , and can be predicted with sufficient accuracy . Motivation train
Table look-up using an explicit translation lexicon is sufficient and preferable for many multilingual NLP applications , including `` crummy '' MT on the World Wide Web ( Church & Hovy , 1993 ) , certain machine-assisted translation tools ( e.g. ( Macklovitch , 1994 ; Melamed , 1996b ) ) , concordancing for bilingual lexicography ( Catizone et al. , 1993 ; <CITED HERE> ) , computer-assisted language learning , corpus linguistics ( Melby . Background train
Following our previous work ( <CITED HERE> ; Althaus , Karamanis , and Koller 2004 ) , the input to information ordering is an unordered set of information-bearing items represented as CF lists . Extends train
There has been some controversy , at least for simple stemmers ( Lovins , 1968 ; Porter , 1980 ) , about the effectiveness of morphological analysis for document retrieval ( Harman , 1991 ; Krovetz , 1993 ; <CITED HERE> ) . Background train
This situation suggests a response-automation approach that follows the document retrieval paradigm ( <CITED HERE> ) , where a new request is matched with existing response documents ( e-mails ) . Background train
All current approaches to monolingual TE , either syntactically oriented ( Rus et al. , 2005 ) , or applying logical inference ( Tatu and <CITED HERE> ) , or adopting transformation-based techniques ( Kouylekov and Magnini , 2005 ; Bar-Haim et al. , 2008 ) , incorporate different types of lexical knowledge to support textual inference . Background val
This is the approach taken by IBM Models 4 + ( Brown et al. 1993b ; Och and Ney 2003 ) , and more recently by the LEAF model ( <CITED HERE> ) . CompareOrContrast train
We have noted that many of these desiderata make complex question answering quite similar to multi-document summarization ( <CITED HERE>b ) , but these features are also beyond the capabilities of current summarization systems . CompareOrContrast train
<CITED HERE>:472 ) , but these are the only ones which are explicit in the LDOCE coding system . Background train
This paper describes an approach for sharing resources in various grammar formalisms such as Feature-Based Lexicalized Tree Adjoining Grammar ( FB-LTAG1 ) ( Vijay-Shanker , 1987 ; Vijay-Shanker and Joshi , 1988 ) and Head-Driven Phrase Structure Grammar ( HPSG ) ( <CITED HERE> ) by a method of grammar conversion . Background train
These constructs correspond as directly as possible to properties of the linguistic structure that express them and are , to as small an extent as possible , dependent on the requirements of contextual resolution ( unlike , say , the metavariables of standard QLFs [ <CITED HERE> ] , or the labels of UDRS [ Reyle 1996 ] , which are motivated entirely by the mechanisms that operate on them after grammatical processing ) . Background train
Prototypes of Internet search engines for linguists , corpus linguists and lexicographers have been proposed : WebCorp ( Kehoe and Renouf , 2002 ) , KWiCFinder ( Fletcher , 2004a ) and the Linguist 's Search Engine ( Kilgarriff , 2003 ; <CITED HERE> ) . Background val
Table look-up using an explicit translation lexicon is sufficient and preferable for many multilingual NLP applications , including `` crummy '' MT on the World Wide Web ( Church & Hovy , 1993 ) , certain machine-assisted translation tools ( e.g. ( Macklovitch , 1994 ; <CITED HERE>b ) ) , concordancing for bilingual lexicography ( Catizone et al. , 1993 ; Gale & Church , 1991 ) , computer-assisted language learning , corpus linguistics ( Melby . Background train
An off-the-shelf speech recognition device , a Nippon Electric Corporation DP-200 , was added to an existing natural language processing system , the Natural Language Computer ( NLC ) ( Ballard 1979 , <CITED HERE> ) . Background train
Some efforts have tackled tasks such as automatic image caption generation ( Feng and Lapata , 2010a ; Ordonez et al. , 2011 ) , text illustration ( Joshi et al. , 2006 ) , or automatic location identification of Twitter users ( Eisenstein et al. , 2010 ; <CITED HERE> ; Roller et al. , 2012 ) . Background train
The system was trained on the Penn Treebank ( Marcus et al. , 1993 ) WSJ Sections 2-21 and tested on Section 23 ( Table 1 ) , same as used by <CITED HERE> , Collins ( 1997 ) , and Ratnaparkhi ( 1997 ) , and became a common testbed . CompareOrContrast train
<CITED HERE> predefine 163 verbal subcategorization frames , obtained by manually merging the classes exemplified in the COMLEX ( MacLeod , Grishman , and Meyers 1994 ) and ANLT ( Boguraev et al. 1987 ) dictionaries and adding around 30 frames found by manual inspection . Background train
Recently , several alternative , often quite sophisticated approaches to collective classification have been proposed ( <CITED HERE> ; Lafferty et al. , 2001 ; Getoor et al. , 2002 ; Taskar et al. , 2002 ; Taskar et al. , 2003 ; Taskar et al. , 2004 ; McCallum and Wellner , 2004 ) . Background val
<CITED HERE> give a sufficiently general finite-state framework to allow this : weights may fall in any set K ( instead of R ) . Uses train
FBLTAG ( Vijay-Shanker , 1987 ; <CITED HERE> ) is an extension of the LTAG formalism . Background train
There are several variations of such a method ( Ballesteros and Croft , 1998 ; Pirkola , 1998 ; <CITED HERE> ) . CompareOrContrast train
12 In order to focus on the computational aspects of the covariation approach , in this paper we will not go into a discussion of the full lexical rule specification language introduced in <CITED HERE> . Background val
Many investigators ( e.g. Allen 1976 ; Elovitz et al. 1976 ; Luce et al. 1983 ; <CITED HERE> ) have suggested that the poor prosody of synthetic speech , in comparison with natural speech , is the primary factor leading to difficulties in the comprehension of fluent synthetic speech . Motivation train
This Principle of Finitism is also assumed by <CITED HERE> , Jackendoff ( 1983 ) , Kamp ( 1981 ) , and implicitly or explicitly by almost all researchers in computational linguistics . CompareOrContrast train
We performed translation experiments with an implementation of the IBM-4 translation model ( <CITED HERE> ) . Uses train
In this paper , a flexible annotation schema called Structured String-Tree Correspondence ( SSTC ) ( <CITED HERE> ) will be introduced to capture a natural language text , its corresponding abstract linguistic representation and the mapping ( correspondence ) between these two . Background val
For the development of these lists we used a collection of texts of about 300,000 words derived from the New York Times ( NYT ) corpus that was supplied as training data for the 7th Message Understanding Conference ( MUC-7 ) ( <CITED HERE> ) . Background train
For the full parser , we use the one developed by Michael Collins ( Collins , 1996 ; <CITED HERE> ) -- one of the most accurate full parsers around . Uses train
More sophisticated approaches have been proposed ( <CITED HERE> ) , including an extension that , in an interesting reversal of our problem , makes use of sentiment-polarity indicators within speech segments ( Galley et al. , 2004 ) . Background train
<CITED HERE> have demonstrated that differential weighting of automatically labeled sections can lead to improved retrieval performance . Background train
The one-sided t-test ( <CITED HERE> ) at significance level 0.05 indicated that the improvement on Trec5C is not statistically significant . Uses train
stemmers ( Lovins , 1968 ; <CITED HERE> ) demonstrably improve retrieval performance . Background train
It is therefore no surprise that early attempts at response automation were knowledge-driven ( <CITED HERE> ; Watson 1997 ; Delic and Lahaix 1998 ) . Background train
The framework was originally developed for the realization of deep-syntactic structures in NLG ( <CITED HERE> ) . Background train
These features are very much desired in the design of an annotation scheme , in particular for the treatment of linguistic phenomena , which are non-standard , e.g. crossed dependencies ( <CITED HERE> ) . Background train
Few approaches to parsing have tried to handle disfluent utterances ( notable exceptions are Core & Schubert , 1999 ; <CITED HERE> ; Nakatani & Hirschberg , 1994 ; Shriberg , Bear , & Dowding , 1992 ) . Background train
5 The open source Moses ( <CITED HERE> ) toolkit from www.statmt.org/moses/ . Uses train
There have been several efforts aimed at developing a domain-independent method for generating responses from a frame representation of user requests ( <CITED HERE> ; Chu-Carroll , 1999 ) . Future val
The maximum entropy approach ( <CITED HERE> ) presents a powerful framework for the combination of several knowledge sources . Uses train
Other molecular biology databases We also included several model organism databases or nomenclature databases in the construction of the dictionary , i.e. , mouse -- Mouse Genome Database ( MGD ) [ 18 ] , fly -- FlyBase [ 19 ] , yeast -- Saccharomyces Genome Database ( SGD ) <CITED HERE> , rat -- Rat Genome Database ( RGD ) [ 21 ] , worm -- WormBase [ 22 ] , Human Nomenclature Database ( HUGO ) [ 23 ] , Online Mendelian Inheritance in Man ( OMIM ) [ 24 ] , and Enzyme Nomenclature Database ( ECNUM ) [ 25 , 26 ] . Uses val
We then use the program Snob ( Wallace and Boulton 1968 ; <CITED HERE> ) to cluster these experiences . Uses train
However , studies have shown that existing systems for searching MEDLINE ( such as PubMed , the search service provided by the National Library of Medicine ) are often inadequate and unable to supply clinically relevant answers in a timely manner ( Gorman , Ash , and Wykoff 1994 ; <CITED HERE> ) . Background train
Our implementation of the NP-based QA system uses the Empire noun phrase finder , which is described in detail in <CITED HERE> . Uses train
For all the experiments reported in this article , we used the training portion of PATB Part 3 v3.1 ( <CITED HERE> ) , converted to the CATiB Treebank format , as mentioned in Section 2.5 . Uses train
In addition , we consider several types of lexical features ( LexF ) inspired by previous work on agreement and disagreement ( Galley et al. , 2004 ; <CITED HERE> ) . Motivation val
All communicative head gestures in the videos were found and annotated with ANVIL using a subset of the attributes defined in the MUMIN annotation scheme ( <CITED HERE> ) . Uses train
Specifically , we examine the strength of association between the verb and the noun constituent of a combination ( the target expression or its lexical variants ) as an indirect cue to its idiomaticity , an approach inspired by <CITED HERE> . Motivation train
The current system learns finite state flowcharts whereas typical learning systems usually acquire coefficient values as in Minsky and Papert ( 1969 ) , assertional statements as in Michalski ( 1980 ) , or semantic nets as in <CITED HERE> . CompareOrContrast val
More recently , an alignment selection approach was proposed in ( <CITED HERE> ) , which computes confidence scores for each link and prunes the links from multiple sets of alignments using a hand-picked threshold . CompareOrContrast train
<CITED HERE> for example discusses a method where a syntactic parse of the text is performed and the context of a word is modeled using dependency triples . Background train
A variety of statistical methods were proposed over the recent years for learning to produce a full parse of free-text sentences ( e.g. , Bod ( 1992 ) , Magerman ( 1995 ) , <CITED HERE> , Ratnaparkhi ( 1997 ) , and Sekine ( 1998 ) ) . Background train
The strategies employed when MIMIC has only dialogue initiative are similar to the mixed initiative dialogue strategies employed by many existing spoken dialogue systems ( e.g. , ( Bennacef et al. , 1996 ; <CITED HERE> ) ) . CompareOrContrast train
7 All our results are computed with the evalb program following the now-standard criteria in ( <CITED HERE> ) . Uses train
We see no good reason , however , why such text spans should necessarily be sentences , since the majority of tagging paradigms ( e.g. , Hidden Markov Model [ HMM ] [ <CITED HERE> ] , Brill 's [ Brill 1995a ] , and MaxEnt [ Ratnaparkhi 1996 ] ) do not attempt to parse an entire sentence and operate only in the local window of two to three tokens . CompareOrContrast train
results are based on a corpus of movie subtitles ( <CITED HERE> ) , and are consequently shorter sentences , whereas the En → Es results are based on a corpus of parliamentary proceedings ( Koehn 2005 ) . Uses train
The system is implemented based on ( Galley et al. , 2006 ) and ( <CITED HERE> ) . Uses train
Also relevant is work on the general problems of dialog-act tagging ( Stolcke et al. , 2000 ) , citation analysis ( Lehnert et al. , 1990 ) , and computational rhetorical analysis ( <CITED HERE> ; Teufel and Moens , 2002 ) . Background val
For projective parsing , it is significantly faster than exact dynamic programming , at the cost of small amounts of search error . We are interested in extending these ideas to phrase-structure and lattice parsing , and in trying other higher-order features , such as those used in parse reranking ( <CITED HERE> ; Huang , 2008 ) and history-based parsing ( Nivre and McDonald , 2008 ) . Future val
Word alignments are used primarily for extracting minimal translation units for machine translation ( MT ) ( e.g. , phrases [ Koehn , Och , and Marcu 2003 ] and rules [ Galley et al. 2004 ; <CITED HERE> ] ) as well as for Background train
Sridhar et al. ( 2009 ) obtain promising results in dialogue act tagging of the Switchboard-DAMSL corpus using lexical , syntactic and prosodic cues , while <CITED HERE> examine the relation between particular acoustic and prosodic turn-yielding cues and turn taking in a large corpus of task-oriented dialogues . Background train
We have also applied our more general unification grammar acquisition methodology to the TIGER Treebank ( <CITED HERE> ) and Penn Chinese Treebank ( Xue , Chiou , and Palmer 2002 ) , extracting wide-coverage , probabilistic LFG grammar Uses train
We use the structures previously used by <CITED HERE> , and propose one new structure . Uses train
Two exceptions to this generalisation are the Linguistic String Project ( Sager , 1981 ) and the IBM CRITIQUE ( formerly EPISTLE ) Project ( Heidorn et al. , 1982 ; <CITED HERE> ) ; the former employs a dictionary of approximately 10,000 words , most of which are specialist medical terms , the latter has well over 100,000 entries , gathered from machine readable sources . CompareOrContrast train
A number of proposals in the 1990s deliberately limited the extent to which they relied on domain and/or linguistic knowledge and reported promising results in knowledge-poor operational environments ( Dagan and Itai 1990 , 1991 ; Lappin and Leass 1994 ; Nasukawa 1994 ; Kennedy and Boguraev 1996 ; Williams , Harvey , and Preston 1996 ; <CITED HERE> ; Mitkov 1996 , 1998b ) . Background train
Towards this aim , a flexible annotation structure called Structured String-Tree Correspondence ( SSTC ) was introduced in <CITED HERE> to record the string of terms , its associated representation structure and the mapping between the two , which is expressed by the sub-correspondences recorded as part of a SSTC . Background val
We have built an experimental text-to-speech system that uses our analysis of prosody to generate phrase boundaries for the Olive -- Liberman synthesizer ( <CITED HERE> ) . Uses train
In this paper , I present a computational implementation of Distributed Morphology ( <CITED HERE> ) , a non-lexicalist linguistic theory that erases the distinction between syntactic derivation and morphological derivation . Uses train
Some recent GRE algorithms have done away with the separation between content determination and linguistic realization , interleaving the two processes instead ( Stone and Webber 1998 ; <CITED HERE> ) . CompareOrContrast train
Subsequent processing by the natural language and response generation components was done automatically by the computer ( <CITED HERE> ) . Uses train
This is implemented as a cascade of simple strategies , which were briefly described in <CITED HERE> . Uses val
But , obviously , there are other possibilities -- for instance , the discourse representation structures ( DRS 's ) of <CITED HERE> , which have been used to translate a subset of English into logical formulas , to model text ( identified with a list of sentences ) , to analyze a fragment of English , and to deal with anaphora . CompareOrContrast train
Better results would be expected by combining the PCFG-LA parser with discriminative reranking approaches ( Charniak and Johnson , 2005 ; <CITED HERE> ) for self training . Future train
Intermedia is no longer developed and none of us had the opportunity to try it ( <CITED HERE> ) . Background val
Morphological alterations of a search term have a negative impact on the recall performance of an information retrieval ( IR ) system ( Choueka , 1990 ; Jäppinen and Niemistö , 1988 ; <CITED HERE> ) , since they preclude a direct match between the search term proper and its morphological variants in the documents to be retrieved . Background train
ment ( <CITED HERE> ; Doran et al. , 2000 ; Makino et al. , 1998 ) . Background train
Finally , feedback expressions ( head nods and shakes ) are successfully predicted from speech , prosody and eye gaze in interaction with Embodied Communication Agents as well as human communication ( <CITED HERE> ; Morency et al. , 2005 ; Morency et al. , 2007 ; Morency et al. , 2009 ) . Background val
Another line of research approaches grounded language knowledge by augmenting distributional approaches of word meaning with perceptual information ( <CITED HERE> ; Steyvers , 2010 ; Feng and Lapata , 2010b ; Bruni et al. , 2011 ; Silberer and Lapata , 2012 ; Johns and Jones , 2012 ; Bruni et al. , 2012a ; Bruni et al. , 2012b ; Silberer et al. , 2013 ) . Background train
We also made use of the person-name/instance pairs automatically extracted by <CITED HERE> . This data provides counts for pairs such as `` Edwin Moses , hurdler '' and `` William Farley , industrialist . '' Uses train
<CITED HERE> presented an approach for constructing a BKB based on the S-SSTC . Background train
However , since work in this direction has started , a significant progress has also been made in the research on statistical learning of full parsers , both in terms of accuracy and processing time ( <CITED HERE>b ; Charniak , 1997a ; Collins , 1997 ; Ratnaparkhi , 1997 ) . Background train
Also , advanced methods often require many training iterations , for example active learning ( Dagan and Engelson , 1995 ) and co-training ( <CITED HERE> ) . Background train
<CITED HERE> furthered this work by showing that a bimodal topic model , consisting of both text and feature norms , outperformed models using only one modality on the prediction of association norms , word substitution errors , and semantic interference tasks . Extends val
It can be shown ( <CITED HERE> ) that the use of this model with maximum likelihood parameter estimation is justified on information-theoretic grounds when q represents some prior knowledge about the true distribution and when the expected values of f in the training corpus are identical to their true expected values . There is no requirement that the components of f represent disjoint or statistically independent events . Motivation train
Shortly after the publication of The Sound Pattern of English ( Chomsky and Halle 1968 ) , Kornai points out , `` Johnson ( 1970 ) demonstrated that the context-sensitive machinery of SPE ... [ could ] be replaced by a much simpler one , based on finite-state transducers ( FSTs ) ; the same conclusion was reached independently by Kaplan and Kay , whose work remained an underground classic until it was finally published in <CITED HERE> . '' Background train
In this article , we use a newer version of the corpus by <CITED HERE> than the one we used in Marton , Habash , and Rambow ( 2011 ) . Uses train
<CITED HERE>b ) and Topkara et al. ( 2006a ) attained the embedding capacity of 0.5 bits per sentence with the syntactic transformation method . Background train
Another possibility that often works better is to use Minimum Bayes-Risk ( MBR ) decoding ( <CITED HERE> ; Liang , Taskar , and Klein 2006 ; Ganchev , and Taskar 2007 ) . Uses train
The subcategorization requirements expressed by semantic forms are enforced at f-structure level through completeness and coherence well-formedness conditions on f-structure ( <CITED HERE> ) : An f-structure is locally complete iff it contains all the governable grammatical functions that its predicate governs . Background train
This paper describes an approach for sharing resources in various grammar formalisms such as Feature-Based Lexicalized Tree Adjoining Grammar ( FB-LTAG1 ) ( Vijay-Shanker , 1987 ; <CITED HERE> ) and Head-Driven Phrase Structure Grammar ( HPSG ) ( Pollard and Sag , 1994 ) by a method of grammar conversion . Background train
Following the example of <CITED HERE> , we will call the autonomous units of a hypertext lexias ( from ` lexicon ' ) , a word coined by Roland Barthes ( 1970 ) . Uses train
( <CITED HERE> ) . Background train
We use the same set of binary features as in previous work on this dataset ( <CITED HERE> ; Pang and Lee , 2004 ; Zaidan et al. , 2007 ) . Uses train
Various feature selection techniques have been used in automatic text categorization ; they include document frequency ( DF ) , information gain ( IG ) ( <CITED HERE> ) , minimum description length principle ( Lang , 1995 ) , and the χ² statistic . Background train
Politically-oriented text Sentiment analysis has specifically been proposed as a key enabling technology in eRulemaking , allowing the automatic analysis of the opinions that people submit ( Shulman et al. , 2005 ; Cardie et al. , 2006 ; <CITED HERE> ) . Background train
<CITED HERE> observed that dependency parsers tend to do quite poorly when parsing questions due to their limited exposure to them in the news corpora from the PennTreebank . CompareOrContrast val
<CITED HERE>a ) 's similar word list for eat misses these but includes sleep ( ranked 6 ) and sit ( ranked 14 ) , because these have similar subjects to eat . Background train
• Learnability ( <CITED HERE> ) • Text generation ( Hovy 1988 ; Milosavljevic , Tulloch , and Dale 1996 ) • Speech generation ( Rayner and Carter 1997 ) • Localization ( Schäler 1996 ) Background train
For example , a ` web page ' is more similar to an infinite canvas than a written page ( <CITED HERE> ) . Background train
Other molecular biology databases We also included several model organism databases or nomenclature databases in the construction of the dictionary , i.e. , mouse -- Mouse Genome Database ( MGD ) <CITED HERE> , fly -- FlyBase [ 19 ] , yeast -- Saccharomyces Genome Database ( SGD ) [ 20 ] , rat -- Rat Genome Database ( RGD ) [ 21 ] , worm -- WormBase [ 22 ] , Human Nomenclature Database ( HUGO ) [ 23 ] , Online Mendelian Inheritance in Man ( OMIM ) [ 24 ] , and Enzyme Nomenclature Database ( ECNUM ) [ 25 , 26 ] . Uses train
This view of typed feature structures differs from the perspective on typed feature structures as modeling partial information as in ( <CITED HERE> ) . Background train
This approach resembles the work by Grishman et al. ( 1986 ) and <CITED HERE> on selectional restrictions . CompareOrContrast train
This is noticeable for German ( <CITED HERE> ) and Portuguese ( Afonso et al. , 2002 ) , which still have high overall accuracy thanks to very high attachment scores , but much more conspicuous for Czech ( Böhmová et al. , 2003 ) , Dutch ( van der Beek et al. , 2002 ) and Slovene ( Džeroski et al. , 2006 ) , where root precision drops more drastically to about 69 % , 71 % and 41 % , respectively , and root recall is also affected negatively . CompareOrContrast train
LTAG ( <CITED HERE> ) is a grammar formalism that provides syntactic analyses for a sentence by composing elementary trees with two opera - Background train
Translations have been generated by the CrowdFlower3 channel to Amazon Mechanical Turk4 ( MTurk ) , adopting the methodology proposed by ( <CITED HERE> ) . Uses train
This work is a continuation of that initiated in ( <CITED HERE> ) , which reports an overall NB classification correctness of 75.6 % , in cross validation experiments , on a data set that consists of 100 documents for each of 12 categories ( the data set is collected from different Arabic portals ) . Extends train
Our previous work ( <CITED HERE> ) designed an EM-based method to construct unsupervised trees for tree-based translation models . CompareOrContrast train
Then , we binarize the English parse trees using the head binarization approach ( <CITED HERE> ) and use the resulting binary parse trees to build another s2t system . Uses train
This is the strongest version of the sorites paradox ( e.g. , <CITED HERE> ) . Background train
We used a standard implementation of IBM Model 4 ( <CITED HERE> ) and because changing the existing code is not trivial , we could not use the same stopping criterion to avoid overfitting and we are not able to produce precision/recall curves . Uses train
Cross-lingual Textual Entailment ( CLTE ) has been proposed by ( <CITED HERE> ) as an extension of Textual Entailment ( Dagan and Glickman , 2004 ) that consists in deciding , given two texts T and H in different languages , if the meaning of H can be inferred from the meaning of T . Background val
Some approaches apply semantic parsing , where words and sentences are mapped to logical structure meaning ( <CITED HERE> ) . Background train
For projective parsing , it is significantly faster than exact dynamic programming , at the cost of small amounts of search error . We are interested in extending these ideas to phrase-structure and lattice parsing , and in trying other higher-order features , such as those used in parse reranking ( Charniak and Johnson , 2005 ; <CITED HERE> ) and history-based parsing ( Nivre and McDonald , 2008 ) . Future train
Prototypes of Internet search engines for linguists , corpus linguists and lexicographers have been proposed : WebCorp ( Kehoe and Renouf , 2002 ) , KWiCFinder ( <CITED HERE>a ) and the Linguist 's Search Engine ( Kilgarriff , 2003 ; Resnik and Elkiss , 2003 ) . Background train
Other molecular biology databases We also included several model organism databases or nomenclature databases in the construction of the dictionary , i.e. , mouse -- Mouse Genome Database ( MGD ) [ 18 ] , fly -- FlyBase [ 19 ] , yeast -- Saccharomyces Genome Database ( SGD ) [ 20 ] , rat -- Rat Genome Database ( RGD ) [ 21 ] , worm -- WormBase <CITED HERE> , Human Nomenclature Database ( HUGO ) [ 23 ] , Online Mendelian Inheritance in Man ( OMIM ) [ 24 ] , and Enzyme Nomenclature Database ( ECNUM ) [ 25 , 26 ] . Uses train
Other factors , such as the role of focus ( Grosz 1977 , 1978 ; Sidner 1983 ) or quantifier scoping ( <CITED HERE> ) must play a role , too . Background train
Since mid-2002 , the Library has been employing software that automatically suggests MeSH headings based on content ( <CITED HERE> ) . Background train
Discriminative approaches ( especially SVMs ) have been shown to be very effective for many supervised classification tasks ; see , for example , ( Joachims , 1998 ; <CITED HERE> ) . Background val
Also , the <CITED HERE> approach will be undefined if the pair is unobserved on the web . Uses train
The automation of help-desk responses has been previously tackled using mainly knowledge-intensive paradigms , such as expert systems ( <CITED HERE> ) and case-based reasoning ( Watson 1997 ) . CompareOrContrast train
We measure this association using pointwise Mutual Information ( MI ) ( <CITED HERE> ) . Uses train
Some methods of semantic relation analysis rely on predefined templates filled with information from processed texts ( <CITED HERE> ) . Background train
There are two corpora normally used for evaluation in a number of text-processing tasks : the Brown corpus ( <CITED HERE> ) and the Wall Street Journal ( WSJ ) corpus , both part of the Penn Treebank ( Marcus , Marcinkiewicz , and Santorini 1993 ) . Uses train
or quotation of messages in emails or postings ( see Mullen and Malouf ( 2006 ) but cf. <CITED HERE> ) . Background train
<CITED HERE> extend LDA to allow for the inference of document and topic distributions in a multimodal corpus . Background val
SNoW ( <CITED HERE> ; Roth , 1998 ) is a multi-class classifier that is specifically tailored for learning in domains in which the potential number of information sources ( features ) taking part in decisions is very large , of which NLP is a principal example . Uses train
The search algorithm is the standard Viterbi search ( <CITED HERE> ) , except that the match involves a network-to-network alignment problem rather than sequence-to-sequence . Uses train
Since then this idea has been applied to several tasks , including word sense disambiguation ( Yarowsky 1995 ) and named-entity recognition ( <CITED HERE> ) . Background train
We employed the LIBSVM package ( <CITED HERE> ) . Uses train
coreference performance on perfect mentions ( e.g. , <CITED HERE> ) ; and for those that do report per- CompareOrContrast val
According to <CITED HERE> , p. 67 ) , these two sentences are incoherent . CompareOrContrast train
Thus , over the past few years , along with advances in the use of learning and statistical methods for acquisition of full parsers ( Collins , 1997 ; Charniak , 1997a ; Charniak , 1997b ; Ratnaparkhi , 1997 ) , significant progress has been made on the use of statistical learning methods to recognize shallow parsing patterns syntactic phrases or words that participate in a syntactic relationship ( <CITED HERE> ; Ramshaw and Marcus , 1995 ; Argamon et al. , 1998 ; Cardie and Pierce , 1998 ; Munoz et al. , 1999 ; Punyakanok and Roth , 2001 ; Buchholz et al. , 1999 ; Tjong Kim Sang and Buchholz , 2000 ) . Background train
Another dialogue acquisition system has been developed by <CITED HERE> . CompareOrContrast train
Lexical functional grammar ( Kaplan and Bresnan 1982 ; Bresnan 2001 ; <CITED HERE> ) is a member of the family of constraint-based grammars . Background val
Our rules are similar to those from <CITED HERE> . CompareOrContrast train
In psycholinguistics , relatedness of words can also be determined through association tests ( Schulte im <CITED HERE> ) . Background train
In this article , we use an in-house system which provides functional gender , number , and rationality features ( <CITED HERE> ) . Uses train
Over the last decade there has been a lot of interest in developing tutorial dialogue systems that understand student explanations ( <CITED HERE> ; Graesser et al. , 1999 ; Aleven et al. , 2001 ; Buckley and Wolska , 2007 ; Nielsen et al. , 2008 ; VanLehn et al. , 2007 ) , because high percentages of self-explanation and student contentful talk are known to be correlated with better learning in human-human tutoring ( Chi et al. , 1994 ; Litman et al. , 2009 ; Purandare and Litman , 2008 ; Steinhauser et al. , 2007 ) . Background train
The use of the web as a corpus for teaching and research on language has been proposed a number of times ( Kilgarriff , 2001 ; Robb , 2003 ; Rundell , 2000 ; <CITED HERE> , 2004b ) and received a special issue of the journal Computational Linguistics ( Kilgarriff and Grefenstette , 2003 ) . Background val
Several authors in communication studies have pointed out that head movements are relevant to feedback phenomena ( see <CITED HERE> for an overview ) . Background train
Our method resorts to some translation examples , which is similar to example-based translation or translation memory ( <CITED HERE> ; He et al. , 2010 ; Ma et al. , 2011 ) . CompareOrContrast train
Baseline Systems We choose three publicly available state-of-the-art end-to-end coreference systems as our baselines : Stanford system ( Lee et al. , 2011 ) , Berkeley system ( <CITED HERE> ) and HOTCoref system ( Björkelund and Kuhn , 2014 ) . CompareOrContrast val
A number of speech understanding systems have been developed during the past fifteen years ( Barnett et al. 1980 , Dixon and Martin 1979 , Erman et al. 1980 , Haton and Pierrel 1976 , Lea 1980 , Lowerre and Reddy 1980 , Medress 1980 , Reddy 1976 , Walker 1978 , and <CITED HERE> ) . CompareOrContrast train
As for work on Arabic ( MSA ) , results have been reported on the PATB ( Kulick , Gabbard , and Marcus 2006 ; Diab 2007 ; Green and Manning 2010 ) , the Prague Dependency Treebank ( PADT ) ( Buchholz and Marsi 2006 ; Nivre 2008 ) and the CATiB ( <CITED HERE> ) . Background train
Other representations use the link structure ( <CITED HERE> ) or generate graph representations of the extracted features ( Kalashnikov et al. , 2007 ) . Background train
From an IR view , a lot of specialized research has already been carried out for medical applications , with emphasis on the lexico-semantic aspects of dederivation and decomposition ( Pacak et al. , 1980 ; Norton and Pacak , 1983 ; <CITED HERE> ; Wingert , 1985 ; Dujols et al. , 1991 ; Baud et al. , 1998 ) . Background train
There has been some controversy , at least for simple stemmers ( Lovins , 1968 ; <CITED HERE> ) , about the effectiveness of morphological analysis for document retrieval ( Harman , 1991 ; Krovetz , 1993 ; Hull , 1996 ) . Background train
This revalidates the observation of <CITED HERE> that phrase structure representations and dependency representations add complementary value to the learning task . CompareOrContrast train
The parsing algorithm used for all languages is the deterministic algorithm first proposed for unlabeled dependency parsing by Nivre ( 2003 ) and extended to labeled dependency parsing by <CITED HERE> . Uses train
The semantic categories of verbs and other words are extracted from the Semantic Knowledge-base of Contemporary Chinese ( <CITED HERE> ) . Uses train
Moreover , in order to determine whether the performances of the predictive criteria are consistent across different learning models within the same domain , we have performed the study on two parsing models : one based on a context-free variant of tree-adjoining grammars ( Joshi , Levy , and Takahashi 1975 ) , the Probabilistic Lexicalized Tree Insertion Grammar ( PLTIG ) formalism ( <CITED HERE> ; Hwa 1998 ) , and Collins 's Model 2 parser ( 1997 ) . Uses train
The Partial-VP Topicalization Lexical Rule proposed by <CITED HERE> , 10 ) is a linguistic example . Background train
We measure the inter-annotator agreement using the Fleiss Kappa ( <CITED HERE> ) measure ( κ ) where the agreement lies around 0.79 . Uses train
The task we used to compare different generalisation techniques is similar to that used by <CITED HERE> and Rooth et al. ( 1999 ) . CompareOrContrast val
Other milestones of recent research include the deployment of probabilistic and machine learning techniques ( Aone and Bennett 1995 ; Kehler 1997 ; Ge , Hale , and Charniak 1998 ; Cardie and Wagstaff 1999 ) ; the continuing interest in centering , used either in original or in revised form ( Abracos and Lopes 1994 ; Strube and Hahn 1996 ; <CITED HERE> ; Tetreault 1999 ) ; and proposals related to the evaluation methodology in anaphora resolution ( Mitkov 1998a , 2001b ) . Background train
It is not aimed at handling dependencies , which require heavy use of lexical information ( <CITED HERE> , for PP attachment ) . CompareOrContrast val
This includes work on question answering ( Wang et al. , 2007 ) , sentiment analysis ( <CITED HERE> ) , MT reordering ( Xu et al. , 2009 ) , and many other tasks . Background train
A more detailed discussion of various aspects of the proposed parser can be found in ( <CITED HERE> ) . Background val
Our experimental design with professional bilingual translators follows our previous work <CITED HERE>a ) comparing scratch translation to post-edit . Extends val
Lexical functional grammar ( Kaplan and Bresnan 1982 ; <CITED HERE> ; Dalrymple 2001 ) is a member of the family of constraint-based grammars . Background train
Thus , the second class of SBD systems employs machine learning techniques such as decision tree classifiers ( <CITED HERE> ) , neural networks ( Palmer and Hearst 1994 ) , and maximum-entropy modeling ( Reynar and Ratnaparkhi 1997 ) . Background train
Previously , a user study ( <CITED HERE> ) has shown that people are reluctant to type full natural language questions , even after being told that they were using a question-answering system and that typing complete questions would result in better performance . CompareOrContrast train
We will examine the worst-case complexity of interpretation as well as generation to shed some light on the hypothesis that vague descriptions are more difficult to process than others because they involve a comparison between objects ( Beun and Cremers 1998 , <CITED HERE> ) . Background train
In our case , the clustering is performed by the program Snob , which implements mixture modeling combined with model selection based on the Minimum Message Length ( MML ) criterion ( Wallace and Boulton 1968 ; <CITED HERE> ) . Uses train
The third approach to cross-lingual retrieval is to map queries and documents to some intermediate representation , e.g. , latent semantic indexing ( LSI ) ( Littman et al. , 1998 ) , or the General Vector space model ( GVSM ) ( <CITED HERE> ) . CompareOrContrast train
Many provide graphical user interfaces ( GUI ) for manual annotation ( e.g. General Architecture for Text Engineering ( GATE ) ( <CITED HERE> ) and the Alembic Workbench ( Day et al. , 1997 ) ) as well as NLP tools and resources that can be manipulated from the GUI . Background train
The keypoints are clustered into 5,000 visual codewords ( centroids ) using k-means clustering ( <CITED HERE> ) , and images are then quantized over the 5,000 codewords . Uses train
Another approach for partial parsing was presented by <CITED HERE> . Background val
In knowledge-lean approaches , coreference resolvers employ only morpho-syntactic cues as knowledge sources in the resolution process ( e.g. , Mitkov ( 1998 ) , <CITED HERE> ) . Background train
The application of domain models and deep semantic knowledge to question answering has been explored by a variety of researchers ( e.g. , <CITED HERE> , Rinaldi et al. 2004 ) , and was also the focus of recent workshops on question answering in restricted domains at ACL 2004 and AAAI 2005 . Background train
This result is consistent with other works using this model with these features ( Andrews et al. , 2009 ; <CITED HERE> ) . CompareOrContrast train
Cases of intracategorial synonymy are relatively straightforward as several electronic synonym dictionaries for French are available ( <CITED HERE> ) . Background train
The feature of head word trigger which we apply to the log-linear model is motivated by the trigger-based approach ( <CITED HERE> ) . Motivation train
Other work on modeling the meanings of verbs using video recognition has also begun showing great promise ( Mathe et al. , 2008 ; <CITED HERE> ) . Background val
based parsing algorithms with an arc-factored parameterization ( <CITED HERE> ) . Uses train
No attempt has been made to map any closed class entries from LDOCE , as a 3,000 word lexicon containing most closed class items has been developed independently by one of the groups collaborating with us to develop the general purpose morphological and syntactic analyser ( see the Introduction and <CITED HERE> ) . Background train
In this paper we focus on the exploitation of the LDOCE grammar coding system ; <CITED HERE> and Alshawi ( 1987 ) describe further research in Cambridge utilising different types of information available in LDOCE . Background val
Thus , the second class of SBD systems employs machine learning techniques such as decision tree classifiers ( Riley 1989 ) , neural networks ( <CITED HERE> ) , and maximum-entropy modeling ( Reynar and Ratnaparkhi 1997 ) . Background train
We see no good reason , however , why such text spans should necessarily be sentences , since the majority of tagging paradigms ( e.g. , Hidden Markov Model [ HMM ] [ Kupiec 1992 ] , Brill 's [ <CITED HERE>a ] , and MaxEnt [ Ratnaparkhi 1996 ] ) do not attempt to parse an entire sentence and operate only in the local window of two to three tokens . CompareOrContrast train
Similarly , ( Barzilay and Lee , 2003 ) and ( <CITED HERE> ) learn sentence level paraphrase templates from a corpus of news articles stemming from different news sources . Background train
In future work we plan to experiment with richer representations , e.g. including long-range n-grams ( Rosenfeld , 1996 ) , class n-grams ( <CITED HERE> ) , grammatical features ( Amaya and Benedy , 2001 ) , etc . Future val
Furthermore , we demonstrate that our results carry over successfully to another parser , the Easy-First Parser ( <CITED HERE> ) ( Section 6 ) . Uses train
For instance , relating `` they '' to `` apples '' in the sentence ( cf. <CITED HERE> p. 195 ; Zadrozny 1987a ) : We bought the boys apples because they were so cheap Background train
We follow our previous work ( <CITED HERE> ) in our feature choices , using a five-word window that includes the target stem and two words on either side for context ( see also Tetreault and Chodorow , 2008 ) . Extends train
Adjectives , more than other categories , are a striking example of regular polysemy since they are able to take on different meanings depending on their context , viz. , the noun or noun class they modify ( see <CITED HERE> and the references therein ) . Background train
According to <CITED HERE> , LFG assumes the following universally available inventory of grammatical functions : SUBJ ( ect ) , OBJ ( ect ) , OBJe , COMP , XCOMP , OBL ( ique ) e , ADJ ( unct ) , XADJ . Background train
But their importance has grown far beyond machine translation : for instance , transferring annotations between languages ( Yarowsky and Ngai 2001 ; Hwa et al. 2005 ; Ganchev , Gillenwater , and Taskar 2009 ) ; discovery of paraphrases ( Bannard and Callison-Burch 2005 ) ; and joint unsupervised POS and parser induction across languages ( <CITED HERE> ) . Motivation train
This interpretation of the signature is sometimes referred to as closed world ( Gerdemann and King 1994 ; <CITED HERE> ) . Background val
Our training examples are similar to the data created for pseudodisambiguation , the usual evaluation task for SP models ( Erk , 2007 ; Keller and Lapata , 2003 ; <CITED HERE> ) . CompareOrContrast train
There have already been several attempts to develop distributed NLP systems for dialogue systems ( Bayer et al. , 2001 ) and speech recognition ( <CITED HERE> ) . Background train
For instance , implementing an efficient version of the MXPOST POS tagger ( <CITED HERE> ) will simply involve composing and configuring the appropriate text file reading component , with the sequential tagging component , the collection of feature extraction components and the maximum entropy model component . Future train
cue word and name the first ( or several ) associated words that come to mind ( e.g. , Nelson et al. ( 2004 ) ) , and feature norms , where subjects are given a cue word and asked to describe typical properties of the cue concept ( e.g. , <CITED HERE> ) . Background train
<CITED HERE> and Litman and Hirschberg ( 1990 ) also examine the relation between discourse and prosodic phrasing . Background train
<CITED HERE> evaluate 914 Czech verbs against a custom-made gold standard and record a token recall of 88 % . CompareOrContrast train
We use two measures from Information Retrieval to determine the quality of an automatically generated response : precision and F-score ( van Rijsbergen 1979 ; <CITED HERE> ) . Uses val
The automation of help-desk responses has been previously tackled using mainly knowledge-intensive paradigms , such as expert systems ( Barr and Tessler 1995 ) and case-based reasoning ( <CITED HERE> ) . CompareOrContrast train
<CITED HERE> did very encouraging work on the feature calibration of semantic role labeling . Background train
This approach , which uses words that appear in the context of terms to formulate hypotheses on their semantic relatedness ( <CITED HERE> , for example ) , does not specify the relationship itself . Background train
And ( <CITED HERE> ) use clustering and similarity measures to identify similar contexts in a single corpus and extract verbal paraphrases from these contexts . Background train
Not having to represent the frame explicitly not only enables the linguist to express only the relevant things , but also allows a more compact representation of lexical rules where explicit framing would require the rules to be split up ( <CITED HERE> ) . Background train
There has also been work focused upon determining the political leaning ( e.g. , `` liberal '' vs. `` conservative '' ) of a document or author , where most previously-proposed methods make no direct use of relationships between the documents to be classified ( the `` unlabeled '' texts ) ( Laver et al. , 2003 ; Efron , 2004 ; <CITED HERE> ) . Background train
Other similar approaches include those of Cicekli and Güvenir ( 1996 ) , <CITED HERE> , Carl ( 1999 ) , and Brown ( 2000 ) , inter alia . Background train
To sum up , this work has been carried out to automatically classify Arabic documents using the NB algorithm , with the use of a different data set , a different number of categories , and a different root extraction algorithm from those used in ( <CITED HERE> ) . CompareOrContrast val
Our work builds on earlier research on learning to identify dialogues in which the user experienced poor speech recognizer performance ( <CITED HERE> ) . Extends val
CCGBank ( <CITED HERE> ) is used to train the model . Uses train
More recent work on terminology structuring has focussed on formal similarity to develop hypotheses on the semantic relationships between terms : <CITED HERE> uses derivational morphology ; Grabar and Zweigenbaum ( 2002 ) use , as a starting point , a number of identical characters . Background train
Some researchers ( Cucerzan , 2007 ; <CITED HERE> ) have explored the use of Wikipedia information to improve the disambiguation process . Background train
To address this issue , we use a version of the PATB3 training and dev sets manually annotated with functional gender , number , and rationality ( <CITED HERE> ) .18 This is the first resource providing all three features ( ElixirFm only provides functional number , and to some extent functional gender ) . Uses train
One approach to this problem is that taken by the ASCOT project ( Akkerman et al. , 1985 ; <CITED HERE> ) . Background train
Narrative writings or essays are creative works and they generally treat ownership as authorship , even for the most enthusiastic fellows of free culture ( <CITED HERE> ) . Background train
McDonald has even argued for extending the model to a large number of components ( McDonald 1988 ) , and several systems have indeed added an additional component between the planner and the linguistic component ( Meteer 1994 ; Panaget 1994 ; <CITED HERE> ) . Background train
According to current tagger comparisons ( van Halteren et al. , 1998 ; Zavrel and Daelemans , 1999 ) , and according to a comparison of the results presented here with those in ( <CITED HERE> ) , the Maximum Entropy framework seems to be the only other approach yielding comparable results to the one presented here . CompareOrContrast train
Similar findings have been proposed by <CITED HERE> , who points out that V1 and V2 are paired on the basis of their semantic compatibility , which is subject to syntactic constraints . Background train
The first lexical substitution method was proposed by <CITED HERE> . Background train
This can be a hazardous affair , since vague expressions tend to be interpreted in different ways by different people ( <CITED HERE> ) , sometimes in stark contrast with the intention of the speaker/writer ( Berry , Knapp , and Raynor 2002 ) . Background train
We see no good reason , however , why such text spans should necessarily be sentences , since the majority of tagging paradigms ( e.g. , Hidden Markov Model [ HMM ] [ Kupiec 1992 ] , Brill 's [ Brill 1995a ] , and MaxEnt [ <CITED HERE> ] ) do not attempt to parse an entire sentence and operate only in the local window of two to three tokens . CompareOrContrast train
<CITED HERE> ; also reported in Levelt 1989 ) show that greater differences are most likely to be chosen , presumably because they are more striking . Background val
Nevertheless , recent results show that knowledge-poor methods perform with amazing accuracy ( cf. ( <CITED HERE> ) , ( Kennedy and Boguraev , 1996 ) ( Kameyama , 1997 ) ) . Background train
Notice that it is not possible to use corpus annotation to determine the likelihood of a given property to be chosen , unless we know in advance all of the properties that can be attributed to a given object , as in the case of Jordan 's work on the COCONUT domain ( <CITED HERE> ) . Background train
Some well-known approaches include rule-based models ( Brill and Resnik 1994 ) , backed-off models ( <CITED HERE> ) , and a maximum-entropy model ( Ratnaparkhi 1998 ) . Background train
Other psycholinguistic studies that confirm the validity of paragraph units can be found in Black and Bower ( 1979 ) and <CITED HERE> . Background train
Tetreault 's contribution features comparative evaluation involving the author 's own centering-based pronoun resolution algorithm called the Left-Right Centering algorithm ( LRC ) as well as three other pronoun resolution methods : Hobbs 's naive algorithm ( Hobbs 1978 ) , BFP ( Brennan , Friedman , and Pollard 1987 ) , and Strube 's S-list approach ( <CITED HERE> ) . Background train
<CITED HERE> investigated three approaches to the automatic generation of response e-mails : text classification , case-based reasoning , and question answering . CompareOrContrast train
This contrasts with one of the traditional approaches ( e.g. , <CITED HERE> ; Watanabe 1995 ) to posing the translation problem , i.e. , the approach in which translation problems are seen in terms of bridging the gap between the most natural monolingual representations underlying the sentences of each language . CompareOrContrast val
In practical context , German , English , and Japanese HPSG-based grammars are developed and used in the Verbmobil project ( <CITED HERE> ) . Background train
Thus , over the past few years , along with advances in the use of learning and statistical methods for acquisition of full parsers ( Collins , 1997 ; Charniak , 1997a ; Charniak , 1997b ; Ratnaparkhi , 1997 ) , significant progress has been made on the use of statistical learning methods to recognize shallow parsing patterns -- syntactic phrases or words that participate in a syntactic relationship ( Church , 1988 ; Ramshaw and Marcus , 1995 ; Argamon et al. , 1998 ; Cardie and Pierce , 1998 ; Munoz et al. , 1999 ; <CITED HERE> ; Buchholz et al. , 1999 ; Tjong Kim Sang and Buchholz , 2000 ) . Background train
For example , such schema can serve as a means to represent translation examples , or find structural correspondences for the purpose of transfer grammar learning ( Menezes & Richardson , 2001 ) , ( Aramaki et al. , 2001 ) , ( Watanabe et al. , 2000 ) , ( <CITED HERE> ) , ( Matsumoto et al. , 1993 ) , ( Kaji et al. , 1992 ) , and example-based machine translation EBMT3 ( Sato & Nagao , 1990 ) , ( Sato , 1991 ) , ( Richardson et al. , 2001 ) , ( Al-Adhaileh & Tang , 1999 ) . Background train
While IA is generally thought to be consistent with findings on human language production ( Hermann and Deutsch 1976 ; Levelt 1989 ; Pechmann 1989 ; <CITED HERE> ) , the hypothesis that incrementality is a good model of human GRE seems unfalsifiable until a preference order is specified for the properties on which it operates . Background train
It helps them build complex knowledge bases by combining components : events , entities and modifiers ( <CITED HERE> ) . Background train
Our group has developed a wide-coverage HPSG grammar for Japanese ( <CITED HERE> ) , which is used in a high-accuracy Japanese dependency analyzer ( Kanayama et al. , 2000 ) . Background train
Word frequency counts in internet search engines are inconsistent and unreliable ( <CITED HERE> ) . Background train
Other milestones of recent research include the deployment of probabilistic and machine learning techniques ( Aone and Bennett 1995 ; Kehler 1997 ; Ge , Hale , and Charniak 1998 ; Cardie and Wagstaff 1999 ) ; the continuing interest in centering , used either in original or in revised form ( Abracos and Lopes 1994 ; Strube and Hahn 1996 ; Hahn and Strube 1997 ; Tetreault 1999 ) ; and proposals related to the evaluation methodology in anaphora resolution ( <CITED HERE>a , 2001b ) . Background train
It is only recently that web name ambiguity has been approached as a separate problem and defined as an NLP task -- Web People Search -- on its own ( Artiles et al. , 2005 ; <CITED HERE> ) . Background train
Actually , if we use LSH technique ( <CITED HERE> ) in retrieval process , the local method can be easily scaled to a larger training data . Future train
Many lexicons , both automatically acquired and manually created , are more fine grained in their approaches to subcategorized clausal arguments , differentiating , for example , between a that-clause and a to + infinitive clause ( <CITED HERE> ) . Background train
Others provide automatic mappings of natural language instructions to executable actions , such as interpreting navigation directions ( <CITED HERE> ) or robot commands ( Tellex et al. , 2011 ; Matuszek et al. , 2012 ) . Background train
Agreement between two annotation sets is calculated here in terms of Cohen 's kappa ( Cohen , 1960 ) 1 and corrected kappa ( <CITED HERE> ) 2 . Uses train
Second , software for utilizing this ontology already exists : MetaMap ( Aronson 2001 ) identifies concepts in free text , and SemRep ( <CITED HERE> ) extracts relations between the concepts . Background val
The implementation has been inspired by experience in extracting information from very large corpora ( Curran and Moens , 2002 ) and performing experiments on maximum entropy sequence tagging ( Curran and Clark , 2003 ; <CITED HERE> ) . Motivation train
Some efforts have tackled tasks such as automatic image caption generation ( Feng and Lapata , 2010a ; <CITED HERE> ) , text illustration ( Joshi et al. , 2006 ) , or automatic location identification of Twitter users ( Eisenstein et al. , 2010 ; Wing and Baldridge , 2011 ; Roller et al. , 2012 ) . Background train
These types of features result in an improvement in both the mention detection and coreference resolution performance , as shown through experiments on the <CITED HERE> Arabic data . Uses val
The use of the web as a corpus for teaching and research on language has been proposed a number of times ( Kilgarriff , 2001 ; <CITED HERE> ; Rundell , 2000 ; Fletcher , 2001 , 2004b ) and received a special issue of the journal Computational Linguistics ( Kilgarriff and Grefenstette , 2003 ) . Background train
But while Bod 's estimator obtains state-of-the-art results on the WSJ , comparable to Charniak ( 2000 ) and <CITED HERE> , Bonnema et al. 's estimator performs worse and is comparable to Collins ( 1996 ) . Background train
The implementation has been inspired by experience in extracting information from very large corpora ( <CITED HERE> ) and performing experiments on maximum entropy sequence tagging ( Curran and Clark , 2003 ; Clark et al. , 2003 ) . Motivation train
Another line of research approaches grounded language knowledge by augmenting distributional approaches of word meaning with perceptual information ( Andrews et al. , 2009 ; Steyvers , 2010 ; Feng and Lapata , 2010b ; Bruni et al. , 2011 ; Silberer and Lapata , 2012 ; Johns and Jones , 2012 ; Bruni et al. , 2012a ; Bruni et al. , 2012b ; <CITED HERE> ) . Background train
14We parse each sentence with the Collins parser ( <CITED HERE> ) . Uses train
Encouraged by the success of chunk-based verb reordering lattices on Arabic-English ( <CITED HERE> ) , we tried to adapt the same approach to the German-English language pair . Motivation train
We followed the same experimental procedure as discussed in ( <CITED HERE> ) for English polymorphemic words . Uses train
In our experiments , we employed the well-known classifier SVMlight to obtain individual-document classification scores , treating Y as the positive class and using plain unigrams as features .5 Following standard practice in sentiment analysis ( <CITED HERE> ) , the input to SVMlight consisted of normalized presence-of-feature ( rather than frequency-of-feature ) vectors . Uses train
The problem of handling ill-formed input has been studied by Carbonell and Hayes ( 1983 ) , Granger ( 1983 ) , Jensen et al. ( 1983 ) , <CITED HERE> , Riesbeck and Schank ( 1976 ) , Thompson ( 1980 ) , Weischedel and Black ( 1980 ) , and Weischedel and Sondheimer ( 1983 ) . CompareOrContrast train
To address this problem , we are currently working on developing a metagrammar in the sense of ( <CITED HERE> ) . Future train
Sentences like 12 , from <CITED HERE> , are frequently cited . Background train
After the PropBank ( <CITED HERE> ) was built , Xue and Palmer ( 2005 ) and Xue ( 2008 ) have produced more complete and systematic research on Chinese SRL . Background train
In order to estimate the parameters of our model , we develop a blocked sampler based on that of <CITED HERE> to sample parse trees for sentences in the raw training corpus according to their posterior probabilities . Uses train
The extraction procedure utilizes a head percolation table as introduced by <CITED HERE> in combination with a variation of Collins 's ( 1997 ) approach to the differentiation between complement and adjunct . Background train
Many other such cases are described in Danlos 's book ( <CITED HERE> ) . Background train
• Before indexing the text , we process it with Textract ( Byrd and Ravin , 1998 ; <CITED HERE> ) , which performs lemmatization , and discovers proper names and technical terms . Uses train
Therefore , we repeated the experiments with POS tags predicted by the MADA toolkit ( <CITED HERE> ; Habash , Rambow , and Roth 2012 ) 15 ( see Table 2 , 14 Some parsers predict POS tags internally , instead of receiving them as input , but this is not the case in this article . Uses train
Experiments on Chinese SRL ( <CITED HERE> , Xue 2008 ) confirmed these findings . Motivation train
An example of this is the estimation of maximum entropy models , from simple iterative estimation algorithms used by <CITED HERE> that converge very slowly , to complex techniques from the optimisation literature that converge much more rapidly ( Malouf , 2002 ) . Background train
According to <CITED HERE> , there are three prevalent approaches for evaluating SR measures : mathematical analysis , application-specific evaluation and comparison with human judgments . Background train
The choice of learning algorithm for each classifier is motivated by earlier findings showing that discriminative classifiers outperform other machine-learning methods on error correction tasks ( <CITED HERE> ) . Motivation train
A good study comparing document categorization algorithms can be found in ( <CITED HERE> ) . Background train
For HMMs ( footnote 11 ) , Ti is the familiar trellis , and we would like this computation of ti to reduce to the forward-backward algorithm ( <CITED HERE> ) . Background val
We follow <CITED HERE> , for compound merging . Uses val
Semantic filters can also be used to prevent multiple versions of the same case frame ( <CITED HERE> ) showing up as complements . Uses val
The terms have been identified as the most specific to our corpus by a program developed by <CITED HERE> and called TERMoSTAT . Uses train
For statistical significance , we use McNemar 's test on non-gold LAS , as implemented by Nilsson and <CITED HERE> . Uses train
The availability of toolkits for this weighted case ( <CITED HERE> ; van Noord and Gerdemann , 2001 ) promises to unify much of statistical NLP . Background val
tionally reconstructed by <CITED HERE> and Crouch and Putman ( 1994 ) , the context-independent meaning of a sentence is given by one or more QLFs that are built directly from syntactic and semantic rules . CompareOrContrast val
We use a standard split of 268 training documents , 68 development documents , and 106 testing documents ( Culotta et al. , 2007 ; <CITED HERE> ) . Uses train
<CITED HERE> improved on this by marking prepositions with the case they mark ( one of the most important markups in our system ) . CompareOrContrast train
But typical OT grammars offer much richer finite-state models of left context ( <CITED HERE>a ) than provided by the traditional HMM finite-state topologies . Background train
Other tools have been designed around particular techniques , such as finite state machines ( Karttunen et al. , 1997 ; <CITED HERE> ) . Background train
This contrasts with the findings described in <CITED HERE> where significant improvements could be achieved by increasing the number of source languages . CompareOrContrast train
The function selects the Value that removes most distractors , but in case of a tie , the least specific contestant is chosen , as long as it is not less specific than the basic-level Value ( i.e. , the most commonly occurring and psychologically most fundamental level , <CITED HERE> ) . Background train
<CITED HERE> pointed out that distribution plots of judgments for the word pairs used by Rubenstein and Goodenough display an empty horizontal band that could be used to separate related and unrelated pairs . CompareOrContrast train
In fact , Reiter has even argued in favor of this approach , claiming that the interactions are sufficiently minor to be ignored ( or at least handled on an ad hoc basis ) ( <CITED HERE> ) . Background train
The method is called targeted self-training as it is similar in vein to self-training ( <CITED HERE> ) , with the exception that the new parse data is targeted to produce accurate word reorderings . CompareOrContrast train
In our previous work ( <CITED HERE> ) , we started an initial investigation on conversation entailment . Extends train
<CITED HERE> tried to construct a semantic analysis based on `` prepared '' and `` unprepared mind '' . Background train
It is also possible to focus on non-compositional compounds , a key point in bilingual applications ( <CITED HERE> ; Melamed , 1997 ; Lin , 99 ) . Background train
This is mainly due to the fact that Arabic is a non-concatenative language ( <CITED HERE> ) , and that the stem/infix obtained by suppression of infix and prefix add-ons is not the same for words derived from the same origin called the root . Background train
ear regression adapted for classification ( <CITED HERE> ) , which can be described by the following equation : Uses train
For example , the forward-backward algorithm ( Baum , 1972 ) trains only Hidden Markov Models , while ( <CITED HERE> ) trains only stochastic edit distance . Background val
Using the tree-cut technique described above , our previous work ( <CITED HERE> ) extracted systematic polysemy from WordNet . Extends val
( Och and Ney , 2002 ; Blunsom et al. , 2008 ) used maximum likelihood estimation to learn weights for MT. ( Och , 2003 ; Moore and Quirk , 2008 ; Zhao and Chen , 2009 ; <CITED HERE> ) employed an evaluation metric as a loss function and directly optimized it . CompareOrContrast val
5 Significant bigrams are obtained using the n-gram statistics package NSP ( <CITED HERE> ) , which offers statistical tests to decide whether to accept or reject the null hypothesis regarding a bigram ( that it is not a collocation ) . Uses val
Liu et al. ( 2005 ) , Meral et al. ( 2007 ) , Murphy ( 2001 ) , <CITED HERE> and Topkara et al. ( 2006a ) all belong to the syntactic transformation category . Background train
2The WePS-1 corpus includes data from the Web03 testbed ( <CITED HERE> ) which follows similar annotation guidelines , although the number of documents per ambiguous name is more variable . Uses train
The local training method ( <CITED HERE> ) is widely employed in computer vision ( Zhang et al. , 2006 ; Cheng et al. , 2010 ) . Background train
Recent work ( Banko and Brill , 2001 ; <CITED HERE> ) has suggested that some tasks will benefit from using significantly more data . Background train
Rather than producing a complete analysis of sentences , the alternative is to perform only partial analysis of the syntactic structures in a text ( Harris , 1957 ; <CITED HERE> ; Grefenstette , 1993 ) . Background train
The feasibility of automatically identifying outcome statements in secondary sources has been demonstrated by <CITED HERE> . CompareOrContrast train
The standard approach is to train two models independently and then intersect their predictions ( <CITED HERE> ) . CompareOrContrast train
We use the same method as <CITED HERE> for generating our multimodal corpora : for each word token in the text corpus , a feature is selected stochastically from the word 's feature distribution , creating a word-feature pair . Uses train
MI was also recently used for inference-rule SPs by <CITED HERE> . Background train
Based on this advice ( Moore and <CITED HERE> ) exclude the latent segmentation variables and opt for a heuristic training procedure . CompareOrContrast train
For example , 10 million words of the American National Corpus ( Ide et al. , 2002 ) will have manually corrected POS tags , a tenfold increase over the Penn Treebank ( <CITED HERE> ) , currently used for training POS taggers . Background train
The forward and backward probabilities , p0j and pkn , can be computed using single-source algebraic path for the simpler semiring ( R , + , × , ∗ ) -- or equivalently , by solving a sparse linear system of equations over R , a much-studied problem at O ( n ) space , O ( nm ) time , and faster approximations ( <CITED HERE> ) . Background train
4To prove ( 1 ) ⇒ ( 3 ) , express f as an FST and apply the well-known Kleene-Schützenberger construction ( <CITED HERE> ) , taking care to write each regexp in the construction as a constant times a probabilistic regexp . Uses val
Second , software for utilizing this ontology already exists : MetaMap ( <CITED HERE> ) identifies concepts in free text , and SemRep ( Rindflesch and Fiszman 2003 ) extracts relations between the concepts . Background val
Our work is more similar to NLG work that concentrates on structural constraints such as generative poetry ( <CITED HERE> ) ( Colton et al. , 2012 ) ( Jiang and Zhou , 2008 ) or song lyrics ( Wu et al. , 2013 ) ( Ramakrishnan A et al. , 2009 ) , where specified meter or rhyme schemes are enforced . CompareOrContrast train
Most DOP models , such as in Bod ( 1993 ) , Goodman ( 1996 ) , <CITED HERE> , Sima'an ( 2000 ) and Collins & Duffy ( 2002 ) , use a likelihood criterion in defining the best parse tree : they take ( some notion of ) the most likely ( i.e. most probable ) tree as a candidate for the best tree of a sentence . Background train
The best performance on the WSJ corpus was achieved by a combination of the SATZ system ( <CITED HERE> ) with the Alembic system ( Aberdeen et al. 1995 ) : a 0.5 % error rate . CompareOrContrast train
The extracted frames are noisy as a result of parser errors and so are filtered using the binomial hypothesis theory ( BHT ) , following <CITED HERE> . Background train
This method allows the efficient retrieval of arbitrary length n-grams ( Nagao and Mori , 94 ; Haruno et al. , 96 ; Ikehara et al. , 96 ; <CITED HERE> ; Russell , 1998 ) . Background train
The current system learns finite state flowcharts whereas typical learning systems usually acquire coefficient values as in Minsky and Papert ( 1969 ) , assertional statements as in <CITED HERE> , or semantic nets as in Winston ( 1975 ) . CompareOrContrast val
Our most accurate product model achieves an F score of 92.5 without the use of discriminative reranking and comes close to the best known numbers on this test set ( <CITED HERE> ) . CompareOrContrast train
To demonstrate that this is possible we have implemented a system which constructs dictionary entries for the PATR-II system ( <CITED HERE> and references therein ) . Uses val
Other works ( Kasper et al. , 1995 ; <CITED HERE> ) convert HPSG grammars into LTAG grammars . CompareOrContrast train
The work of <CITED HERE> demonstrates that faceted queries can be converted into simple filtering constraints to boost precision . Background val
Efficient hardware implementation is also possible via chip-level parallelism ( <CITED HERE> ) . Future train
2We could just as easily use other symmetric `` association '' measures , such as φ² ( Gale & Church , 1991 ) or the Dice coefficient ( <CITED HERE> ) . CompareOrContrast train
<CITED HERE> have argued that Dale and Reiter 's ( 1995 ) dichotomy between salient and nonsalient objects ( where the objects in the domain are the salient ones ) should be replaced by an account that takes degrees of salience into account : No object can be too unsalient to be referred to , as long as the right properties are available . Background train
The acquisition of dialogue as implemented in VNLCE is reminiscent of the program synthesis methodology developed by <CITED HERE> where program flowcharts were constructed from traces of their behaviors . CompareOrContrast train
Character classes , such as punctuation , are defined according to the Unicode Standard ( <CITED HERE> ) . Uses train
Rather than producing a complete analysis of sentences , the alternative is to perform only partial analysis of the syntactic structures in a text ( Harris , 1957 ; Abney , 1991 ; <CITED HERE> ) . Background train
Finally , it has been shown by Groesser ( 1981 ) that the ratio of derived to explicit information necessary for understanding a piece of text is about 8:1 ; furthermore , our reading of the analysis of five paragraphs by <CITED HERE> strongly suggests that only the most direct or obvious inferences are being made in the process of building a model or constructing a theory of a paragraph . Motivation train
In other methods , lexical resources are specifically tailored to meet the requirements of the domain ( Rosario and Hearst , 2001 ) or the system ( <CITED HERE> ) . Background train
The exact form of M ( Si ) need not be discussed at this point ; it could be a conceptual dependence graph ( <CITED HERE> ) , a deep parse of Si , or some other representation . Background train
Some methods are based on likelihood ( Och and Ney , 2002 ; Blunsom et al. , 2008 ) , error rate ( Och , 2003 ; <CITED HERE> ; Pauls et al. , 2009 ; Galley and Quirk , 2011 ) , margin ( Watanabe et al. , 2007 ; Chiang et al. , 2008 ) and ranking ( Hopkins and May , 2011 ) , and among which minimum error rate training ( MERT ) ( Och , 2003 ) is the most popular one . Background train
Similarly , ( <CITED HERE> ) and ( Shinyama et al. , 2002 ) learn sentence level paraphrase templates from a corpus of news articles stemming from different news sources . Background train
This setup is also scalable to a higher number of word pairs ( 350 ) as was shown in <CITED HERE> . Background train
More recent work on terminology structuring has focussed on formal similarity to develop hypotheses on the semantic relationships between terms : Daille ( 2003 ) uses derivational morphology ; <CITED HERE> use , as a starting point , a number of identical characters . Background train
Louwerse et al. ( 2006 ) and Louwerse et al. ( 2007 ) study the relation between eye gaze , facial expression , pauses and dialogue structure in annotated English map-task dialogues ( <CITED HERE> ) and find correlations between the various modalities both within and across speakers . Background train
Some examples include text categorization ( Lewis and Catlett 1994 ) , base noun phrase chunking ( Ngai and Yarowsky 2000 ) , part-of-speech tagging ( Engelson and Dagan 1996 ) , spelling confusion set disambiguation ( <CITED HERE> ) , and word sense disambiguation ( Fujii et al. 1998 ) . Background train
It is known that certain cue words and phrases ( <CITED HERE> ) can serve as explicit indicators of discourse structure . Motivation val
The last point may be seen better if we look at some differences between our system and KRYPTON , which also distinguishes between an object theory and background knowledge ( cf. <CITED HERE> ) . CompareOrContrast val
One approach to this problem is that taken by the ASCOT project ( <CITED HERE> ; Akkerman , 1986 ) . Background val
Though we could have used a further downstream measure like BLEU , METEOR has also been shown to directly correlate with translation quality ( <CITED HERE> ) and is simpler to measure . Motivation train
For all experiments reported in this section we used the syntactic dependency parser MaltParser v1.3 ( Nivre 2003 , 2008 ; Kübler , McDonald , and <CITED HERE> ) , a transition-based parser with an input buffer and a stack , which uses SVM classifiers Uses val
We conducted experiments with gold features to assess the potential of these features , and with predicted features , obtained from training a simple maximum likelihood estimation classifier on this resource ( <CITED HERE> ) .19 The first part of Table 8 shows that the RAT ( rationality ) feature is very relevant ( in gold ) , but suffers from low accuracy ( no gains in machine-predicted input ) . Uses val
In our previous work ( <CITED HERE> ; Salloum and Habash , 2012 ) , we applied our approach to tokenized Arabic and our DA-MSA transfer component used feature transfer rules only . CompareOrContrast train
It has already been used to implement a framework for teaching NLP ( <CITED HERE> ) . Extends train
Due to their remarkable ability to incorporate context structure information and long distance reordering into the translation process , tree-based translation models have shown promising progress in improving translation quality ( Liu et al. , 2006 , 2009 ; Quirk et al. , 2005 ; Galley et al. , 2004 , 2006 ; <CITED HERE> ; Shen et al. , 2008 ; Zhang et al. , 2011b ) . Background train
Liu et al. ( 2005 ) , Meral et al. ( 2007 ) , <CITED HERE> , Murphy and Vogel ( 2007 ) and Topkara et al. ( 2006a ) all belong to the syntactic transformation category . Background train
Similarly , the notion of R + M-abduction is spiritually related to the `` abductive inference '' of <CITED HERE> , the `` diagnosis from first principles '' of Reiter ( 1987 ) , `` explainability '' of Poole ( 1988 ) , and the subset principle of Berwick ( 1986 ) . CompareOrContrast train
On the other side , wikis started as collective works where each entry is not owned by a single author e.g. <CITED HERE> . Background val
To address this inconsistency in the correspondence between inflectional features and morphemes , and inspired by Smrž ( 2007 ) , we distinguish between two types of inflectional features : form-based ( a.k.a. surface , or illusory ) features and functional features .6 Most available Arabic NLP tools and resources model morphology using form-based ( `` surface '' ) inflectional features , and do not mark rationality ; this includes the Penn Arabic Treebank ( PATB ) ( <CITED HERE> ) , the Buckwalter morphological analyzer ( Buckwalter 2004 ) , and tools using them such as the Morphological Analysis and Disambiguation for Arabic ( MADA ) toolkit ( Habash and Rambow 2005 ; Habash , Rambow , and Roth 2012 ) . CompareOrContrast train
<CITED HERE> has built a semantic role classifier exploiting the interdependence of semantic roles . Uses val
Others have applied the NLP technologies of near-duplicate detection and topic-based text categorization to politically oriented text ( Yang and Callan , 2005 ; <CITED HERE> ) . Background val
8 It is based on the dataset of <CITED HERE> ,9 which consists of 1000 positive and 1000 negative movie reviews , tokenized and divided into 10 folds ( F0 -- F9 ) . Extends train
In previous work ( <CITED HERE> ) , we described an experimental text-to-speech system that determined prosodic phrasing for the Olive -- Liberman synthesizer ( Olive and Liberman 1985 ) . Extends train
The Unified Medical Language System ( UMLS ) has been developed and maintained by the National Library of Medicine ( NLM ) <CITED HERE> . Background train
The data used in the experiment was selected from the Penn Treebank Wall Street Journal , and is the same used by <CITED HERE> . Uses train
As shown in <CITED HERE> this is a well-motivated convention since it avoids splitting up lexical rules to transfer the specifications that must be preserved for different lexical entries . Motivation train
The reader may consult recent papers on this subject ( e.g. Moens and Steedman 1987 ; <CITED HERE> ) to see what a formal interpretation of events in time might look like . Background train
For example , the ALE parser ( <CITED HERE> ) presupposes a phrase structure backbone which can be used to determine whether a constraint is to be interpreted bottom-up or top-down . Background train
Following Soon et al. ( 2001 ) , we represent SCA as a binary value that indicates whether the induced SCs of the two NPs involved are the same or not . ... use the ACE training data for acquiring our SC classifier ; instead , we use the BBN Entity Type Corpus ( <CITED HERE> ) , which consists of ... Uses train
Following <CITED HERE> , we also compare the performance of our system with a system using features based on the Brown clusters of the word types in a document . Uses train
<CITED HERE> use mutual information to identify collocations , a method they claim is reasonably effective for words with a frequency of not less than five . Background train
We do this with a first-order HMM part-of-speech tagger ( Merialdo <CITED HERE> ) . Uses train
Using the bottom-up , dynamic programming technique ( see the appendix for details ) of computing inside probabilities ( <CITED HERE> ) , we can efficiently compute the probability of the sentence , P ( w | G ) . Uses train
As shown in ( <CITED HERE> ) • The presented research was carried out at the University of Tubingen , Germany , as part of the Sonderforschungsbereich 340 . Background train
According to <CITED HERE> , paragraphs are made up of segments , which in turn are made up of sentences or clauses , which in turn are made up of phrases . Background train
Similarly , the notion of R + M-abduction is spiritually related to the `` abductive inference '' of Reggia ( 1985 ) , the `` diagnosis from first principles '' of <CITED HERE> , `` explainability '' of Poole ( 1988 ) , and the subset principle of Berwick ( 1986 ) . CompareOrContrast train
This choice is motivated by an observation we made previously ( <CITED HERE>a ) : since each post in a sequence is a reply to the preceding post , we could exploit their dependencies by determining their stance labels together . As our sequence learner , we employ a maximum entropy Markov model ( MEMM ) ( McCallum et al. , 2000 ) . Extends train
Over the past decade , researchers at IBM have developed a series of increasingly sophisticated statistical models for machine translation ( <CITED HERE> ; Brown et al. , 1990 ; Brown et al. , 1993a ) . Background train
This may be because pipelines have many engineering advantages , and in practice the sort of problems pointed out by Danlos and other pipeline critics do not seem to be a major problem in current applied NLG systems ( <CITED HERE> ) . Background train
This approach has its roots in Fillmore 's Case Grammar ( 1968 ) , and serves as the foundation for two current large-scale semantic annotation projects : FrameNet ( Baker et al. , 1998 ) and PropBank ( <CITED HERE> ) . Background train
Two exceptions to this generalisation are the Linguistic String Project ( <CITED HERE> ) and the IBM CRITIQUE ( formerly EPISTLE ) Project ( Heidorn et al. , 1982 ; Byrd , 1983 ) ; the former employs a dictionary of approximately 10,000 words , most of which are specialist medical terms , the latter has well over 100,000 entries , gathered from machine readable sources . CompareOrContrast val
They can also shift the dialogue ... 2 The notion of the initiative in this paper is different from that of the dialogue initiative of <CITED HERE> . CompareOrContrast train
The Google n-gram data was collected by Google Research for statistical language modelling , and has been used for many tasks such as lexical disambiguation ( <CITED HERE> ) , and contains English n-grams and their observed frequency counts , for counts of at least 40 . Background train
Within-document coreference resolution has been applied to produce summaries of text surrounding occurrences of the name ( Bagga and Baldwin , 1998 ; <CITED HERE> ) . Background train
To represent the semantics of predicative units , we use FrameNet inventory of frames and frame elements ( C.<CITED HERE> ) . Uses train
A similar effect of surface word frequency was also observed by ( Bertram et al. , 2000 ; Bradley , 1980 ; Burani et al. , 1987 ; Burani et al. , 1984 ; Schreuder et al. , 1997 ; Taft 1975 ; <CITED HERE> ) , where it has been claimed that words having low surface frequency tend to decompose . Background train
These two sets of data were used for automatic dialogue act classification , which was run in the Weka system ( <CITED HERE> ) . Uses train
• The transition probability a is 0.7 using the EM algorithm ( <CITED HERE> ) on the TREC4 ad-hoc query set . Uses train
<CITED HERE> obtain promising results in dialogue act tagging of the Switchboard-DAMSL corpus using lexical , syntactic and prosodic cues , while Gravano and Hirschberg ( 2009 ) examine the relation between particular acoustic and prosodic turn-yielding cues and turn taking in a large corpus of task-oriented dialogues . Background val
( <CITED HERE> ) has found strong correlations between DF , IG and the χ2 statistic for a term . Background train
Despite this , to date , there has been little work on corpus-based approaches to help-desk response automation ( notable exceptions are Carmel , Shtalhaim , and Soffer 2000 ; <CITED HERE> ; Bickel and Scheffer 2004 ; Malik , Subramaniam , and Kaushik 2007 ) . Background train
To quantify the relative strengths of these transitive inferences , <CITED HERE> propose to assign a weight to each link . Background train
While these approaches have been reasonably successful ( see <CITED HERE> ) , Kehler et al. ( 2004 ) speculate that deeper linguistic knowledge needs to be made available to resolvers in order to reach the next level of performance . Background train
Lisp is not particularly well suited for interfacing to complex , structured objects , and it was not our intention to embark on a major effort involving the development of a formal model of a dictionary ( of the style described in , e.g. , <CITED HERE> ) ; on the other hand a method of access was clearly required , which was flexible enough to support a range of applications intending to make use of the LDOCE tape . Background train
We train a 4-gram language model on the Xinhua portion of the English Gigaword corpus using the SRILM Toolkits ( Stolcke , 2002 ) with modified Kneser-Ney smoothing ( <CITED HERE> ) . Uses val
• Learnability ( Zernik and Dyer 1987 ) • Text generation ( <CITED HERE> ; Milosavljevic , Tulloch , and Dale 1996 ) • Speech generation ( Rayner and Carter 1997 ) • Localization ( Schäler 1996 ) Background train
Details of the top performing heuristics of COCKTAIL were reported in ( <CITED HERE> ) . Background train
Representative systems are described in <CITED HERE> , De Mattia and Giachin ( 1989 ) , Niedermair ( 1989 ) , Niemann ( 1990 ) , and Young ( 1989 ) . Background train
In addition , <CITED HERE> note that our Object Raising rule would assign mean to this category incorrectly . CompareOrContrast train
The parallel corpus is word-aligned using GIZA + + ( <CITED HERE> ) . Uses train
tions for the remaining 20 % of the instances ; and ( 3 ) train an SVM classifier ( using the LIBSVM package ( <CITED HERE> ) ) on these 20 % of the instances , where each instance , i , is represented by a set of 31 binary features . Uses val
Our algorithm is similar to the approach taken by <CITED HERE> for inducing PCFG parsers . CompareOrContrast train
Recently , several alternative , often quite sophisticated approaches to collective classification have been proposed ( Neville and Jensen , 2000 ; Lafferty et al. , 2001 ; Getoor et al. , 2002 ; Taskar et al. , 2002 ; Taskar et al. , 2003 ; Taskar et al. , 2004 ; <CITED HERE> ) . Background val
This experiment was again replicated by <CITED HERE> with 10 subjects . Background train
Children use vague adjectives among their first dozens of words ( Peccei 1994 ) and understand some of their intricacies as early as their 24th month ( <CITED HERE> ) . Background train
Other approaches use less deep linguistic resources ( e.g. , POS-tags Stymne ( 2008 ) ) or are ( almost ) knowledge-free ( e.g. , <CITED HERE> ) . CompareOrContrast train
This includes work on question answering ( Wang et al. , 2007 ) , sentiment analysis ( Nakagawa et al. , 2010 ) , MT reordering ( <CITED HERE> ) , and many other tasks . Background train
( 4 ) NE : We use BBN 's IdentiFinder ( <CITED HERE> ) , a MUC-style NE recognizer to determine the NE type of NPZ . Uses train
<CITED HERE> has made the first attempt to work at the level of single semantic roles in order to achieve further improvement . CompareOrContrast train
In particular , since we treat each individual speech within a debate as a single `` document '' , we are considering a version of document-level sentiment-polarity classification , namely , automatically distinguishing between positive and negative documents ( Das and Chen , 2001 ; Pang et al. , 2002 ; <CITED HERE> ; Dave et al. , 2003 ) . Background train
raw length value as a feature , we follow our previous work ( <CITED HERE> ; Wagner et al. , 2014 ) and create multiple features for length using a decision tree ( J48 ) . Extends train
<CITED HERE> used unification in an SMT system to model some of the CompareOrContrast train
Following Ruch et al. ( 2003 ) and <CITED HERE> , we employed Hidden Markov Models to model the discourse structure of MEDLINE abstracts . Uses train
The language grounding problem has received significant attention in recent years , owed in part to the wide availability of data sets ( e.g. Flickr , Von Ahn ( 2006 ) ) , computing power , improved computer vision models ( Oliva and Torralba , 2001 ; Lowe , 2004 ; Farhadi et al. , 2009 ; <CITED HERE> ) and neurological evidence of ties between the language , perceptual and motor systems in the brain ( Pulvermüller et al. , 2005 ; Tettamanti et al. , 2005 ; Aziz-Zadeh et al. , 2006 ) . Background train
Due to their remarkable ability to incorporate context structure information and long distance reordering into the translation process , tree-based translation models have shown promising progress in improving translation quality ( Liu et al. , 2006 , 2009 ; Quirk et al. , 2005 ; Galley et al. , 2004 , 2006 ; Marcu et al. , 2006 ; <CITED HERE> ; Zhang et al. , 2011b ) . Background train
Previous work has argued that initiative affects the degree of control an agent has in the dialogue interaction ( Whittaker and Stenton , 1988 ; <CITED HERE> ; Chu-Carroll and Brown , 1998 ) . Background val
Many provide graphical user interfaces ( GUI ) for manual annotation ( e.g. General Architecture for Text Engineering ( GATE ) ( Cunningham et al. , 1997 ) and the Alembic Workbench ( <CITED HERE> ) ) as well as NLP tools and resources that can be manipulated from the GUI . Background train
A number of speech understanding systems have been developed during the past fifteen years ( Barnett et al. 1980 , Dixon and Martin 1979 , Erman et al. 1980 , Haton and Pierrel 1976 , Lea 1980 , Lowerre and Reddy 1980 , <CITED HERE> , Reddy 1976 , Walker 1978 , and Wolf and Woods 1980 ) . CompareOrContrast train
It allows the construction of a non-TAL ( Shieber , 1994 ) , ( <CITED HERE> ) . Background train
In addition , the advantages of using linguistically annotated data over raw data are well documented ( Mair , 2005 ; <CITED HERE> ) . Background val
In particular , the `` Semantic Information Retrieval '' project ( SIR <CITED HERE> ) systematically investigates the use of lexical-semantic relations between words or concepts for improving the performance of information retrieval systems . Motivation val
Our re-ranking approach , like the approach to parse re-ranking of <CITED HERE> , employs a simpler model -- a local semantic role labeling algorithm -- as a first pass to generate a set of n likely complete assignments of labels to all parse tree nodes . CompareOrContrast train
For instance , the Alembic workbench ( <CITED HERE> ) contains a sentence-splitting module that employs over 100 regular-expression rules written in Flex . Background train
In contrast , a single statistical model allows one to maintain a single table ( <CITED HERE> ) . Background train
Our experiments are based on the multimodal extension of Latent Dirichlet Allocation developed by <CITED HERE> . Uses train
First , it has been noted that in many natural language applications it is sufficient to use shallow parsing information ; information such as noun phrases ( NPs ) and other syntactic sequences have been found useful in many large-scale language processing applications including information extraction and text summarization ( Grishman , 1995 ; <CITED HERE> ) . Background train
<CITED HERE> adopted a Bayesian method to infer an STSG by exploring the space of alignments based on parse trees . CompareOrContrast train
In corpus linguistics building such megacorpora is beyond the scope of individual researchers , and they are not easily accessible ( <CITED HERE> : 56 ) unless the web is used as a corpus ( Kilgarriff and Grefenstette , 2003 ) . Background train
In addition , there are a number of projects under way to develop substantial lexicons from machine readable sources ( see <CITED HERE> for details ) . Background train
We carried out two parallel experiments with two parsers available for Czech , parser I ( Hajič et al. , 1998 ) and parser II ( <CITED HERE> ) . Uses val
Part of speech taggers typically require input in the format of a single sentence per line ( for example Brill 's tagger ( <CITED HERE> ) ) and parsers generally aim to produce a tree spanning each sentence . Background train
The elimination of redundant nondeterminism is based on Unfold/Fold transformation techniques ( <CITED HERE> ) . The unfolding transformation is also referred to as partial execution , for example , by Pereira and Shieber ( 1987 ) . Uses train
Problems such as these have motivated research on more abstract , dependency-based parser evaluation ( e.g. , Lin 1995 ; Carroll , Briscoe , and Sanfilippo 1998 ; Carroll et al. 2002 ; Clark and Hockenmaier 2002 ; <CITED HERE> ; Preiss 2003 ; Kaplan et al. 2004 ; Miyao and Tsujii 2004 ) . Motivation train
This seems to provide additional evidence of <CITED HERE>b ) 's suggestion that something like a distributional hypothesis of images is plausible . CompareOrContrast train
For instance , GATE currently provides a POS tagger , named entity recogniser and gazetteer and ontology editors ( <CITED HERE> ) . Background train
Latent variables we wish to consider are an increased number of word classes ; more flexible regions -- see <CITED HERE> on learning a state transition diagram for acoustic regions in phone recognition -- and phonological features and syllable boundaries . Background train
Since earlier versions of the SNoW based CSCL were used only to identify single phrases ( Punyakanok and Roth , 2001 ; Munoz et al. , 1999 ) and never to identify a collection of several phrases at the same time , as we do here , we also trained and tested it under the exact conditions of CoNLL-2000 ( Tjong Kim <CITED HERE> ) to compare it to other shallow parsers . Uses train
Using the implicit modeling of argument consistency , we follow the same approach as in our previous work ( <CITED HERE> ) and trained a logistic regression model to predict verb alignment based on the features in Table 1 . Extends train
Recent developments in linguistics , and especially on grammatical theory -- for example , Generalised Phrase Structure Grammar ( GPSG ) ( Gazdar et al. , 1985 ) , Lexical Functional Grammar ( LFG ) ( <CITED HERE> ) -- and on natural language parsing frameworks -- for example , Functional Unification Grammar ( FUG ) ( Kay , 1984a ) , PATR-II ( Shieber , 1984 ) -- make it feasible to consider the implementation of efficient systems for the syntactic analysis of substantial fragments of natural language . Background train
We train a 4-gram language model on the Xinhua portion of the English Gigaword corpus using the SRILM Toolkits ( <CITED HERE> ) with modified Kneser-Ney smoothing ( Chen and Goodman , 1998 ) . Uses train
It is only recently that the web name ambiguity has been approached as a separate problem and defined as an NLP task Web People Search on its own ( <CITED HERE> ; Artiles et al. , 2007 ) . Background train
This deficiency is rectified in the verb classification system employed by <CITED HERE> in the Brandeis verb catalogue . CompareOrContrast train
Previous work has argued that initiative affects the degree of control an agent has in the dialogue interaction ( <CITED HERE> ; Walker and Whittaker , 1990 ; Chu-Carroll and Brown , 1998 ) . Background val
1990 ) , on linguistic acquisition ( by the use of Part-of-Speech filters hand-crafted by a linguist ) ( <CITED HERE> ) or , more frequently , on a combination of the two ( Smadja , 1993 ; Kilgarriff and Tugwell , 2001 , for example ) . CompareOrContrast val
It is frequently used in tasks like scene identification , and <CITED HERE> shows that distance in GIST space correlates well with semantic distance in WordNet . Motivation train
Because each rule r consists of a target tree fragment frag and a source string str in the model , we follow <CITED HERE> and decompose the prior probability P0 ( r | N ) into two factors as follows : Uses train
To address this inconsistency in the correspondence between inflectional features and morphemes , and inspired by Smrž ( 2007 ) , we distinguish between two types of inflectional features : form-based ( a.k.a. surface , or illusory ) features and functional features . Most available Arabic NLP tools and resources model morphology using form-based ( `` surface '' ) inflectional features , and do not mark rationality ; this includes the Penn Arabic Treebank ( PATB ) ( Maamouri et al. 2004 ) , the Buckwalter morphological analyzer ( Buckwalter 2004 ) , and tools using them such as the Morphological Analysis and Disambiguation for Arabic ( MADA ) toolkit ( <CITED HERE> ; Habash , Rambow , and Roth 2012 ) . CompareOrContrast train
In our prior work ( <CITED HERE> ) , we examined whether techniques used for predicting the helpfulness of product reviews ( Kim et al. , 2006 ) could be tailored to our peer-review domain , where the definition of helpfulness is largely influenced by the educational context of peer review . Extends train
Previous sentiment-analysis work in different domains has considered inter-document similarity ( <CITED HERE> ; Pang and Lee , 2005 ; Goldberg and Zhu , 2006 ) or explicit Background train
<CITED HERE>a ) show how a BoVW model may be easily combined with a distributional vector space model of language using only vector concatenation . Background train
The following are the various tag sets we use in this article : ( a ) the core POS tag sets CORE44 and the newly introduced CORE12 ; ( b ) CATiB Treebank tag set ( CATIB6 ) ( <CITED HERE> ) and its newly introduced extension of CATIBEX created using simple regular expressions on word form , indicating particular morphemes such as the prefix Al + or the suffix + wn ; this tag set is the best-performing tag set for Arabic on predicted values as reported in Section 4 ; ( c ) the PATB full tag set with complete morphological tag ( BW ) ( Buckwalter 2004 ) ; and two extensions of the PATB reduced tag set ( PENN POS , a.k.a. RTS , size 24 [ Diab , Hacioglu , and Jurafsky 2004 ] ) , both outperforming it : ( d ) Kulick , Gabbard , and Marcus ( 2006 ) 's tag set ( KULICK ) , size 43 , one of whose most important extensions is the marking of the definite article clitic , and ( e ) Diab and Benajiba 's ( in preparation ) EXTENDED RTS tag set ( ERTS ) , which marks gender , number , and definiteness , size 134 . Uses val
The numeral ( whether it is implicit , as in ( 3 ) , or explicit ) can be construed as allowing the reader to draw inferences about the standards employed ( <CITED HERE> ; DeVault and Stone 2004 ) : ( 3 ) , for example , implies a standard that counts 10 cm as large and 8 cm as not large . Background train
Agreement between two annotation sets is calculated here in terms of Cohen 's kappa ( <CITED HERE> ) and corrected kappa ( Brennan and Prediger , 1981 ) . Uses train
On the WSJ corpus our system performed slightly better than the combination of the Alembic and SATZ systems described in <CITED HERE> ( 0.44 % vs. 0.5 % error rate ) . CompareOrContrast train
<CITED HERE> studied the issue of disambiguation for mono-lingual M. Background train
Recently , several alternative , often quite sophisticated approaches to collective classification have been proposed ( Neville and Jensen , 2000 ; <CITED HERE> ; Getoor et al. , 2002 ; Taskar et al. , 2002 ; Taskar et al. , 2003 ; Taskar et al. , 2004 ; McCallum and Wellner , 2004 ) . Background train
To address this inconsistency in the correspondence between inflectional features and morphemes , and inspired by Smrž ( 2007 ) , we distinguish between two types of inflectional features : form-based ( a.k.a. surface , or illusory ) features and functional features . Most available Arabic NLP tools and resources model morphology using form-based ( `` surface '' ) inflectional features , and do not mark rationality ; this includes the Penn Arabic Treebank ( PATB ) ( Maamouri et al. 2004 ) , the Buckwalter morphological analyzer ( <CITED HERE> ) , and tools using them such as the Morphological Analysis and Disambiguation for Arabic ( MADA ) toolkit ( Habash and Rambow 2005 ; Habash , Rambow , and Roth 2012 ) . CompareOrContrast val
Although in this paper we take modus ponens as the main rule of inference , in general one can consider deductive closures with respect to weaker , nonstandard logics ( cf. Levesque 1984 ; <CITED HERE> ; Patel-Schneider 1985 ) . CompareOrContrast train
Riehemann 1993 ; Oliva 1994 ; Frank 1994 ; <CITED HERE> ; Sanfilippo 1995 ) . CompareOrContrast train
Berger et al. 2000 ; Jijkoun and de Rijke 2005 ; <CITED HERE> ) . CompareOrContrast train
We will examine the worst-case complexity of interpretation as well as generation to shed some light on the hypothesis that vague descriptions are more difficult to process than others because they involve a comparison between objects ( <CITED HERE> , Krahmer and Theune 2002 ) . Background train
A number of proposals in the 1990s deliberately limited the extent to which they relied on domain and/or linguistic knowledge and reported promising results in knowledge-poor operational environments ( Dagan and Itai 1990 , 1991 ; Lappin and Leass 1994 ; Nasukawa 1994 ; <CITED HERE> ; Williams , Harvey , and Preston 1996 ; Baldwin 1997 ; Mitkov 1996 , 1998b ) . Background train
From the Meaning-Text Theory ( MTT ) point of view , Natural Language ( NL ) is considered as a correspondence between meanings and texts ( <CITED HERE> ) . Background train
This means that natural language expressions such as `` A is B , '' `` A is the same as B , '' etc. are not directly represented by logical equality ; similarly , `` not '' is often not treated as logical negation ; cf. <CITED HERE> . CompareOrContrast train
The full-listing model claims that polymorphic words are represented as a whole in the human mental lexicon ( <CITED HERE> ; Butterworth , 1983 ) . Background train
As for work on Arabic ( MSA ) , results have been reported on the PATB ( Kulick , Gabbard , and Marcus 2006 ; Diab 2007 ; Green and Manning 2010 ) , the Prague Dependency Treebank ( PADT ) ( <CITED HERE> ; Nivre 2008 ) and the CATiB ( Habash and Roth 2009 ) . Background train
However , learning-based resolvers have not been able to benefit from having an SC agreement feature , presumably because the method used to compute the SC of an NP is too simplistic : while the SC of a proper name is computed fairly accurately using a named entity ( NE ) recognizer , many resolvers simply assign to a common noun the first ( i.e. , most frequent ) WordNet sense as its SC ( e.g. , Soon et al. ( 2001 ) , <CITED HERE> ) . Background train
Thus , over the past few years , along with advances in the use of learning and statistical methods for acquisition of full parsers ( Collins , 1997 ; Charniak , 1997a ; Charniak , 1997b ; Ratnaparkhi , 1997 ) , significant progress has been made on the use of statistical learning methods to recognize shallow parsing patterns -- syntactic phrases or words that participate in a syntactic relationship ( Church , 1988 ; Ramshaw and Marcus , 1995 ; Argamon et al. , 1998 ; Cardie and Pierce , 1998 ; <CITED HERE> ; Punyakanok and Roth , 2001 ; Buchholz et al. , 1999 ; Tjong Kim Sang and Buchholz , 2000 ) . Background train
The ability to explicitly identify these sections in unstructured text could play an important role in applications such as document summarization ( <CITED HERE> ) , information retrieval ( Tbahriti et al. , 2005 ) , information extraction ( Mizuta et al. , 2005 ) , and question answering . Background train
Many statistical parsers ( Ratnaparkhi , 1999 ; Collins , 1999 ; Charniak , 2001 ) are based on a history-based probability model ( <CITED HERE> ) , where the probability of each decision in a parse is conditioned on the previous decisions in the parse . Background train
For example , such a schema can serve as a means to represent translation examples , or find structural correspondences for the purpose of transfer grammar learning ( Menezes & Richardson , 2001 ) , ( Aramaki et al. , 2001 ) , ( Watanabe et al. , 2000 ) , ( Meyers et al. , 2000 ) , ( Matsumoto et al. , 1993 ) , ( Kaji et al. , 1992 ) , and example-based machine translation ( EBMT ) ( <CITED HERE> ) , ( Sato , 1991 ) , ( Richardson et al. , 2001 ) , ( Al-Adhaileh & Tang , 1999 ) . Background train
For instance , ( <CITED HERE> ) acquire two-argument templates ( inference rules ) from corpora using an extended version of the distributional analysis in which paths in dependency trees that have similar arguments are taken to be close in meaning . Background train
Per-state joint normalization ( <CITED HERE>b , § 8.2 ) is similar but drops the dependence on a . CompareOrContrast train
Other studies on the value of disambiguation for cross-lingual IR include Hiemstra and de Jong , 1999 ; <CITED HERE> . Background train
We have not yet made use of TINA 's probabilities in adjusting the recognizer scores on the fly , but we have been able to incorporate linguistic scores to re-sort N-best outputs , giving a significant improvement in performance ( <CITED HERE> ) . Uses train
These features are carefully designed to reduce the data sparseness problem and some of them are inspired by previous work ( He et al. , 2008 ; Gimpel and Smith , 2008 ; <CITED HERE> ; Chiang et al. , 2009 ; Setiawan et al. , 2009 ; Shen et al. , 2009 ; Xiong et al. , 2009 ) : 1 . Motivation train
Nevertheless , the full document text is present in most systems , sometimes as the only feature ( Sugiyama and Okumura , 2007 ) and sometimes in combination with others ; see for instance ( <CITED HERE> ; Popescu and Magnini , 2007 ) . Background train
More recently , ( <CITED HERE> ) has performed a good survey of document categorization ; recent works can also be found in ( Joachims , 2002 ) , ( Crammer and Singer , 2003 ) , and ( Lewis et al. , 2004 ) . Background train
McDonald has even argued for extending the model to a large number of components ( McDonald 1988 ) , and several systems have indeed added an additional component between the planner and the linguistic component ( <CITED HERE> ; Panaget 1994 ; Wanner 1994 ) . Background train
It is these orthographic variations and complex morphological structure that make Arabic language processing challenging ( Xu et al. , 2001 ; <CITED HERE> ) . Background train
A number of speech understanding systems have been developed during the past fifteen years ( Barnett et al. 1980 , Dixon and Martin 1979 , <CITED HERE> , Haton and Pierrel 1976 , Lea 1980 , Lowerre and Reddy 1980 , Medress 1980 , Reddy 1976 , Walker 1978 , and Wolf and Woods 1980 ) . CompareOrContrast train
It has been more difficult showing that agreement morphology helps parsing , however , with negative results for dependency parsing in several languages ( Eryigit , Nivre , and Oflazer 2008 ; Nivre , Boguslavsky , and Iomdin 2008 ; <CITED HERE> ) . Motivation train
Some efforts have tackled tasks such as automatic image caption generation ( <CITED HERE>a ; Ordonez et al. , 2011 ) , text illustration ( Joshi et al. , 2006 ) , or automatic location identification of Twitter users ( Eisenstein et al. , 2010 ; Wing and Baldridge , 2011 ; Roller et al. , 2012 ) . Background train
This idea was proposed by Krauwer and des Tombe ( 1981 ) , Langendoen and Langsam ( 1987 ) , and Pulman ( 1986 ) , and was rediscovered by Black ( 1989 ) and recently by <CITED HERE> . Background train
A nonprobabilistic approach for DA labeling proposed by Samuel , Carberry , and Vijay-Shanker ( 1998 ) is transformation-based learning ( <CITED HERE> ) . CompareOrContrast train
However , since work in this direction has started , significant progress has also been made in the research on statistical learning of full parsers , both in terms of accuracy and processing time ( Charniak , 1997b ; Charniak , 1997a ; Collins , 1997 ; <CITED HERE> ) . Background val
Some methods are based on likelihood ( Och and Ney , 2002 ; Blunsom et al. , 2008 ) , error rate ( Och , 2003 ; Zhao and Chen , 2009 ; Pauls et al. , 2009 ; <CITED HERE> ) , margin ( Watanabe et al. , 2007 ; Chiang et al. , 2008 ) and ranking ( Hopkins and May , 2011 ) , among which minimum error rate training ( MERT ) ( Och , 2003 ) is the most popular one . Background train
The algorithm we implemented is inspired by the work of <CITED HERE> on word sense disambiguation . Motivation val
Various approaches for computing semantic relatedness of words or concepts have been proposed , e.g. dictionary-based ( Lesk , 1986 ) , ontology-based ( Wu and Palmer , 1994 ; Leacock and Chodorow , 1998 ) , information-based ( Resnik , 1995 ; Jiang and Conrath , 1997 ) or distributional ( <CITED HERE> ) . Background train
EM maximizes G ( θ ) via block-coordinate ascent on a lower bound F ( q , θ ) using an auxiliary distribution over the latent variables q ( z | x , y ) ( <CITED HERE> ) : Uses train
It has been argued that generating such modifiers is not a trivial decision because it interferes with the planning of both local and global coherence ( in the sense of ( Grosz and Sidner , 1986 ) ) ( <CITED HERE>a ) . Background train
• Only qualitative observations of the responses were reported ( no formal evaluation was performed ) ( <CITED HERE> ; Roy and Subramaniam 2006 ) . CompareOrContrast train
A stops B from doing something ; A disagrees with B on something , 8 % and 12 % ) Note that in our original work ( <CITED HERE> ) , only development data were used to show some initial observations . CompareOrContrast train
We evaluated on the English CCGBank ( Hockenmaier and Steedman , 2007 ) , which is a transformation of the Penn Treebank ( <CITED HERE> ) ; the CTBCCG ( Tse and Curran , 2010 ) transformation of the Penn Chinese Treebank ( Xue et al. , 2005 ) ; and the CCG-TUT corpus ( Bos et al. , 2009 ) , built from the TUT corpus of Italian text ( Bosco et al. , 2000 ) . Uses train
The EM algorithm ( <CITED HERE> ) can maximize these functions . Uses train
Finally , feedback expressions ( head nods and shakes ) are successfully predicted from speech , prosody and eye gaze in interaction with Embodied Communication Agents as well as human communication ( Fujie et al. , 2004 ; Morency et al. , 2005 ; <CITED HERE> ; Morency et al. , 2009 ) . Background train
For these or for a specific domain , basic synonymic dictionaries can be complemented using learning methods based on distributional similarity ( Pereira et al. , 1993 ; <CITED HERE> ) . Future train
For projective parsing , it is significantly faster than exact dynamic programming , at the cost of small amounts of search error . We are interested in extending these ideas to phrase-structure and lattice parsing , and in trying other higher-order features , such as those used in parse reranking ( Charniak and Johnson , 2005 ; Huang , 2008 ) and history-based parsing ( <CITED HERE> ) . Future train
The coreference system is similar to the Bell tree algorithm as described by ( <CITED HERE> ) . CompareOrContrast train
Other representations use the link structure ( Malin , 2005 ) or generate graph representations of the extracted features ( <CITED HERE> ) . Background train
The research described below is taking place in the context of three collaborative projects ( Boguraev , 1987 ; <CITED HERE> ; Phillips and Thompson , 1986 ) to develop a general-purpose , wide coverage morphological and syntactic analyser for English . Background train
We experiment with four learners commonly employed in language learning : Decision List ( DL ) : We use the DL learner as described in <CITED HERE> , motivated by its success in the related tasks of word sense disambiguation ( Yarowsky , 1995 ) and NE classification ( Collins and Singer , 1999 ) . Uses train
transition-based dependency parsing framework ( Nivre , 2008 ) using an arc-eager transition strategy and are trained using the perceptron algorithm as in <CITED HERE> with a beam size of 8 . Uses train
Using the basic solution proposed by ( <CITED HERE> ) as a term of comparison , we experiment with different sources of multilingual lexical knowledge to address the following questions : ( 1 ) What is the potential of the existing multilingual lexical resources to approach CLTE ? CompareOrContrast train
Acoustic models for HTK are trained with the continuous speech database of the Acoustical Society of Japan ( <CITED HERE> ) . Uses train
Automatic text categorization has been used in search engines , digital library systems , and document management systems ( <CITED HERE> ) . Background train
For our Text modality , we use deWaC , a large German web corpus created by the WaCKy group ( <CITED HERE> ) containing approximately 1.7 B word tokens . Uses train
Unless very high rates of misspellings are to be expected ( this explains the favorable results for trigram indexing in ( <CITED HERE> ) ) one can not really recommend this method . CompareOrContrast val
For English , there is for instance the 15 year old Hewlett-Packard test suite , a simple text file listing test sentences and grouping them according to linguistic phenomena ( <CITED HERE> ) ; and more recently , the much more sophisticated TSNLP ( Test Suite for Natural Language Processing ) which includes some 9500 test items for English , French and German , each of them being annotated with syntactic and application related information ( Oepen and Flickinger , 1998 ) . CompareOrContrast val
Previous work on Chinese SRL mainly focused on how to transplant the machine learning methods which have been successful with English , such as <CITED HERE> , Xue and Palmer ( 2005 ) and Xue ( 2008 ) . Background train
A similar method is included in PATR-II ( <CITED HERE> ) and can be used to encode lexical rules as binary relations in the CUF system ( Dörre and Eisele 1991 ; Dörre and Dorna 1993b ) or the TFS system ( Emele and Zajac 1990 ; Emele 1994 ) . CompareOrContrast train
Our training examples are similar to the data created for pseudodisambiguation , the usual evaluation task for SP models ( Erk , 2007 ; <CITED HERE> ; Rooth et al. , 1999 ) . CompareOrContrast train
With the use of computers in storing the explosive amount of biological information , natural language processing ( NLP ) approaches have been explored to make the task of managing information recorded in free text more feasible <CITED HERE> . Background val
A formula φ of L ( = ) , the language with equality , is weakly R + M-abductible from an object theory T , denoted by T ⊢R+M φ , iff there exists a partial theory T ∈ PT ( T ) and a preferred model M ∈ PM ( T ) such that M ⊨ φ , i.e. φ is true in at least one preferred model of the partial theory T. Note : The notions of strong provability and strong R + M-abduction can be introduced by replacing `` there exists '' by `` all '' in the above definitions ( cf. <CITED HERE>b ) . CompareOrContrast train
The first version ( TIMIT ) was developed for the 450 phonetically rich sentences of the TIMIT database ( <CITED HERE> ) . Uses val
A third problem arises with the approach to the semantics of QLFs that this notion of the relationship between QLF and RQLF encourages one to adopt : it is that taken by <CITED HERE> . CompareOrContrast train
In ( <CITED HERE> ) , I present evidence from Mandarin Chinese that this analysis is on the right track . Extends train
Interaction between components is coordinated by the dialogue manager which uses the information-state approach ( <CITED HERE> ) . Uses val
Other molecular biology databases We also included several model organism databases or nomenclature databases in the construction of the dictionary , i.e. , mouse -- Mouse Genome Database ( MGD ) [ 18 ] , fly -- FlyBase [ 19 ] , yeast -- Saccharomyces Genome Database ( SGD ) [ 20 ] , rat -- Rat Genome Database ( RGD ) [ 21 ] , worm -- WormBase [ 22 ] , Human Nomenclature Database ( HUGO ) [ 23 ] , Online Mendelian Inheritance in Man ( OMIM ) <CITED HERE> , and Enzyme Nomenclature Database ( ECNUM ) [ 25 , 26 ] . Uses train
There are many more distinctions which are conveyed by the conjunction of grammar codes and word qualifiers ( see <CITED HERE> , for further details ) . Background train
The most common way is to divide each half of the bitext into an equal number of segments and to align the segments so that each pair of segments Si and Ti are translations of each other ( Gale & Church , 1991 ; <CITED HERE>a ) . Background val
<CITED HERE> ) . Future train
<CITED HERE> proposes readjustment rules similar to those of Chomsky and Halle , but he claims that the readjustment of structure is part of the grammar , not part of the performance model . Background train
financial news , we created a probabilistic Czech-English dictionary by running GIZA++ training ( translation models 1-4 , see <CITED HERE> ) on the training part of the English-Czech WSJ parallel corpus extended by the parallel corpus of entry/translation pairs from the manual dictionary . Uses train
In the field of machine learning research , incremental training has been employed in the work ( <CITED HERE> ; Shilton et al. , 2005 ) , but there is little work for tuning parameters of statistical machine translation . Background train
A similar method is included in PATR-II ( Shieber et al. 1983 ) and can be used to encode lexical rules as binary relations in the CUF system ( Dörre and Eisele 1991 ; Dörre and Dorna 1993b ) or the TFS system ( Emele and Zajac 1990 ; <CITED HERE> ) . CompareOrContrast train
In a log-linear parameterization , for example , a prior that penalizes feature strengths far from 1 can be used to do feature selection and avoid overfitting ( <CITED HERE> ) . Uses train
We report performance in terms of two metrics : ( 1 ) the F-measure score as computed by the commonly-used MUC scorer ( <CITED HERE> ) , and ( 2 ) the accuracy on the anaphoric references , computed as the fraction of anaphoric references correctly resolved . Uses train
A study of the query log of the AllTheWeb and Altavista search sites gives an idea of the relevance of the people search task : 11-17 % of the queries were composed of a person name with additional terms and 4 % were identified as person names ( <CITED HERE> ) . Background val
This was done because purely unsupervised techniques ( e.g. , Baum-Welch [ Baum and Petrie 1966 ] or Brill 's [ <CITED HERE>b ] ) enable regularities to be induced for word classes which contain many entries , exploiting the fact that individual words that belong to a POS class occur in different ambiguity patterns . CompareOrContrast train
Future research should apply the work of <CITED HERE> and Blunsom and Osborne ( 2008 ) , who marginalize over derivations to find the most probable translation rather than the most probable derivation , to these multi-nonterminal grammars . Future train
In informal experiments described elsewhere ( <CITED HERE> ) , I found that the G2 statistic suggested by Dunning ( 1993 ) slightly outperforms φ2 . Background train
porating these two KSs into our resolver : they can each be represented as a constraint or as a feature , and they can be applied to the resolver in isolation Following <CITED HERE> , we select as the antecedent of each NP , NPS , the closest preceding NP that is classified as coreferent with NPS . Motivation train
Using the section labels , the HMM was trained using the HTK toolkit ( <CITED HERE> ) , which efficiently performs the forward-backward algorithm and Baum-Welch estimation . Uses train
In this paper , we use TF-IDF ( a kind of augmented DF ) as a feature selection criterion , in order to ensure results are comparable with those in ( <CITED HERE> ) . CompareOrContrast train
A variety of statistical methods were proposed over the recent years for learning to produce a full parse of free-text sentences ( e.g. , Bod ( 1992 ) , Magerman ( 1995 ) , Collins ( 1997 ) , <CITED HERE> , and Sekine ( 1998 ) ) . Background train
There is a rich literature on organization and lexical access of morphologically complex words where experiments have been conducted mainly for derivational suffixed words of English , Hebrew , Italian , French , Dutch , and few other languages ( Marslen-Wilson et al. , 2008 ; Frost et al. , 1997 ; <CITED HERE> ; Drews and Zwitserlood , 1995 ) . Background train
In the disambiguation of capitalized words , the most widespread method is POS tagging , which achieves about a 3 % error rate on the Brown corpus and a 5 % error rate on the WSJ corpus , as reported in <CITED HERE> . CompareOrContrast train
The problem with this approach is that any threshold is , to some extent , arbitrary , and there is evidence to suggest that , for some tasks , low counts are important ( <CITED HERE> ) . Motivation train
<CITED HERE> further labeled the SCFG rules with POS tags and unsupervised word classes . CompareOrContrast train
Steganography is concerned with hiding information in some cover medium , by manipulating properties of the medium in such a way that the hidden information is not easily detectable by an observer ( <CITED HERE> ) . Background val
We offer a theorem that highlights the broad applicability of these modeling techniques . If f ( input , output ) is a weighted regular relation , then the following statements are equivalent : ( 1 ) f is a joint probabilistic relation ; ( 2 ) f can be computed by a Markovian FST that halts with probability 1 ; ( 3 ) f can be expressed as a probabilistic regexp , i.e. , a regexp built up from atomic expressions a : b ( for a ∈ Σ ∪ { ε } , b ∈ Δ ∪ { ε } ) using concatenation , probabilistic union +p , and probabilistic closure *p . For defining conditional relations , a good regexp language is unknown to us , but they can be defined in several other ways : ( 1 ) via FSTs as in Fig. 1c , ( 2 ) by compilation of weighted rewrite rules ( Mohri and Sproat , 1996 ) , ( 3 ) by compilation of decision trees ( <CITED HERE> ) , ( 4 ) as a relation that performs contextual left-to-right replacement of input substrings by a smaller conditional relation ( Gerdemann and van Noord , 1999 ) , ( 5 ) by conditionalization of a joint relation as discussed below . Background train
Word pairs containing polysemous words are expanded to concept pairs using GermaNet ( <CITED HERE> ) , the German equivalent to WordNet , as a sense inventory for each word . Uses train
Using the initial target U-trees , source sentences and word alignment , we extract minimal GHKM translation rules7 in terms of frontier nodes ( <CITED HERE> ) . Uses train
For example , while it is difficult to induce a grammar with raw text alone , the task is tractable when the syntactic analysis for each sentence is provided as a part of the training data ( <CITED HERE> ) . Background train
Our results are lower than those of full parsers , e.g. , <CITED HERE> as might be expected since much less structural data , and no lexical data are being used . CompareOrContrast train
Our work is more similar to NLG work that concentrates on structural constraints such as generative poetry ( Greene et al. , 2010 ) ( Colton et al. , 2012 ) ( Jiang and Zhou , 2008 ) or song lyrics ( Wu et al. , 2013 ) ( Ramakrishnan <CITED HERE> ) , where specified meter or rhyme schemes are enforced . CompareOrContrast val
Hermann and Deutsch ( 1976 ; also reported in <CITED HERE> ) show that greater differences are most likely to be chosen , presumably because they are more striking . Background train
The result holds for both the MaltParser ( <CITED HERE> ) and the Easy-First Parser ( Goldberg and Elhadad 2010 ) . Uses train
This system has been successfully tested with the development of plug-ins supporting instant messaging , distributed video encoding ( <CITED HERE> ) , distributed virtual worlds ( Hughes et al. , 2005 ) and digital library management ( Walkerdine and Rayson , 2004 ) . Background train
In modern syntactic theories ( e.g. , lexical-functional grammar [ LFG ] [ <CITED HERE> ; Bresnan 2001 ; Dalrymple 2001 ] , head-driven phrase structure grammar [ HPSG ] [ Pollard and Sag 1994 ] , tree-adjoining grammar [ TAG ] [ Joshi 1988 ] , and combinatory categorial grammar [ CCG ] [ Ades and Steedman 1982 ] ) , the lexicon is the central repository for much morphological , syntactic , and semantic information . Uses train
We then use Illinois Chunker ( Punyakanok and Roth , 2001 ) 6 to extract more noun phrases from the text and employ Collins head rules ( <CITED HERE> ) to identify their heads . Uses train
<CITED HERE> argues CV formations in Hindi and Urdu are either morphological or syntactical and their formation takes place at the argument structure . Background train
feature Cohen 's k corrected k agreement 73.59 98.74 dial act 84.53 98.87 turn 73.52 99.16 Table 2 : Inter-coder agreement on feedback expression annotation Although researchers do not totally agree on how to measure agreement in various types of annotated data and on how to interpret the resulting figures , see <CITED HERE> , it is usually assumed that Cohen 's kappa figures over 60 are good while those over 75 are excellent ( Fleiss , 1971 ) . Background val
Collins 1996 , Charniak 1997 , Collins 1999 and <CITED HERE> ) . CompareOrContrast train
Two formalizations of lexical rules as used by HPSG linguists have been proposed , the meta-level lexical rules ( MLRs ; Calcagno 1995 ; <CITED HERE> ) and the . Background train
We ran TreeTagger ( Schmid , 1994 ) for tokenization , and used Giza++ ( <CITED HERE> ) to align the tokenized corpora at the word level . Uses train
Other milestones of recent research include the deployment of probabilistic and machine learning techniques ( Aone and Bennett 1995 ; Kehler 1997 ; Ge , Hale , and Charniak 1998 ; Cardie and Wagstaff 1999 ) ; the continuing interest in centering , used either in original or in revised form ( Abracos and Lopes 1994 ; <CITED HERE> ; Hahn and Strube 1997 ; Tetreault 1999 ) ; and proposals related to the evaluation methodology in anaphora resolution ( Mitkov 1998a , 2001b ) . Background train
We then go on to compare the current approach with that of some other theories with similar aims : the `` standard '' version of quasi-logical form implemented in the Core Language Engine , as rationally reconstructed by <CITED HERE> and Crouch and Pulman ( 1994 ) ; underspecified Discourse Representation Theory ( Reyle 1993 ) ; and the `` glue language '' approach of Dalrymple et al. ( 1996 ) . CompareOrContrast train
Note that this ensures that greater importance is attributed to longer chunks , as is usual in most EBMT systems ( cf. <CITED HERE> ; Veale and Way 1997 ; Carl 1999 ) . As an example , consider the translation into French of the house collapsed . Background train
Using WordNet , annotating the sem feature of an adjective involves first choosing the correct sense for the adjective 2Some descriptions of int modifiers can be found in ( <CITED HERE>b ) . Background val
One common approach is using Machine Translation ( MT ) to translate the queries to the language of the documents or translate documents to the language of the queries ( Gey et al , 1999 ; <CITED HERE> ) . CompareOrContrast train
Adding selectional restrictions ( semantic feature information , <CITED HERE> ) does not solve the problem , because isolated features offer only part of the background knowledge necessary for reference disambiguation . Background val
Software engineering research on Generative Programming ( <CITED HERE> ) attempts to solve these problems by focusing on the development of configurable elementary components and knowledge to combine these components into complete systems . Background train
Both use the evaluation software and triple encoding presented in <CITED HERE> . Uses val
In this Section , we will describe some example cases , which are drawn from the problem of using synchronous formalisms to define translations between languages ( e.g. <CITED HERE> cases ) . Background train
Some examples include text categorization ( Lewis and Catlett 1994 ) , base noun phrase chunking ( <CITED HERE> ) , part-of-speech tagging ( Engelson and Dagan 1996 ) , spelling confusion set disambiguation ( Banko and Brill 2001 ) , and word sense disambiguation ( Fujii et al. 1998 ) . Background train
Most coreference resolution work simply mentions it in passing as a module in the pipelined system ( <CITED HERE> ; Durrett and Klein , 2013 ; Lee et al. , 2011 ; Björkelund and Kuhn , 2014 ) . Background train
Because the judges do not evaluate the same cases , we could not employ standard inter-annotator agreement measures ( <CITED HERE> ) . Uses train
Related are also the studies by Rieks op den Akker and Schulz ( 2008 ) and <CITED HERE> : both achieve promising results in the automatic segmentation of dialogue acts using the annotations in a large multimodal corpus . Background train
Due to advances in statistical syntactic parsing techniques ( Collins , 1997 ; <CITED HERE> ) , attention has recently shifted towards the harder question of analyzing the meaning of natural language sentences . Background train
The reader may consult recent papers on this subject ( e.g. <CITED HERE> ; Webber 1987 ) to see what a formal interpretation of events in time might look like . Background train
Because it is also consistent , it will be chosen as a best interpretation of S , ( cf. <CITED HERE>a , 1987b ) . Background val
For our experiments we used the standard division of the WSJ ( <CITED HERE> ) , with sections 2 through 21 for training ( approx . Uses train
With respect to this , we apply the different priming and other lexical decision experiments , described in literature ( <CITED HERE> ; Bentin , S. and Feldman , 1990 ) specifically for derivationally suffixed polymorphemic words and compound verbs of Bangla . Uses train
From this description , it should be clear that TM systems do not translate : Indeed , some researchers consider them to be little more than a search-and-replace engine , albeit a rather sophisticated one ( <CITED HERE> ) . Background train
Other milestones of recent research include the deployment of probabilistic and machine learning techniques ( Aone and Bennett 1995 ; <CITED HERE> ; Ge , Hale , and Charniak 1998 ; Cardie and Wagstaff 1999 ) ; the continuing interest in centering , used either in original or in revised form ( Abracos and Lopes 1994 ; Strube and Hahn 1996 ; Hahn and Strube 1997 ; Tetreault 1999 ) ; and proposals related to the evaluation methodology in anaphora resolution ( Mitkov 1998a , 2001b ) . Background val
Our group has developed a wide-coverage HPSG grammar for Japanese ( Mitsuishi et al. , 1998 ) , which is used in a high-accuracy Japanese dependency analyzer ( <CITED HERE> ) . Background train
There are several variations of such a method ( Ballesteros and Croft , 1998 ; <CITED HERE> ; Hull 1997 ) . CompareOrContrast val
This evaluation set-up is an improvement versus the one we previously reported ( <CITED HERE> ) , in which fixed partitions were used for training , development , and testing . Extends train
<CITED HERE> compared the performance of NEs versus BoW features . Background train
The right-side context of a non-terminal category -- the probability of generating a category to the right of the current constituent 's category -- corresponds directly to the category transitions used for the HMM supertagger of <CITED HERE> . CompareOrContrast train
The starting point for the approach followed here was a dissatisfaction with certain aspects of the theory of quasi-logical form as described in <CITED HERE> , 1992 ) , and implemented in SRI 's Core Language Engine ( CLE ) . CompareOrContrast train
However , the method we are currently using in the ATIS domain ( <CITED HERE> ) represents our most promising approach to this problem . Future train
Three UniRef tables ( UniRef100 , UniRef90 and UniRef50 ) are available for download : UniRef100 combines identical sequences and sub-fragments into a single UniRef entry ; and UniRef90 and UniRef50 are built by clustering UniRef100 sequences into clusters based on the CD-HIT algorithm <CITED HERE> such that each cluster is composed of sequences that have at least 90 % or 50 % sequence similarity , respectively , to the representative sequence . Uses train
<CITED HERE> , Charniak 1997 , Collins 1999 and Charniak 2000 ) . CompareOrContrast train
To name a few examples , Rohrbach et al. ( 2010 ) and Socher et al. ( 2013 ) show how semantic information from text can be used to improve zero-shot classification ( i.e. , classifying never-before-seen objects ) , and <CITED HERE> show that verb clusters can be used to improve activity recognition in videos . Background train
The computational treatment of lexical rules as covariation in lexical entries was implemented in Prolog by the authors in cooperation with Dieter Martini for the ConTroll system ( Gerdemann and <CITED HERE> ; Götz and Meurers 1997a ) . Uses train
Arabic has two kinds of plurals : broken plurals and sound plurals ( <CITED HERE> ; Chen and Gey , 2002 ) . Background train
Although there are other discussions of the paragraph as a central element of discourse ( e.g. Chafe 1979 , Halliday and Hasan 1976 , <CITED HERE> , Haberlandt et al. 1980 ) , all of them share a certain limitation in their formal techniques for analyzing paragraph structure . CompareOrContrast train
To prove that our method is effective , we also make a comparison between the performances of our system and Xue and Palmer ( 2005 ) , <CITED HERE> . CompareOrContrast train
But while Bod 's estimator obtains state-of-the-art results on the WSJ , comparable to <CITED HERE> and Collins ( 2000 ) , Bonnema et al. 's estimator performs worse and is comparable to Collins ( 1996 ) . Background train
Our plan is to implement a windowed or moving-average version of BLEU as in ( <CITED HERE> ) . Future train
As <CITED HERE> rightly pointed out , however , `` Proper nouns and capitalized words are particularly problematic : some capitalized words are proper nouns and some are not . CompareOrContrast train
The recent great advances in speech and language technologies have made it possible to build fully implemented spoken dialogue systems ( Aust et al. , 1995 ; Allen et al. , 1996 ; Zue et al. , 2000 ; <CITED HERE> ) . Background train
For instance , part of the ACE Phase 2 evaluation involves classifying an NP as PERSON , ORGANIZATION , GPE ( a geographical-political re also adopted a corpus-based approach to SC determination that is investigated as part of the mention detection ( MD ) task ( e.g. , <CITED HERE> ) . CompareOrContrast train
In most cases , the accuracy of parsers degrades when run on out-of-domain data ( Gildea , 2001 ; McClosky et al. , 2006 ; <CITED HERE> ; Petrov et al. , 2010 ) . Background train
Based on this assumption , the problem of identifying mention heads is a sequential phrase identification problem , and we choose to employ the BILOU-representation as it has advantages over traditional BIO-representation , as shown , e.g. in <CITED HERE> . Motivation val
TF is given by TFD , t , and it denotes frequency of term t in document D. IDF is given by IDFt = log ( N/dft ) , where N is the number of documents in the collection , and dft is the number of documents containing the term t. ( <CITED HERE> ) proposed the combination of TF and IDF as weighting schemes , and it has been shown that their product gave better performance . Motivation train
An HPSG grammar consists of lexical entries and ID grammar rules , each of which is described with typed feature structures ( <CITED HERE> ) . Background train
More recently , Burke , Cahill , et al. ( 2004a ) carried out an evaluation of the automatic annotation algorithm against the publicly available PARC 700 Dependency Bank ( <CITED HERE> ) , a set of 700 randomly selected sentences from Section 23 which have been parsed , converted to dependency format , and manually corrected and extended by human validators . Background train
In our work , we gather sets of sentences , and assume ( but do not employ ) existing approaches for their organization ( <CITED HERE> ; Barzilay , Elhadad , and McKeown 2001 ; Barzilay and McKeown 2005 ) . Background train
For example , such schema can serve as a means to represent translation examples , or find structural correspondences for the purpose of transfer grammar learning ( Menezes & Richardson , 2001 ) , ( Aramaki et al. , 2001 ) , ( Watanabe et al. , 2000 ) , ( Meyers et al. , 2000 ) , ( <CITED HERE> ) , ( Kaji et al. , 1992 ) , and example-based machine translation EBMT3 ( Sato & Nagao , 1990 ) , ( Sato , 1991 ) , ( Richardson et al. , 2001 ) , ( Al-Adhaileh & Tang , 1999 ) . Background train
This was done because purely unsupervised techniques ( e.g. , Baum-Welch [ <CITED HERE> ] or Brill 's [ Brill 1995b ] ) enable regularities to be induced for word classes which contain many entries , exploiting the fact that individual words that belong to a POS class occur in different ambiguity patterns . CompareOrContrast train
Two formalizations of lexical rules as used by HPSG linguists have been proposed , the meta-level lexical rules ( MLRs ; <CITED HERE> ; Calcagno and Pollard 1995 ) and the . Background train
In terms of treebank data , the primary training corpus is the Penn Wall Street Journal Treebank ( PTB ) ( <CITED HERE> ) . Uses train
The extraction of each PICO element relies to a different extent on an annotated corpus of MEDLINE abstracts , created through an effort led by the first author at the National Library of Medicine ( <CITED HERE> ) . Uses train
Various feature selection techniques have been used in automatic text categorization ; they include document frequency ( DF ) , information gain ( IG ) ( Tzeras and Hartman , 1993 ) , minimum description length principle ( <CITED HERE> ) , and the χ2 statistic . Background val
Usually , the classes are from WordNet ( Miller et al. , 1990 ) , although they can also be inferred from clustering ( <CITED HERE> ) . Background train
The account sketched in Section 4 was superimposed on an incremental GRE algorithm , partly because incrementality is well established in this area ( Appelt 1985 ; <CITED HERE> ) . Background train
In most cases , the accuracy of parsers degrades when run on out-of-domain data ( Gildea , 2001 ; <CITED HERE> ; Blitzer et al. , 2006 ; Petrov et al. , 2010 ) . Background train
Now for some important remarks on efficiency : • Computing ti is an instance of the well-known algebraic path problem ( <CITED HERE> ; Tarjan , 1981a ) . Background train
<CITED HERE> take an entirely different approach by showing that one can successfully infer held out feature norms from weighted mixtures based on textual similarity . Background train
Over the last decade there has been a lot of interest in developing tutorial dialogue systems that understand student explanations ( Jordan et al. , 2006 ; Graesser et al. , 1999 ; Aleven et al. , 2001 ; Buckley and Wolska , 2007 ; Nielsen et al. , 2008 ; VanLehn et al. , 2007 ) , because high percentages of self-explanation and student contentful talk are known to be correlated with better learning in human-human tutoring ( Chi et al. , 1994 ; Litman et al. , 2009 ; <CITED HERE> ; Steinhauser et al. , 2007 ) . Background train
Previous work in sentence planning in the natural language generation ( NLG ) community uses hand-written rules to approximate the distribution of linguistic phenomena in a corpus ( see ( <CITED HERE> ) for a recent example with further references ) . Background train
We assume that every determiner has its own equivalence , which resolves it as a quantifier : sometimes this can be quite a complicated matter , as with any ( <CITED HERE> ) , which will resolve in different ways depending on its linguistic context , but here we avoid this complexity . Separate equivalences might also make it easier to encode determiner-specific preferences , such as that of each for wide scope . Background train
Following <CITED HERE> , such expressions will be called vague descriptions even though , as we shall see , the vagueness of the adjective does not extend to the description as a whole . Uses train
Such approaches have been tried recently in restricted cases ( McCallum et al. , 2000 ; Eisner , 2001b ; <CITED HERE> ) . Background train
The standard way to handle this problem is to handcraft a finite set of features which provides a sufficient summary of the unbounded history ( Ratnaparkhi , 1999 ; <CITED HERE> ; Charniak , 2000 ) . CompareOrContrast train
Following <CITED HERE> , we measure association norm prediction as an average of percentile ranks . Uses val
There is a general consensus among theoretical linguists that the proper representation of verbal argument structure is event structure -- representations grounded in a theory of events that decompose semantic roles in terms of primitive predicates representing concepts such as causality and inchoativity ( Dowty , 1979 ; Jackendoff , 1983 ; <CITED HERE>b ; Rappaport Hovav and Levin , 1998 ) . Background train
Other molecular biology databases We also included several model organism databases or nomenclature databases in the construction of the dictionary , i.e. , mouse -- Mouse Genome Database ( MGD ) [ 18 ] , fly -- FlyBase [ 19 ] , yeast -- Saccharomyces Genome Database ( SGD ) [ 20 ] , rat -- Rat Genome Database ( RGD ) <CITED HERE> , worm -- WormBase [ 22 ] , Human Nomenclature Database ( HUGO ) [ 23 ] , Online Mendelian Inheritance in Man ( OMIM ) [ 24 ] , and Enzyme Nomenclature Database ( ECNUM ) [ 25 , 26 ] . Uses train
<CITED HERE> re-trained the linguistic parsers bilingually based on word alignment . CompareOrContrast train
This is , for example , the case for all proposals working with verbal lexical entries that raise the arguments of a verbal complement ( <CITED HERE> ) that also use lexical rules such as the Complement Extraction Lexical Rule ( Pollard and Sag 1994 ) or the Complement Cliticization Lexical Rule ( Miller and Sag 1993 ) to operate on those raised elements . Background train
Much of theoretical linguistics can be formulated in a very natural manner as stating correspondences ( translations ) between layers of representation structures ( <CITED HERE> ) , such as the relation between syntax and semantics . Background val
From an IR view , a lot of specialized research has already been carried out for medical applications , with emphasis on the lexico-semantic aspects of dederivation and decomposition ( Pacak et al. , 1980 ; Norton and Pacak , 1983 ; Wolff , 1984 ; <CITED HERE> ; Dujols et al. , 1991 ; Baud et al. , 1998 ) . Background train
The following are the various tag sets we use in this article : ( a ) the core POS tag sets CORE44 and the newly introduced CORE12 ; ( b ) CATiB Treebank tag set ( CATIB6 ) ( Habash and Roth 2009 ) and its newly introduced extension of CATIBEX created using simple regular expressions on word form , indicating particular morphemes such as the prefix JI Al + or the suffix v ' + wn ; this tag set is the best-performing tag set for Arabic on predicted values as reported in Section 4 ; ( c ) the PATB full tag set with complete morphological tag ( BW ) ( <CITED HERE> ) ; and two extensions of the PATB reduced tag set ( PENN POS , a.k.a. RTS , size 24 [ Diab , Hacioglu , and Jurafsky 2004 ] ) , both outperforming it : ( d ) Kulick , Gabbard , and Marcus ( 2006 ) 's tag set ( KULICK ) , size 43 , one of whose most important extensions is the marking of the definite article clitic , and ( e ) Diab and Benajiba 's ( in preparation ) EXTENDED RTS tag set ( ERTS ) , which marks gender , number , and definiteness , size 134 . Uses train
A similar problem is discussed in the psycholinguistics of interpretation ( <CITED HERE> ) : Interpretation is widely assumed to proceed incrementally , but vague descriptions resist strict incrementality , since an adjective in a vague description can only be fully interpreted when its comparison set is known . CompareOrContrast train
In this paper , we evaluated the role of low-level image features , SURF and GIST , for their compatibility with the multimodal Latent Dirichlet Allocation model of <CITED HERE> . Uses train
Figure 2 illustrates a DSyntS from a meteorological application , MeteoCogent ( Kittredge and Lavoie , 1998 ) , represented using the standard graphical notation and also the RealPro ASCII notation used internally in the framework ( <CITED HERE> ) . Extends train
And <CITED HERE> argues for `` keeping track of counts of arbitrary fragments within parse trees '' , which has indeed been carried out in Collins and Duffy ( 2002 ) who use exactly the same set of ( all ) tree fragments as proposed in Bod ( 1992 ) . Background train
<CITED HERE> use a tagged parallel corpus to extract translationally equivalent English-Greek clauses on the basis of word occurrence and co-occurrence probabilities . Background train
The key linguistic knowledge sources that we use are morphological analysis and generation of German based on SMOR , a morphological analyzer/generator of German ( Schmid et al. , 2004 ) and the BitPar parser , which is a state-of-the-art parser of German ( <CITED HERE> ) . Uses train
A common computational treatment of lexical rules adopted , for example , in the ALE system ( <CITED HERE> ) consists of computing the transitive closure of the base lexical entries under lexical rule application at compile-time . CompareOrContrast train
Manually defined heuristics are used to automatically annotate each tree in the treebank with partially specified HPSG derivation trees : Head/argument/modifier distinctions are made for each node in the tree based on <CITED HERE> and Collins ( 1997 ) ; Uses train
Authors may choose this right with the No-Deriv option of the Creative Commons licences ( <CITED HERE> ) . Background train
Mathematical analysis can assess a measure with respect to some formal properties , e.g. whether a measure is a metric ( <CITED HERE> ) . However , mathematical analysis can not tell us whether a measure closely resembles human judgments or whether it performs best when used in a certain application . Background train
The same annotation scheme as in our previous work on anger detection has been applied , see e.g. ( <CITED HERE> ) . Extends train
We also compute GIST vectors ( <CITED HERE> ) for every image using LearGIST ( Douze et al. , 2009 ) . Uses train
In modern syntactic theories ( e.g. , lexical-functional grammar [ LFG ] [ Kaplan and Bresnan 1982 ; Bresnan 2001 ; Dalrymple 2001 ] , head-driven phrase structure grammar [ HPSG ] [ Pollard and Sag 1994 ] , tree-adjoining grammar [ TAG ] [ Joshi 1988 ] , and combinatory categorial grammar [ CCG ] [ <CITED HERE> ] ) , the lexicon is the central repository for much morphological , syntactic , and semantic information . Background train
The analysis of the data we have collected indicates that student satisfaction may be affected if the system rephrases student answers using different words ( for example , using better terminology ) but does n't explicitly explain the reason why different terminology is needed ( <CITED HERE> ) . Future val
A logic that provides the formal architecture required by Pollard and Sag ( 1994 ) was defined by <CITED HERE> , 1994 ) . Background val
fields generally follow the pattern of `` introduction '' , `` methods '' , `` results '' , and `` conclusions '' ( Salanger-Meyer , 1990 ; <CITED HERE> ; Orăsan , 2001 ) . Background val
<CITED HERE> report that an optimal tag set for parsing Czech consists of a basic POS tag plus a CASE feature ( when applicable ) . Background train
The elimination of redundant nondeterminism is based on Unfold/Fold transformation techniques ( Tamaki and Sato 1984 ) . The unfolding transformation is also referred to as partial execution , for example , by <CITED HERE> . Background train
Over the last decade there has been a lot of interest in developing tutorial dialogue systems that understand student explanations ( Jordan et al. , 2006 ; Graesser et al. , 1999 ; Aleven et al. , 2001 ; Buckley and Wolska , 2007 ; <CITED HERE> ; VanLehn et al. , 2007 ) , because high percentages of self-explanation and student contentful talk are known to be correlated with better learning in human-human tutoring ( Chi et al. , 1994 ; Litman et al. , 2009 ; Purandare and Litman , 2008 ; Steinhauser et al. , 2007 ) . Background train
We do not relate to specific results in their study because it has been brought to our attention that <CITED HERE> are in the process of rechecking their code for errors , and rerunning their experiments ( personal communication ) . CompareOrContrast val
Inspired by ( <CITED HERE> ) , we split one phrase type into several subsymbols , which contain category information of current constituent 's parent . Motivation train
The recent great advances in speech and language technologies have made it possible to build fully implemented spoken dialogue systems ( Aust et al. , 1995 ; Allen et al. , 1996 ; <CITED HERE> ; Walker et al. , 2000 ) . Background train
Moreover , in order to determine whether the performances of the predictive criteria are consistent across different learning models within the same domain , we have performed the study on two parsing models : one based on a context-free variant of tree-adjoining grammars ( Joshi , Levy , and Takahashi 1975 ) , the Probabilistic Lexicalized Tree Insertion Grammar ( PLTIG ) formalism ( Schabes and Waters 1993 ; <CITED HERE> ) , and Collins 's Model 2 parser ( 1997 ) . Uses train
Most of the early work on automatic f-structure annotation ( e.g. , van Genabith , Way , and Sadler 1999 ; <CITED HERE> ; Sadler , van Genabith , and Way 2000 ) was applied only to small data sets ( fewer than 200 sentences ) and was largely proof of concept . Background train
We prepare the training data by splitting compounds in two steps , following the technique of <CITED HERE> . Uses train
Recent work by <CITED HERE> on the filtering phase of this approach uses linguistic verb classes ( based on Levin [ 1993 ] ) for obtaining more accurate back-off estimates for hypothesis selection . Background train
Numerous previous pseudodisambiguation evaluations only include arguments that occur between 30 and 3000 times ( Erk , 2007 ; Keller and Lapata , 2003 ; <CITED HERE> ) . CompareOrContrast train
Some efforts have tackled tasks such as automatic image caption generation ( Feng and Lapata , 2010a ; Ordonez et al. , 2011 ) , text illustration ( Joshi et al. , 2006 ) , or automatic location identification of Twitter users ( Eisenstein et al. , 2010 ; Wing and Baldridge , 2011 ; <CITED HERE> ) . Background train
Default parameters were used , although experimentation with different parameter settings is an important direction for future work ( <CITED HERE> ; Munson et al. , 2005 ) . Future train
The language chosen for semantic representation is a flat semantics along the lines of ( Bos , 1995 ; <CITED HERE> ; Copestake et al. , 2001 ) . CompareOrContrast train
For example , <CITED HERE> and Jokinen et al. ( 2008 ) find that machine learning algorithms can be trained to recognise some of the functions of head movements , while Reidsma et al. ( 2009 ) show that there is a dependence between focus of attention and assignment of dialogue act labels . Background train
As a logical postulate it is not very radical ; it is possible within a finitary framework to develop that part of mathematics that is used or has potential applications in natural science , such as mathematical analysis ( cf. <CITED HERE> ) . Background train
In a similar vein to Skut and Brants ( 1998 ) and <CITED HERE> , the method extends an existing flat shallow-parsing method to handle composite structures . Future val
An interesting aspect of our generative approach is that we model HMM outputs as Gaussian vectors ( log probabilities of observing entire sentences based on our language models ) , as opposed to sequences of terms , as done in ( <CITED HERE> ) . CompareOrContrast train
• language learning ( Green 1979 ; Mori and Moeser 1983 ; Morgan , Meier , and Newport 1989 ) • monolingual grammar induction ( Juola 1998 ) • grammar optimization ( Juola 1994 ) • insights into universal grammar ( Juola 1998 ) • machine translation ( Juola 1994 , 1997 ; <CITED HERE> ; Gough , Way , and Hearne 2002 ) Background train
In Charniak ( 1996 ) and <CITED HERE> , it was observed that treebank grammars ( CFGs extracted from treebanks ) are very large and grow with the size of the treebank . Background val
We apply our system to the latest version of the XTAG English grammar ( The XTAG Research <CITED HERE> ) , which is a large-scale FB-LTAG grammar . Uses train
Furthermore , the need to answer questions related to patient care at the point of service has been well studied and documented ( Covell , Uman , and Manning 1985 ; Gorman , Ash , and Wykoff 1994 ; <CITED HERE> , 2005 ) . Background train
Typical examples are Bulgarian ( Simov et al. , 2005 ; Simov and Osenova , 2003 ) , Chinese ( Chen et al. , 2003 ) , Danish ( Kromann , 2003 ) , and Swedish ( <CITED HERE> ) . Background train
Over the last decade there has been a lot of interest in developing tutorial dialogue systems that understand student explanations ( Jordan et al. , 2006 ; Graesser et al. , 1999 ; Aleven et al. , 2001 ; Buckley and Wolska , 2007 ; Nielsen et al. , 2008 ; VanLehn et al. , 2007 ) , because high percentages of self-explanation and student contentful talk are known to be correlated with better learning in human-human tutoring ( Chi et al. , 1994 ; <CITED HERE> ; Purandare and Litman , 2008 ; Steinhauser et al. , 2007 ) . Background train
Worst case , calculating the set corresponding with such a property , of the form size ( x ) = max_m , for example , involves sorting the distractors as to their size , which may amount to O ( n_d^2 ) or O ( n_d log n_d ) calculations ( depending on the sorting algorithm : cf. [ <CITED HERE> ] Chapter 8 ) . Background train
• language learning ( <CITED HERE> ; Mori and Moeser 1983 ; Morgan , Meier , and Newport 1989 ) • monolingual grammar induction ( Juola 1998 ) • grammar optimization ( Juola 1994 ) • insights into universal grammar ( Juola 1998 ) • machine translation ( Juola 1994 , 1997 ; Veale and Way 1997 ; Gough , Way , and Hearne 2002 ) Background train
Virpioja et al. ( 2007 ) , Badr et al. ( 2008 ) , Luong et al. ( 2010 ) , <CITED HERE> , and others are primarily concerned with using morpheme segmentation in SMT , which is a useful approach for dealing with issues of word-formation . CompareOrContrast train
While corpus driven efforts along the PARSEVAL lines ( <CITED HERE> ) are good at giving some measure of a grammar coverage , they are not suitable for finer grained analysis and in particular , for progress evaluation , regression testing and comparative report generation . Background val
Before using the DCA method , we applied a Russian morphological processor ( <CITED HERE> ) to convert each word in the text to its main form : nominative case singular for nouns and adjectives , infinitive for verbs , etc. . Uses train
The result holds for both the MaltParser ( Nivre 2008 ) and the Easy-First Parser ( <CITED HERE> ) . Uses train
Also relevant is work on the general problems of dialog-act tagging ( Stolcke et al. , 2000 ) , citation analysis ( <CITED HERE> ) , and computational rhetorical analysis ( Marcu , 2000 ; Teufel and Moens , 2002 ) . Background train
The language chosen for semantic representation is a flat semantics along the lines of ( <CITED HERE> ; Copestake et al. , 1999 ; Copestake et al. , 2001 ) . CompareOrContrast val
In simple terms , P2P is a technology that takes advantage of the resources and services available at the edge of the Internet ( <CITED HERE> ) . Background train
A common way to combine different models consists of selecting the model that is most confident regarding its decision ( <CITED HERE> ) . CompareOrContrast train
Common sense ( as well as the Gricean maxims ; <CITED HERE> ) suggests that vague descriptions are preferred by speakers over quantitative ones whenever the additional information provided by a quantitative description is irrelevant to the purpose of the communication . Background train
A similar method is included in PATR-II ( Shieber et al. 1983 ) and can be used to encode lexical rules as binary relations in the CUF system ( Dorre and Eisele 1991 ; Dorre and Dorna 1993b ) or the TFS system ( <CITED HERE> ; Emele 1994 ) . CompareOrContrast train
For automatically extracting these surface level mappings we will draw on the approach to learning paraphrases from a corpus that is described in <CITED HERE> . Future val
This description can then be given the standard set-theoretical interpretation of <CITED HERE> , 1994 ) . Background train
<CITED HERE> helped pave the path for cognitive-linguistic multimodal research , showing that Latent Dirichlet Allocation outperformed Latent Semantic Analysis ( Deerwester et al. , 1990 ) in the prediction of association norms . Background train
A more flexible approach is used by <CITED HERE> , where users can specify boundary values for attributes like rainfall , specifying , for example , rain counts as moderate above 7 mm/h , as heavy above 20 mm/h , and so on . Background train
Collins and Duffy ( 2002 ) define a kernel over parse trees and apply it to re-ranking the output of a parser , but the resulting feature space is restricted by the need to compute the kernel efficiently , and the results are not as good as Collins ' previous work on re-ranking using a finite set of features ( <CITED HERE> ) . Background train
The shallow parser used is the SNoW-based CSCL parser ( <CITED HERE> ; Munoz et al. , 1999 ) . Uses train
In practice , perceptron-type algorithms are often applied in a batch learning scenario , i.e. , the algorithm is applied for K epochs to a training sample of size T and then used for prediction on an unseen test set ( Freund and Schapire , 1999 ; <CITED HERE> ) . CompareOrContrast train
Table 2 shows the results on identifying all phrases -- chunking in CoNLL2000 ( Tjong Kim <CITED HERE> ) terminology . Uses train
Unfortunately , as shown in ( <CITED HERE> ) , with the representation of sentences that we use , linear classifiers can not discriminate real sentences from sentences sampled from a trigram , which is the model we use as a baseline , so here we resort to a non-linear large-margin classifier ( see section 3 for details ) . Motivation train
Perhaps some variation of multi-level bulleted lists , appropriately integrated with interface elements for expanding and hiding items , might provide physicians a better overview of the information landscape ; see , for example , <CITED HERE> . Background train
<CITED HERE> have shown , in the context of base noun identification , that combining sample selection and cotraining can be an effective learning framework for large-scale training . Background train
• The regular TBL , as described in section 2 ; • An improved version of TBL , which makes extensive use of indexes to speed up the rules ' update ; • The FastTBL algorithm ; • The ICA algorithm ( <CITED HERE> ) . CompareOrContrast train
For example , Jokinen and Ragni ( 2007 ) and <CITED HERE> find that machine learning algorithms can be trained to recognise some of the functions of head movements , while Reidsma et al. ( 2009 ) show that there is a dependence between focus of attention and assignment of dialogue act labels . Background train
This approach resembles the work by <CITED HERE> and Hirschman et al. ( 1975 ) on selectional restrictions . CompareOrContrast train
Unless it is desired to intentionally filter these out as being outside of the new domain , one can insert some arbitrarily small probability for these arcs , using , for example , an N-gram back-off model ( <CITED HERE> ) . Background val
feature Cohen 's k corrected k agreement 73.59 98.74 dial act 84.53 98.87 turn 73.52 99.16 Table 2 : Inter-coder agreement on feedback expression annotation Although researchers do not totally agree on how to measure agreement in various types of annotated data and on how to interpret the resulting figures , see Artstein and Poesio ( 2008 ) , it is usually assumed that Cohen 's kappa figures over 60 are good while those over 75 are excellent ( <CITED HERE> ) . Background train
To name a few examples , Rohrbach et al. ( 2010 ) and <CITED HERE> show how semantic information from text can be used to improve zero-shot classification ( i.e. , classifying never-before-seen objects ) , and Motwani and Mooney ( 2012 ) show that verb clusters can be used to improve activity recognition in videos . Background train
Empirical evidence has been brought forward that inflectional and/or derivational stemmers augmented by dictionaries indeed perform substantially better than those without access to such lexical repositories ( Krovetz , 1993 ; <CITED HERE> ; Tzoukermann et al. , 1997 ) . Background train
The XTAG group ( <CITED HERE> ) at the University of Pennsylvania is also developing Korean , Chinese , and Hindi grammars . Background train
HOLMES is given the following set of six domain-independent rules , which are similar to the upward monotone rules introduced by ( <CITED HERE> ) . CompareOrContrast train
The lexicon is used to mediate and map between a language-independent domain model and a language-dependent ontology widely used in NLG , the Upper Model ( <CITED HERE> ) . Background train
ImageNet is a large-scale and widely used image database , built on top of WordNet , which maps words into groups of images , called synsets ( <CITED HERE> ) . Uses train
based parsing algorithms with an arc-factored parameterization ( <CITED HERE> ) . Uses train
Two applications that , like help-desk , deal with question -- answer pairs are : summarization of e-mail threads ( Dalli , Xia , and Wilks 2004 ; Shrestha and McKeown 2004 ) , and answer extraction in FAQs ( Frequently Asked Questions ) ( <CITED HERE> ; CompareOrContrast val
The retrieval process relies on the vector space model ( <CITED HERE> ) , with the cosine measure expressing the similarity between a query and a document . Uses val
For right-branching structures , the left-corner ancestor is the parent , conditioning on which has been found to be beneficial ( <CITED HERE> ) , as has conditioning on the left-corner child ( Roark and Johnson , 1999 ) . Background train
A few others incorporate various measures of inter-document similarity between the texts to be labeled ( Agarwal and Bhattacharyya , 2005 ; <CITED HERE> ; Goldberg and Zhu , 2006 ) . Background train
A few others incorporate various measures of inter-document similarity between the texts to be labeled ( <CITED HERE> ; Pang and Lee , 2005 ; Goldberg and Zhu , 2006 ) . Background train
This is similar to the `` one sense per collocation '' idea of <CITED HERE> . CompareOrContrast train
Against the background of a growing interest in multilingual NLP , multilingual anaphora / coreference resolution has gained considerable momentum in recent years ( Aone and McKee 1993 ; Azzam , Humphreys , and Gaizauskas 1998 ; Harabagiu and Maiorano 2000 ; Mitkov and Barbu 2000 ; <CITED HERE> ; Mitkov and Stys 1997 ; Mitkov , Belguith , and Stys 1998 ) . Background train
This system has been successfully tested with the development of plug-ins supporting instant messaging , distributed video encoding ( Hughes and Walkerdine , 2005 ) , distributed virtual worlds ( Hughes et al. , 2005 ) and digital library management ( <CITED HERE> ) . Background train
be found in figure 2 , which is similar to that in <CITED HERE> . CompareOrContrast train
It is inspired by the system described in <CITED HERE> . Motivation train
• A user study was performed , but it was either very small compared to the corpus ( Carmel , Shtalhaim , and Soffer 2000 ; Jijkoun and de Rijke 2005 ) , or the corpus itself was significantly smaller than ours ( Feng et al. 2006 ; <CITED HERE> ) . CompareOrContrast train
Although a number of methods for query-dependent text summarization are beginning to be developed and evaluated in a variety of realistic settings ( <CITED HERE> ) , we again propose the use of vector space methods from IR , which can be easily extended to the summarization task ( Salton et al. , 1994 ) : CompareOrContrast train
Riehemann 1993 ; <CITED HERE> ; Frank 1994 ; Opalka 1995 ; Sanfilippo 1995 ) . CompareOrContrast train
When we run our classifiers on resource-tight environments such as cell-phones , we can use a random feature mixing technique ( <CITED HERE> ) or a memory-efficient trie implementation based on a succinct data structure ( Jacobson , 1989 ; Delpratt et al. , 2006 ) to reduce required memory usage . Future train
These translations gave rise to a number of automatically constructed linguistic resources : ( 1 ) the original ( source , target ) phrasal translation pairs , ( 2 ) the marker lexicon , ( 3 ) the gen11 Thanks are due to one of the anonymous reviewers for pointing out that our wEBMT system , seeded with input from multiple translation systems , with a postvalidation process via the Web ( amounting to an n-gram target language model ) , in effect forms a multiengine MT system as described by Frederking and Nirenburg ( 1994 ) , <CITED HERE> , and Hogan and Frederking ( 1998 ) . CompareOrContrast train
A variety of statistical methods were proposed over the recent years for learning to produce a full parse of free-text sentences ( e.g. , Bod ( 1992 ) , <CITED HERE> , Collins ( 1997 ) , Ratnaparkhi ( 1997 ) , and Sekine ( 1998 ) ) . Background train
Discriminative approaches ( especially SVMs ) have been shown to be very effective for many supervised classification tasks ; see , for example , ( <CITED HERE> ; Ng and Jordan , 2001 ) . Background train
This is the approach taken by IBM Models 4 + ( Brown et al. 1993b ; <CITED HERE> ) , and more recently by the LEAF model ( Fraser and Marcu 2007 ) . CompareOrContrast train
A few others incorporate various measures of inter-document similarity between the texts to be labeled ( Agarwal and Bhattacharyya , 2005 ; Pang and Lee , 2005 ; <CITED HERE> ) . Background val
<CITED HERE> proposes to unify these two steps by including an update operator in the Background train
For instance , when building translation units in EBMT approaches ( Richardson et al. , 2001 ) , ( Aramaki , 2001 ) , ( Al-Adhaileh & Tang , 1999 ) , ( Sato & Nagao , 1990 ) , ( Sato , 1991 ) , ( <CITED HERE> ) , etc. , where S-SSTC can be used to represent the entries of the BKB or when S-SSTC is used as an annotation schema to find the translation correspondences ( lexical and structural correspondences ) for transfer-rules ' extraction from parallel parsed corpus ( Menezes & Richardson , 2001 ) , ( Watanabe et al. , Background train
<CITED HERE> introduce a new method of multimodal integration based on Canonical Correlation Analysis , and perform a systematic comparison between their CCA-based model and others on association norm prediction , held out feature prediction , and word similarity . Background train
More details on how the structural divergences described in ( <CITED HERE> ) can be accounted for using our formalism can be found in ( Nasr et al. , 1998 ) . Background train
At present , the system takes into consideration the number of incorrect answers received in response to the current question and the number of uninterpretable answers . In addition to a remediation policy , the tutorial planner implements an error recovery policy ( <CITED HERE> ) . Uses train
More recently , <CITED HERE> show that visual attribute classifiers , which have been immensely successful in object recognition ( Farhadi et al. , 2009 ) , act as excellent substitutes for feature Background train
We found the same number using our previous approach ( <CITED HERE> ) , which is roughly equivalent to our core module . Extends train
Similar things hold for multifaceted properties like intelligence ( <CITED HERE> ) . Background train
We have shown elsewhere ( <CITED HERE> ; Zadrozny 1987a , 1987b ) that natural language programs , such as on-line grammars and dictionaries , can be used as referential levels for commonsense reasoning -- for example , to disambiguate PP attachment . Extends val
( Och and Ney , 2002 ; Blunsom et al. , 2008 ) used maximum likelihood estimation to learn weights for MT. ( Och , 2003 ; Moore and Quirk , 2008 ; <CITED HERE> ; Galley and Quirk , 2011 ) employed an evaluation metric as a loss function and directly optimized it . CompareOrContrast train
There has been some controversy , at least for simple stemmers ( <CITED HERE> ; Porter , 1980 ) , about the effectiveness of morphological analysis for document retrieval ( Harman , 1991 ; Krovetz , 1993 ; Hull , 1996 ) . Background train
<CITED HERE> reported an intra-subject correlation of r = .85 for 15 subjects judging the similarity of a subset ( 36 ) of the original 65 word pairs . CompareOrContrast train
Within-document coreference resolution has been applied to produce summaries of text surrounding occurrences of the name ( <CITED HERE> ; Gooi and Allan , 2004 ) . Background train
More recently , <CITED HERE> have proposed the exploitation of TMs at a subsentential level , while Carl , Way , and Schäler ( 2002 ) and Schäler , Way , and Carl ( 2003 , pages 108 -- 109 ) describe how phrasal lexicons might come to occupy a central place in a future hybrid integrated translation environment . Background train
With a minimal set of features and a small number of lexical entries , Niyogi ( 2001 ) has successfully modeled many of the argument alternations described by Levin ( 1993 ) using a <CITED HERE> style analysis . Background train
For some adjectives , including the ones that <CITED HERE> called evaluative ( as opposed to dimensional ) , this is clearly inadequate . Background train
WIT has been implemented in Common Lisp and C on UNIX , and we have built several experimental and demonstration dialogue systems using it , including a meeting room reservation system ( <CITED HERE>b ) , a video-recording programming system , a schedule management system ( Nakano et al. , 1999a ) , and a weather information system ( Dohsaka et al. , 2000 ) . Extends train
The more conservative approach is to try to integrate existing statistical disambiguation schemes for QLFs , either individually or in a `` packed '' structure ( <CITED HERE> ) , with the resolution process as described here . Future val
They proved to be useful in a number of NLP applications such as natural language generation ( Iordanskaja et al. , 1991 ) , multidocument summarization ( <CITED HERE> ) , automatic evaluation of MT ( Denkowski and Lavie , 2010 ) , and TE ( Dinu and Wang , 2009 ) . Motivation train
While many approaches have addressed this problem , our work is most closely related to that of ( Raina et al. , 2005 ; <CITED HERE> ; Tatu and Moldovan , 2006 ; Braz et al. , 2005 ) , which convert the inputs into logical forms and then attempt to ` prove ' H from T plus a set of axioms . CompareOrContrast train
While this is simply irrelevant for general-purpose morphological analyzers , dealing with such phenomena is crucial for any attempt to cope adequately with medical free-texts in an IR setting ( <CITED HERE> ) . Background train
The EDR has close ties to the named entity recognition ( NER ) and coreference resolution tasks , which have been the focus of several recent investigations ( Bikel et al. , 1997 ; Miller et al. , 1998 ; Borthwick , 1999 ; Mikheev et al. , 1999 ; Soon et al. , 2001 ; Ng and Cardie , 2002 ; <CITED HERE> ) , and have been at the center of evaluations such as : MUC-6 , MUC-7 , and the CoNLL '02 and CoNLL '03 shared tasks . Background train
In English , where the base form is morphologically simpler than the other two , this rule could be argued to follow from Gricean principles ( <CITED HERE> ) . Background val
More generally , distributional clustering techniques ( Schütze , 1992 ; <CITED HERE> ) could be applied to extract semantic classes from the corpus itself . Future train
The use of the web as a corpus for teaching and research on language has been proposed a number of times ( Kilgarriff , 2001 ; Robb , 2003 ; <CITED HERE> ; Fletcher , 2001 , 2004b ) and received a special issue of the journal Computational Linguistics ( Kilgarriff and Grefenstette , 2003 ) . Background train
<CITED HERE> replicates the work of Fung and McKeown with different language pairs using the simpler metric of Levenshtein distance . Background val
• Only an automatic evaluation was performed , which relied on having model responses ( <CITED HERE> ; Berger et al. 2000 ) . CompareOrContrast train
Typed feature grammars can be used as the basis for implementations of Head-driven Phrase Structure Grammar ( HPSG ; <CITED HERE> ) as discussed in ( Gotz and Meurers , 1997a ) and ( Meurers and Minnen , 1997 ) . Background train
In <CITED HERE>a ) we identified several systems that resemble ours in that they provide answers to queries . Background train
The problem of handling ill-formed input has been studied by Carbonell and Hayes ( 1983 ) , Granger ( 1983 ) , <CITED HERE> , Kwasny and Sondheimer ( 1981 ) , Riesbeck and Schank ( 1976 ) , Thompson ( 1980 ) , Weischedel and Black ( 1980 ) , and Weischedel and Sondheimer ( 1983 ) . CompareOrContrast val
A variety of such lists for many languages are already available ( e.g. , <CITED HERE> ) . Background train
In principle , this might be done by providing the generator with vague input -- in which case no special algorithms are needed -- but suitably contextualized vague input is often not available ( <CITED HERE> ) . Motivation train
Liu et al. ( 2005 ) , Meral et al. ( 2007 ) , Murphy ( 2001 ) , Murphy and Vogel ( 2007 ) and <CITED HERE>a ) all belong to the syntactic transformation category . Background train
We induced a two-class word-to-word model of translational equivalence from 13 million words of the Canadian Hansards , aligned using the method in ( <CITED HERE> ) . Uses train
<CITED HERE> observes that accomplishments differ from achievements only in terms of event duration , which is often a question of granularity . Background train
[ The current system should be distinguished from an earlier voice system ( VNLC , <CITED HERE> ) , which had no expectation and which handled discrete speech where a 300 millisecond pause must follow each word . ] CompareOrContrast train
Thus , over the past few years , along with advances in the use of learning and statistical methods for acquisition of full parsers ( Collins , 1997 ; Charniak , 1997a ; Charniak , 1997b ; Ratnaparkhi , 1997 ) , significant progress has been made on the use of statistical learning methods to recognize shallow parsing patterns -- syntactic phrases or words that participate in a syntactic relationship ( Church , 1988 ; <CITED HERE> ; Argamon et al. , 1998 ; Cardie and Pierce , 1998 ; Munoz et al. , 1999 ; Punyakanok and Roth , 2001 ; Buchholz et al. , 1999 ; Tjong Kim Sang and Buchholz , 2000 ) . Background train
He was a grammarian who analysed Sanskrit ( <CITED HERE> ) . Background train
After much exploration , <CITED HERE> discovered that it was not practical to annotate PICO entities at the phrase level due to significant unresolvable disagreement and interannotator reliability issues . Background train
The necessity of this kind of merging of arguments has been recognized before : <CITED HERE> call it abductive unification/matching , Hobbs ( 1978 , 1979 ) refers to such operations using the terms knitting or petty conversational implicature . Background train
The OntoNotes-5.0 dataset , which is released for the CoNLL-2012 Shared Task ( <CITED HERE> ) , contains 3,145 annotated documents . Uses train
and <CITED HERE> , as described below . CompareOrContrast train
We collect substring rationales for a sentiment classification task ( <CITED HERE> ) and use them to obtain significant accuracy improvements for each annotator . Uses train
We apply two different priming experiments , namely the cross-modal priming and masked priming experiments discussed in ( <CITED HERE> ; Rastle et al. , 2000 ; Marslen-Wilson et al. , 1994 ; Marslen-Wilson et al. , 2008 ) , to Bangla morphologically complex words . Uses val
Recently , several alternative , often quite sophisticated approaches to collective classification have been proposed ( Neville and Jensen , 2000 ; Lafferty et al. , 2001 ; Getoor et al. , 2002 ; Taskar et al. , 2002 ; Taskar et al. , 2003 ; <CITED HERE> ; McCallum and Wellner , 2004 ) . Background train
More sophisticated approaches have been proposed ( Hillard et al. , 2003 ) , including an extension that , in an interesting reversal of our problem , makes use of sentiment-polarity indicators within speech segments ( <CITED HERE> ) . Background train
The expectation parser uses an ATN-like representation for its grammar ( <CITED HERE> ) . Uses train
• Only an automatic evaluation was performed , which relied on having model responses ( Berger and Mittal 2000 ; <CITED HERE> ) . CompareOrContrast train
The error rate on sentence boundaries in the Brown corpus was not significantly worse than the lowest quoted before ( <CITED HERE> : 0.28 % vs. 0.20 % error rate ) . CompareOrContrast train
We follow our previous work ( <CITED HERE>b ) and restrict bridging to non-coreferential cases . Extends train
In our case , the clustering is performed by the program Snob , which implements mixture modeling combined with model selection based on the Minimum Message Length ( MML ) criterion ( <CITED HERE> ; Wallace 2005 ) . Uses train
This is in line with our previous findings from ( <CITED HERE> ) that candidates with higher power attempt to shift topics less often than others when responding to moderators . CompareOrContrast train
As has been previously observed and exploited in the NLP literature ( <CITED HERE> ; Agarwal and Bhattacharyya , 2005 ; Barzilay and Lapata , 2005 ) , the above optimization function , unlike many others that have been proposed for graph or set partitioning , can be solved exactly in a provably efficient manner via methods for finding minimum cuts in graphs . CompareOrContrast train
It is analogous to the step in other translation model induction algorithms that sets all probabilities below a certain threshold to negligible values ( Brown et al. , 1990 ; <CITED HERE> ; Chen , 1996 ) . Background train
It provides a fine grained NE recognition covering 100 different NE types ( <CITED HERE> ) . Uses train
Only a few such corpora exist , including the Hansard English-French corpus and the HKUST English-Chinese corpus ( <CITED HERE> ) . Background train
Recently , several alternative , often quite sophisticated approaches to collective classification have been proposed ( Neville and Jensen , 2000 ; Lafferty et al. , 2001 ; Getoor et al. , 2002 ; <CITED HERE> ; Taskar et al. , 2003 ; Taskar et al. , 2004 ; McCallum and Wellner , 2004 ) . Background train
In FAQs , <CITED HERE> employed a sentence retrieval approach based on a language model where the entire response to an FAQ is considered a sentence , and the questions and answers are embedded in an FAQ document . Background train
The statistical significance test is performed by the re-sampling approach ( <CITED HERE> ) . Uses val
With a minimal set of features and a small number of lexical entries , Niyogi ( 2001 ) has successfully modeled many of the argument alternations described by <CITED HERE> using a Hale and Keyser ( 1993 ) style analysis . Background train
Although evaluated on different data sets , this result is consistent with results from previous work ( Gatt and Belz , 2008 ; <CITED HERE> ) . CompareOrContrast train
We used the revised experimental setup ( <CITED HERE> ) , based on discrete relatedness scores and presentation of word pairs in isolation , that is scalable to the higher number of pairs . Uses val
<CITED HERE> ; Bobrow , 1978 ) consult relatively small lexicons , typically generated by hand . CompareOrContrast train
Typed feature grammars can be used as the basis for implementations of Head-driven Phrase Structure Grammar ( <CITED HERE> ) . ( Meurers and Minnen , 1997 ) propose a compilation of lexical rules into TIT definite clauses Background train
In the United States , for example , governmental bodies are providing and soliciting political documents via the Internet , with lofty goals in mind : electronic rulemaking ( eRulemaking ) initiatives involving the `` electronic collection , distribution , synthesis , and analysis of public commentary in the regulatory rulemaking process '' , may `` [ alter ] the citizen-government relationship '' ( <CITED HERE> ) . Background train
This will become even more interesting when mappings of our synonym identifiers to a large medical thesaurus ( MeSH , ( <CITED HERE> ) ) are incorporated into our system . Future train
( Och and Ney , 2002 ; Blunsom et al. , 2008 ) used maximum likelihood estimation to learn weights for MT. ( Och , 2003 ; <CITED HERE> ; Zhao and Chen , 2009 ; Galley and Quirk , 2011 ) employed an evaluation metric as a loss function and directly optimized it . CompareOrContrast train
raw length value as a feature , we follow our previous work ( Rubino et al. , 2013 ; <CITED HERE> ) and create multiple features for length using a decision tree ( J48 ) . Extends train
Most DOP models , such as in Bod ( 1993 ) , Goodman ( 1996 ) , Bonnema et al. ( 1997 ) , Sima'an ( 2000 ) and <CITED HERE> , use a likelihood criterion in defining the best parse tree : they take ( some notion of ) the most likely ( i.e. most probable ) tree as a candidate for the best tree of a sentence . Background train
In some systems such dependencies are learned from labeled examples ( <CITED HERE> ) . Background train
CD for this type of descriptions along the lines of Section 4 is not difficult once relational descriptions are integrated with a standard GRE algorithm ( <CITED HERE> , Section 8.6.2 ) : Suppose an initial description is generated describing the set of all those dogs that are in sheds over a given size ( say , size 5 ) ; if this description happens to distinguish an individual dog then this legitimizes the use of the noun phrase the dog in the large shed . Background train
Michiels proposed rules for doing this for infinitive complement codes ; however there seems to be no principled reason not to extend this approach to computing the underlying relations in other types of VP as well as in cases of NP , AP and PP predication ( see <CITED HERE> , for further discussion ) . Background val
While wikis have spread from a detailed design ( <CITED HERE> ) , unfortunately blogs have not been designed under a model . Background train
In our experiment , we annotated a high number of pairs similar in size to the test sets by Finkelstein ( 2002 ) and <CITED HERE> . CompareOrContrast val
notation of <CITED HERE> is more sophisticated , and may be considered another possibility . CompareOrContrast train
The recent great advances in speech and language technologies have made it possible to build fully implemented spoken dialogue systems ( Aust et al. , 1995 ; <CITED HERE> ; Zue et al. , 2000 ; Walker et al. , 2000 ) . Background val
Most approaches rely on VerbNet ( <CITED HERE> ) and FrameNet ( Baker et al. , 1998 ) to provide associations between verbs and semantic roles , that are then mapped onto the current instance , as shown by the systems competing in semantic role labelling competitions ( Carreras and Marquez , 2004 ; Carreras and Marquez , 2005 ) and also ( Gildea and Jurafsky , 2002 ; Pradhan et al. , 2005 ; Shi and Mihalcea , 2005 ) . Background train
<CITED HERE> aimed to embed information by exploiting the linguistic phenomenon of presupposition , with the idea that some presuppositional information can be removed without changing the meaning of a sentence . Background train
We first identified the most informative unigrams and bigrams using the information gain measure ( Yang and Pedersen 1997 ) , and then selected only the positive outcome predictors using odds ratio ( <CITED HERE> ) . Uses train
Perhaps some variation of multi-level bulleted lists , appropriately integrated with interface elements for expanding and hiding items , might provide physicians a better overview of the information landscape ; see , for example , <CITED HERE> . Future train
Most DOP models , such as in Bod ( 1993 ) , <CITED HERE> , Bonnema et al. ( 1997 ) , Sima'an ( 2000 ) and Collins & Duffy ( 2002 ) , use a likelihood criterion in defining the best parse tree : they take ( some notion of ) the most likely ( i.e. most probable ) tree as a candidate for the best tree of a sentence . Background train
Some previous works ( <CITED HERE> ; Zhao et al. , 2009 ; Kouylekov et al. , 2009 ) indicate , as main limitations of the mentioned resources , their limited coverage , their low precision , and the fact that they are mostly suitable to capture relations mainly between single words . Background train
Nugget F-score has been employed as a metric in the TREC question-answering track since 2003 , to evaluate so-called definition and `` other '' questions ( <CITED HERE> ) . Background train
Manually defined heuristics are used to automatically annotate each tree in the treebank with partially specified HPSG derivation trees : Head/argument/modifier distinctions are made for each node in the tree based on Magerman ( 1994 ) and <CITED HERE> ; Uses train
Regarding future work , there are many research lines that may be followed : i ) Capturing more features by employing external knowledge such as ontological , lexical resource or WordNet-based features ( Basili et al. , 2005a ; Basili et al. , 2005b ; Bloehdorn et al. , 2006 ; Bloehdorn and Moschitti , 2007 ) or shallow semantic trees ( Giuglea and Moschitti , 2004 ; Giuglea and Moschitti , 2006 ; <CITED HERE> ; Moschitti et al. , 2007 ; Moschitti , 2008 ; Moschitti et al. , 2008 ) . Future train
In addition , there has been much work on the application of linguistic and semantic knowledge to information retrieval ; see <CITED HERE>a ) for a brief overview . Background train
However , <CITED HERE> claims that the log-likelihood chisquared statistic ( G2 ) is more appropriate for corpus-based NLP . Motivation train
In a log-linear parameterization , for example , a prior that penalizes feature strengths far from 1 can be used to do feature selection and avoid overfitting ( <CITED HERE> ) . Uses val
This alignment is done on the basis of both length ( Gale and Church <CITED HERE> ) and a notion of cognateness ( Simard [ 16 ] ) . Uses train
Since earlier versions of the SNoW based CSCL were used only to identify single phrases ( Punyakanok and Roth , 2001 ; <CITED HERE> ) and never to identify a collection of several phrases at the same time , as we do here , we also trained and tested it under the exact conditions of CoNLL-2000 ( Tjong Kim Sang and Buchholz , 2000 ) to compare it to other shallow parsers . Extends train
A more detailed discussion of the various available Arabic tag sets can be found in <CITED HERE> . Background train
The high Dirichlet priors are chosen to prevent sparsity in topic distributions , while the other parameters are selected as the best from <CITED HERE> . Uses val
The resulting speech understanding system is called the Voice Natural Language Computer with Expectation ( VNLCE , <CITED HERE> ) . Background train
In our previous papers ( <CITED HERE> ; Zhang , Blackwood , and Clark 2012 ) , we applied a set of beams to this structure , which makes it similar to the data structure used for phrase-based MT decoding ( Koehn 2010 ) . CompareOrContrast train
A number of speech understanding systems have been developed during the past fifteen years ( <CITED HERE> , Dixon and Martin 1979 , Erman et al. 1980 , Haton and Pierrel 1976 , Lea 1980 , Lowerre and Reddy 1980 , Medress 1980 , Reddy 1976 , Walker 1978 , and Wolf and Woods 1980 ) . CompareOrContrast train
Another line of research approaches grounded language knowledge by augmenting distributional approaches of word meaning with perceptual information ( Andrews et al. , 2009 ; Steyvers , 2010 ; Feng and Lapata , 2010b ; Bruni et al. , 2011 ; <CITED HERE> ; Johns and Jones , 2012 ; Bruni et al. , 2012a ; Bruni et al. , 2012b ; Silberer et al. , 2013 ) . Background train
In particular , since we treat each individual speech within a debate as a single `` document '' , we are considering a version of document-level sentiment-polarity classification , namely , automatically distinguishing between positive and negative documents ( <CITED HERE> ; Pang et al. , 2002 ; Turney , 2002 ; Dave et al. , 2003 ) . Background train
A number of alignment techniques have been proposed , varying from statistical methods ( Brown et al. , 1991 ; Gale and Church , 1991 ) to lexical methods ( Kay and Roscheisen , 1993 ; <CITED HERE> ) . Background train
( <CITED HERE> contains further description and discussion of LDOCE . ) Background train
<CITED HERE> , 1997 ) assumes that words ending in - ed are verbs . CompareOrContrast train
For the development of these lists we used a collection of texts of about 300,000 words derived from the New York Times ( NYT ) corpus that was supplied as training data for the 7th Message Understanding Conference ( MUC-7 ) ( <CITED HERE> ) . Uses train
We would also provide a user facility for choosing the right licence for every lexia , following the model of Creative Commons licences ( <CITED HERE> ) . Uses train
The language grounding problem has received significant attention in recent years , owed in part to the wide availability of data sets ( e.g. Flickr , Von Ahn ( 2006 ) ) , computing power , improved computer vision models ( Oliva and Torralba , 2001 ; <CITED HERE> ; Farhadi et al. , 2009 ; Parikh and Grauman , 2011 ) and neurological evidence of ties between the language , perceptual and motor systems in the brain ( Pulvermüller et al. , 2005 ; Tettamanti et al. , 2005 ; Aziz-Zadeh et al. , 2006 ) . Background train
There has been some controversy , at least for simple stemmers ( Lovins , 1968 ; Porter , 1980 ) , about the effectiveness of morphological analysis for document retrieval ( Harman , 1991 ; <CITED HERE> ; Hull , 1996 ) . Background train
We have yet to import such a constraint into our model , but we plan to do so in the near future using the weighted majority algorithm ( <CITED HERE> ) . Future train
Generally speaking , we find that the personal public diary metaphor behind blogs ( <CITED HERE> ) may lead to an unsatisfactory representation of the context . Background train
Most probabilistic translation model reestimation algorithms published to date are variations on the theme proposed by <CITED HERE>b ) . Background train
A number of proposals in the 1990s deliberately limited the extent to which they relied on domain and/or linguistic knowledge and reported promising results in knowledge-poor operational environments ( Dagan and Itai 1990 , 1991 ; Lappin and Leass 1994 ; <CITED HERE> ; Kennedy and Boguraev 1996 ; Williams , Harvey , and Preston 1996 ; Baldwin 1997 ; Mitkov 1996 , 1998b ) . Background train
The significance testing is performed by paired bootstrap re-sampling ( <CITED HERE> ) . Uses train
The system is implemented based on ( <CITED HERE> ) and ( Marcu et al. 2006 ) . Uses train
IGEN uses standard chart generation techniques ( <CITED HERE> ) in its base generator to efficiently produce generation candidates . Background train
Position , subcat frame , phrase type , first word , last word , subcat frame + , predicate , path , head word and its POS , predicate + head word , predicate + phrase type , path to BA and BEI , verb class 3 , verb class + head word , verb class + phrase type , from <CITED HERE> . Uses train
The psycholinguistic studies of Martin ( 1970 ) , Allen ( 1975 ) , Hillinger et al. ( 1976 ) , Grosjean et al. ( 1979 ) , Dommergues and Grosjean ( 1983 ) , and <CITED HERE> , responding to the idea of readjusted syntax as the source of prosodic phrasing , show that grammatical structure , even if readjusted , is not in itself a reliable predictor of prosodic phrasing : mismatches between syntax and prosody occur often and systematically , and can be related to specific nonsyntactic factors such as length and word frequency . Background train
A number of studies ( e.g. , Hildebrandt , Katz , and Lin 2004 ) have pointed out shortcomings of the original nugget scoring model , although a number of these issues have been recently addressed ( <CITED HERE>a , 2006b ) . Background train
<CITED HERE> reported that the translation lexicon that our model induced from this tiny bitext accounted for 30 % of the word types with precision between 84 % and 90 % . Background train
<CITED HERE> describe a simple tool which uses fine-grained rules to identify the arguments of verb occurrences in the Penn-II Treebank . Background train
The contextual interpreter then uses a reference resolution approach similar to <CITED HERE> , and an ontology mapping mechanism ( Dzikovska et al. , 2008a ) to produce a domain-specific semantic representation of the student 's output . CompareOrContrast train
<CITED HERE> maintains a survey of this area . Background train
The principle of maximum entropy states that when one searches among probability distributions that model the observed data ( evidence ) , the preferred one is the one that maximizes the entropy ( a measure of the uncertainty of the model ) ( <CITED HERE> ) . Background train
In the context of word alignment , <CITED HERE> use a state-duration HMM in order to model word-to-phrase translations . Background val
With respect to the focus on function words , our reordering model is closely related to the UALIGN system ( <CITED HERE> ) . CompareOrContrast train
We could also introduce new variables , e.g. , nonterminal refinements ( <CITED HERE> ) , or secondary links Mid ( not constrained by TREE/PTREE ) that augment the parse with representations of control , binding , etc. ( Sleator and Temperley , 1993 ; Buch-Kromann , 2006 ) . Future train
Selectional Preferences have also been a recent focus of researchers investigating the learning of paraphrases and inference rules ( <CITED HERE> ; Roberto et al. , 2007 ) . Background train
Previously LDA has been successfully used to infer unsupervised joint topic distributions over words and feature norms together ( Andrews et al. , 2009 ; <CITED HERE> ) . Background train
Something like this approach is in fact used in some systems ( e.g. , Elhadad and Robin 1992 ; PenMan 1989 ; <CITED HERE>a ) . Background train
The emphasis on narrativity takes into account the use of blogs as public diaries on the web , that is still the main current interpretation of this literary genre , or metagenre ( <CITED HERE> ) . Background train
For example , it would be helpful to consider strong correspondence between certain English and Chinese words , as in ( <CITED HERE> ) . Future train
The system utilizes several large size biological databases including three NCBI databases ( GenPept [ 11 ] , RefSeq [ 12 ] , and Entrez GENE [ 13 ] ) , PSD database from Protein Information Resources ( PIR ) <CITED HERE> , and Uses train
We take some core ideas from our previous work on mining script information ( <CITED HERE> ) . Extends train
While IA is generally thought to be consistent with findings on human language production ( Hermann and Deutsch 1976 ; Levelt 1989 ; <CITED HERE> ; Sonnenschein 1982 ) , the hypothesis that incrementality is a good model of human GRE seems unfalsifiable until a preference order is specified for the properties on which it operates . Background train
Typical examples are Bulgarian ( <CITED HERE> ; Simov and Osenova , 2003 ) , Chinese ( Chen et al. , 2003 ) , Danish ( Kromann , 2003 ) , and Swedish ( Nilsson et al. , 2005 ) . Background val
• A user study was performed , but it was either very small compared to the corpus ( Carmel , Shtalhaim , and Soffer 2000 ; Jijkoun and de Rijke 2005 ) , or the corpus itself was significantly smaller than ours ( <CITED HERE> ; Leuski et al. 2006 ) . CompareOrContrast train
Our work on the prosodic phrase status of clause final prepositional phrases , which we discuss below , suggests the existence of a discourse-neutral phrasing that depends on syntactic constituency mediated by string adjacency and length of a potential prosodic phrase . Such phrasing provides us with a typical phrasing pattern analogous to the typical phrasal stress patterns examined in <CITED HERE> , which `` are often overwhelmed by the chiaroscuro of highlight and background in discourse , but retain the status of null-hypothesis patterns that emerge when there is no good reason to take some other option '' ( p. 251 ) . CompareOrContrast train
Another line of research approaches grounded language knowledge by augmenting distributional approaches of word meaning with perceptual information ( Andrews et al. , 2009 ; <CITED HERE> ; Feng and Lapata , 2010b ; Bruni et al. , 2011 ; Silberer and Lapata , 2012 ; Johns and Jones , 2012 ; Bruni et al. , 2012a ; Bruni et al. , 2012b ; Silberer et al. , 2013 ) . Background val
When objects are compared in terms of several dimensions , these dimensions can be weighed in different ways ( e.g. , <CITED HERE> ) . Background train
Most DOP models , such as in Bod ( 1993 ) , Goodman ( 1996 ) , Bonnema et al. ( 1997 ) , <CITED HERE> and Collins & Duffy ( 2002 ) , use a likelihood criterion in defining the best parse tree : they take ( some notion of ) the most likely ( i.e. most probable ) tree as a candidate for the best tree of a sentence . Background train
While IA is generally thought to be consistent with findings on human language production ( Hermann and Deutsch 1976 ; <CITED HERE> ; Pechmann 1989 ; Sonnenschein 1982 ) , the hypothesis that incrementality is a good model of human GRE seems unfalsifiable until a preference order is specified for the properties on which it operates . Background train
Other molecular biology databases We also included several model organism databases or nomenclature databases in the construction of the dictionary , i.e. , mouse -- Mouse Genome Database ( MGD ) [ 18 ] , fly -- FlyBase <CITED HERE> , yeast -- Saccharomyces Genome Database ( SGD ) [ 20 ] , rat -- Rat Genome Database ( RGD ) [ 21 ] , worm -- WormBase [ 22 ] , Human Nomenclature Database ( HUGO ) [ 23 ] , Online Mendelian Inheritance in Man ( OMIM ) [ 24 ] , and Enzyme Nomenclature Database ( ECNUM ) [ 25 , 26 ] . Uses train
The RenTAL system automatically converts an FB-LTAG grammar into a strongly equivalent HPSG-style grammar ( <CITED HERE> ) . Background train
The features can be easily obtained by modifying the TAT extraction algorithm described in ( <CITED HERE> ) . Extends train
<CITED HERE> present preliminary work on the automatic extraction of subcategorization frames for Bulgarian from the BulTreeBank ( Simov , Popova , and Osenova 2002 ) . Background train
The use of the web as a corpus for teaching and research on language has been proposed a number of times ( Kilgarriff , 2001 ; Robb , 2003 ; Rundell , 2000 ; Fletcher , 2001 , 2004b ) and received a special issue of the journal Computational Linguistics ( <CITED HERE> ) . Background val
ASARES is based on a Machine Learning technique , Inductive Logic Programming ( ILP ) ( <CITED HERE> ) , which infers general morpho-syntactic patterns from a set of examples ( this set is noted E + hereafter ) and counter-examples ( E − ) of the elements one Background val
<CITED HERE> built a corpus by iteratively searching Google for a small set of seed terms . Background train
Expanding on a suggestion of <CITED HERE> , we classify verbs as Subject Equi , Object Equi , Subject Raising or Object Raising for each sense which has a predicate complement code associated with it . Extends train
This Principle of Finitism is also assumed by Johnson-Laird ( 1983 ) , <CITED HERE> , Kamp ( 1981 ) , and implicitly or explicitly by almost all researchers in computational linguistics . CompareOrContrast train
For more information on CATiB , see <CITED HERE> and Habash , Faraj , and Roth ( 2009 ) . Background train
With the exception of ( Fung , 1995b ) , previous methods for automatically constructing statistical translation models begin by looking at word cooccurrence frequencies in bitexts ( <CITED HERE> ; Kumano & Hirakawa , 1994 ; Fung , 1995a ; Melamed , 1995 ) . Background val
Experiments ( Section 5 ) show that forest-based extraction improves BLEU score by over 1 point on a state-of-the-art tree-to-string system ( Liu et al. , 2006 ; <CITED HERE> ) , which is also 0.5 points better than ( and twice as fast as ) extracting on 30-best parses . Extends train
Our approach to extract and classify social events builds on our previous work ( <CITED HERE> ) , which in turn builds on work from the relation extraction community ( Nguyen et al. , 2009 ) . Extends train
Previous sentiment-analysis work in different domains has considered inter-document similarity ( Agarwal and Bhattacharyya , 2005 ; <CITED HERE> ; Goldberg and Zhu , 2006 ) or explicit Background val
For example , speech repairs , particle omission , and fillers can be dealt with in the framework of unification grammar ( <CITED HERE> ; Nakano and Shimazu , 1999 ) . Future train
<CITED HERE> employed a Bayesian method to learn discontinuous SCFG rules . CompareOrContrast train
Table 5 shows our mapping from publication type and MeSH headings to evidence grades based on principles defined in the Strength of Recommendations Taxonomy ( <CITED HERE> ) . Uses train
We use the same splits as <CITED HERE> . Uses train
These results are slightly worse than those obtained in previous studies using the same annotation scheme ( <CITED HERE> ) , but are still sat - CompareOrContrast train
In <CITED HERE> , this flattening process is not part of the grammar . Background train
MEDLINE , the authoritative repository of abstracts from the medical and biomedical primary literature maintained by the National Library of Medicine , provides the clinically relevant sources for answering physicians ' questions , and is commonly used in that capacity ( <CITED HERE> ; De Groote and Dorsch 2003 ) . Background train
There is a rich literature on organization and lexical access of morphologically complex words where experiments have been conducted mainly for derivational suffixed words of English , Hebrew , Italian , French , Dutch , and a few other languages ( Marslen-Wilson et al. , 2008 ; <CITED HERE> ; Grainger et al. , 1991 ; Drews and Zwitserlood , 1995 ) . Background train
He lists , classifies , and discusses various types of inference , by which he means , generally , `` the linguistic-logical notions of consequent and presupposition '' <CITED HERE>:112 ) have collected convincing evidence of the existence of language chunks -- real structures , not just orthographic conventions -- that are smaller than a discourse , larger than a sentence , generally composed of sentences , and recursive in nature ( like sentences ) . Background val
For this research , we used a coreference resolution system ( ( <CITED HERE> ) ) that implements different sets of heuristics corresponding to various forms of coreference . Uses train
The work that is most similar to ours is that of <CITED HERE> , who introduced the Constraint Driven Learning algorithm ( CODL ) . CompareOrContrast train
For example , in ordinary HMM training , xi = E * and represents a completely hidden state sequence ( cf. <CITED HERE> , who allows any regular set ) , while yi is a single string representing a completely observed emission sequence . Background val
<CITED HERE> tried to solve the inflection prediction problem by simply building an SMT system for translating from stems to inflected forms . CompareOrContrast train
As shown in ( <CITED HERE> ) , using this representation , a linear classifier can not distinguish sentences sampled from a trigram and real sentences . Motivation train
Various approaches for computing semantic relatedness of words or concepts have been proposed , e.g. dictionary-based ( Lesk , 1986 ) , ontology-based ( Wu and Palmer , 1994 ; <CITED HERE> ) , information-based ( Resnik , 1995 ; Jiang and Conrath , 1997 ) or distributional ( Weeds and Weir , 2005 ) . Background train
This framework , where the `` semantic load '' is spread more evenly throughout the lexicon to lexical categories not typically thought to bear semantic content , is essentially the model advocated by <CITED HERE>a ) , among many others . Background val
<CITED HERE> also presents a similar method for the extraction of a TAG from the Penn Treebank . Background train
To a first approximation , a CURRENT-FOCUS reaches only nodes that are c-commanded ( <CITED HERE> ) by its generator . Background train
A previous work along this line is <CITED HERE> , which is based on weighted finite-state transducers ( FSTs ) . CompareOrContrast val
Most approaches rely on VerbNet ( Kipper et al. , 2000 ) and FrameNet ( Baker et al. , 1998 ) to provide associations between verbs and semantic roles , that are then mapped onto the current instance , as shown by the systems competing in semantic role labelling competitions ( Carreras and Marquez , 2004 ; Carreras and Marquez , 2005 ) and also ( Gildea and Jurafsky , 2002 ; <CITED HERE> ; Shi and Mihalcea , 2005 ) . Background val
<CITED HERE> describe an efficient algorithm for accomplishing this in which approximations to Pst ( TIS ) are computed in parallel for all ( new ) features ft by holding all weights in the existing model fixed and optimizing only over a8t . Background train
As suggested in <CITED HERE> this can be done by looking up the ranks of each of the four given words ( i.e. the words occurring in a particular word equation ) within the association vector of a translation candidate , and by multiplying these ranks . Motivation val
In certain cases an extension of the constraint language with named disjunctions or contexted constraints ( <CITED HERE> ; Eisele and Dorre 1990 ; Griffith 1996 ) can be used to circumvent constraint propagation . Background train
This article represents an extension of our previous work on unsupervised event coreference resolution ( Bejan et al. 2009 ; <CITED HERE> ) . Extends val
These include devices such as interleaving the components ( McDonald 1983 ; Appelt 1983 ) , backtracking on failure ( Appelt 1985 ; Nogier 1989 ) , allowing the linguistic component to interrogate the planner ( <CITED HERE> ; Sondheimer and Nebel 1986 ) , and Hovy 's notion of restrictive ( i.e. , bottom-up ) planning ( Hovy 1988a , 1988c ) . Background train
See , among others , ( <CITED HERE> ) . Background train
In Table 2 , lem refers to the LTAG parser ( <CITED HERE> ) , an ANSI C implementation of the two-phase parsing algorithm that performs the head corner parsing ( van Noord , 1994 ) without features ( phase 1 ) , and then executes feature unification ( phase 2 ) . CompareOrContrast train
<CITED HERE> and Burkett et al. ( 2010 ) focused on joint parsing and alignment . CompareOrContrast train
In modern syntactic theories ( e.g. , lexical-functional grammar [ LFG ] [ <CITED HERE> ; Bresnan 2001 ; Dalrymple 2001 ] , head-driven phrase structure grammar [ HPSG ] [ Pollard and Sag 1994 ] , tree-adjoining grammar [ TAG ] [ Joshi 1988 ] , and combinatory categorial grammar [ CCG ] [ Ades and Steedman 1982 ] ) , the lexicon is the central repository for much morphological , syntactic , and semantic information . Background train
For example , once books no longer had to be copied by hand , authors took advantage of this and started writing original books and evaluation -- i.e. literary criticism -- unlike in previous times ( <CITED HERE> ) . Background train
Furthermore , the availability of rich ontological resources , in the form of the Unified Medical Language System ( UMLS ) ( Lindberg et al. , 1993 ) , and the availability of software that leverages this knowledge -- MetaMap ( <CITED HERE> ) for concept identification and SemRep ( Rindflesch and Fiszman , 2003 ) for relation extraction -- provide a foundation for studying the role of semantics in various tasks . Background train
Since the language generation module works in parallel with the language understanding module , utterance generation is possible even while the system is listening to user utterances and utterance understanding is possible even while it is speaking ( <CITED HERE>a ) . Background train
This idea was expanded to include nouns and their modifiers through verb nominalizations ( <CITED HERE> ; Quirk et al. , 1985 ) . Background val
For example , such a schema can serve as a means to represent translation examples , or find structural correspondences for the purpose of transfer grammar learning ( <CITED HERE> ) , ( Aramaki et al. , 2001 ) , ( Watanabe et al. , 2000 ) , ( Meyers et al. , 2000 ) , ( Matsumoto et al. , 1993 ) , ( Kaji et al. , 1992 ) , and example-based machine translation ( EBMT ) ( Sato & Nagao , 1990 ) , ( Sato , 1991 ) , ( Richardson et al. , 2001 ) , ( Al-Adhaileh & Tang , 1999 ) . Background train
<CITED HERE> define a kernel over parse trees and apply it to re-ranking the output of a parser , but the resulting feature space is restricted by the need to compute the kernel efficiently , and the results are not as good as Collins ' previous work on re-ranking using a finite set of features ( Collins , 2000 ) . Background train
We only use the minimal GHKM rules ( <CITED HERE> ) here to reduce the complexity of the sampler . Uses val
FBLTAG ( <CITED HERE> ; Vijay-Shanker and Joshi , 1988 ) is an extension of the LTAG formalism . Background train
In addition to headwords , dictionary search through the pronunciation field is available ; <CITED HERE> has merged information from the pronunciation and hyphenation fields , creating an enhanced phonological representation which allows access to entries by broad phonetic class and syllable structure ( Huttenlocher and Zue , 1983 ) . Uses train
Research on shallow parsing was inspired by psycholinguistics arguments ( <CITED HERE> ) that suggest that in many scenarios ( e.g. , conversational ) full parsing is not a realistic strategy for sentence processing and analysis , and was further motivated by several arguments from a natural language engineering viewpoint . Background train
Despite this , to date , there has been little work on corpus-based approaches to help-desk response automation ( notable exceptions are Carmel , Shtalhaim , and Soffer 2000 ; Lapalme and Kosseim 2003 ; <CITED HERE> ; Malik , Subramaniam , and Kaushik 2007 ) . Background train
results are based on a corpus of movie subtitles ( Tiedemann 2007 ) , and are consequently shorter sentences , whereas the En → Es results are based on a corpus of parliamentary proceedings ( <CITED HERE> ) . Uses train
A statistical technique which has recently become popular for NLP is Maximum Entropy/Minimum Divergence ( MEMD ) modeling ( <CITED HERE> ) . Uses train
Our baseline coreference system uses the C4.5 decision tree learner ( <CITED HERE> ) to acquire a classifier on the training texts for determining whether two NPs are coreferent . Uses train
This alignment is obtained by following the same set of rules learned from the development dataset as in ( <CITED HERE> ) . Uses train
Developed Systems Our developed system is built on the work by <CITED HERE> , using Constrained Latent Left-Linking Model ( CL3M ) as our mention-pair coreference model in the joint framework10 . Uses train
The best performance on the Brown corpus , a 0.2 % error rate , was reported by <CITED HERE> , who trained a decision tree classifier on a 25-million-word corpus . CompareOrContrast train
Most approaches rely on VerbNet ( Kipper et al. , 2000 ) and FrameNet ( Baker et al. , 1998 ) to provide associations between verbs and semantic roles , that are then mapped onto the current instance , as shown by the systems competing in semantic role labelling competitions ( Carreras and Marquez , 2004 ; Carreras and Marquez , 2005 ) and also ( Gildea and Jurafsky , 2002 ; Pradhan et al. , 2005 ; <CITED HERE> ) . Background train
These include devices such as interleaving the components ( McDonald 1983 ; <CITED HERE> ) , backtracking on failure ( Appelt 1985 ; Nogier 1989 ) , allowing the linguistic component to interrogate the planner ( Mann 1983 ; Sondheimer and Nebel 1986 ) , and Hovy 's notion of restrictive ( i.e. , bottom-up ) planning ( Hovy 1988a , 1988c ) . Background val
The system was trained on the Penn Treebank ( Marcus et al. , 1993 ) WSJ Sections 2-21 and tested on Section 23 ( Table 1 ) , same as used by Magerman ( 1995 ) , Collins ( 1997 ) , and <CITED HERE> , and became a common testbed . CompareOrContrast train
<CITED HERE> was the first scholar who stressed the impact of the digital revolution on the medium of writing . Background train
A similar problem is discussed in the psycholinguistics of interpretation ( <CITED HERE> ) : Interpretation is widely assumed to proceed incrementally , but vague descriptions resist strict incrementality , since an adjective in a vague description can only be fully interpreted when its comparison set is known . Background train
Much of theoretical linguistics can be formulated in a very natural manner as stating correspondences ( translations ) between layers of representation structures ( <CITED HERE> ) . Background train
In our previous work ( <CITED HERE> ) , we applied this method to a small subset of WordNet nouns and showed potential applicability . Extends train
Much of the earlier work in anaphora resolution heavily exploited domain and linguistic knowledge ( Sidner 1979 ; Carter 1987 ; <CITED HERE> ; Carbonell and Brown 1988 ) , which was difficult both to represent and to process , and which required considerable human input . Background train
the mention sub-type , which is a sub-category of the mention type ( <CITED HERE> ) ( e.g. OrgGovernmental , FacilityPath , etc. ) . Uses train
For more details on the properties of SSTC , see <CITED HERE> . Background train
This has been reported for other languages , too , dependent on the generality of the chosen approach ( Jäppinen and Niemistö , 1988 ; Choueka , 1990 ; <CITED HERE> ; Ekmekçioglu et al. , 1995 ; Hedlund et al. , 2001 ; Pirkola , 2001 ) . Background train
Specifically , we used Decision Graphs ( Oliver 1993 ) for Doc-Pred , and SVMs ( <CITED HERE> ) for Sent-Pred . Additionally , we used unigrams for clustering documents and sentences , and unigrams and bigrams for predicting document clusters and sentence clusters ( Sections 3.1.2 and 3.2.2 ) . Uses train
To prepare SMT outputs for post-editing , the creators of the corpus used their own WMT10 system ( Potet et al. , 2010 ) , based on the Moses phrase-based decoder ( <CITED HERE> ) with dense features . Uses val
To model o ( Li , S → T ) , o ( Ri , S → T ) , i.e. the reordering of the neighboring phrases of a function word , we employ the orientation model introduced by <CITED HERE> . Uses val
The syntactic structures of the input data are produced by a parser with good coverage and detailed syntactic information , DIPETT ( <CITED HERE> ) . Uses train
In particular , ( <CITED HERE> ) lists the converses of some 3 500 predicative nouns . Future train
This is similar to the `` deletion '' strategy employed by <CITED HERE> , but we do it directly in the grammar . CompareOrContrast val
Although this study falls under the general topic of discourse modeling , our work differs from previous attempts to characterize text in terms of domain-independent rhetorical elements ( McKeown , 1985 ; <CITED HERE> ) . CompareOrContrast train
Many NLP applications require knowledge about semantic relatedness rather than just similarity ( <CITED HERE> ) . Background train
Other work on modeling the meanings of verbs using video recognition has also begun showing great promise ( <CITED HERE> ; Regneri et al. , 2013 ) . Background train
It has been shown ( <CITED HERE> ) that the subcategorization tendencies of verbs vary across linguistic domains . Motivation train
In the system , we extract both the minimal GHKM rules ( <CITED HERE> ) , and the rules of SPMT Model 1 ( Galley et al. , 2006 ) with phrases up to length L = 5 on the source side . Uses train
Note that this ensures that greater importance is attributed to longer chunks , as is usual in most EBMT systems ( cf. Sato and Nagao 1990 ; <CITED HERE> ; Carl 1999 ) . As an example , consider the translation into French of the house collapsed . Background train
One of the better-known approaches is described in <CITED HERE> , which suggested that abbreviations first be extracted from a corpus using abbreviation-guessing heuristics akin to those described in Section 6 and then reused in further processing . Background train
Unlike the models proposed by <CITED HERE>b ) , this model is symmetric , because both word bags are generated together from a joint probability distribution . CompareOrContrast val
We apply two different priming experiments namely , the cross modal priming and masked priming experiment discussed in ( Forster and Davis , 1984 ; Rastle et al. , 2000 ; <CITED HERE> ; Marslen-Wilson et al. , 2008 ) for Bangla morphologically complex words . Uses train
( Watanabe et al. , 2007 ; Chiang et al. , 2008 ; <CITED HERE> ) proposed other optimization objectives by introducing margin-based and ranking-based indirect loss functions . Background train
Our work extends directions taken in systems such as Ariane ( Vauquois and Boitet , 1985 ) , FoG ( Kittredge and Polguere , 1991 ) , JOYCE ( Rambow and <CITED HERE> ) , and LFS ( Iordanskaja et al. , 1992 ) . Extends train
Since the arguments can provide useful semantic information , the SRL is crucial to many natural language processing tasks , such as Question and Answering ( Narayanan and Harabagiu 2004 ) , Information Extraction ( Surdeanu et al. 2003 ) , and Machine Translation ( <CITED HERE> ) . Background train
Such systems extract information from some types of syntactic units ( clauses in ( Fillmore and Atkins , 1998 ; Gildea and Jurafsky , 2002 ; Hull and Gomez , 1996 ) ; noun phrases in ( Hull and Gomez , 1996 ; <CITED HERE> ) ) . Background val
Association Norms ( AN ) is a collection of association norms collected by Schulte im <CITED HERE> . Uses train
Stanford University is developing the English Resource Grammar , an HPSG grammar for English , as a part of the Linguistic Grammars Online ( LinGO ) project ( <CITED HERE> ) . Background train
The changes made were inspired by those described in <CITED HERE> , page 75 ) . Motivation train
The grammar code system used in LDOCE is based quite closely on the descriptive grammatical framework of <CITED HERE> , 1985 ) . Extends train
Our task was made possible by the fact that while far from being a database in the accepted sense of the word , the LDOCE typesetting tape is the only truly computerised dictionary of English ( <CITED HERE> ) . Background train
A detailed description of the kinds of expectation mechanisms appearing in these systems appears in <CITED HERE> . Background train
Using the GHKM algorithm ( <CITED HERE> ) , we can get two different STSG derivations from the two U-trees based on the fixed word alignment . Uses train
The need for information systems to support physicians at the point of care has been well studied ( Covell et al. , 1985 ; Gorman et al. , 1994 ; <CITED HERE> ) . Background train
Task properties Determining whether or not a speaker supports a proposal falls within the realm of sentiment analysis , an extremely active research area devoted to the computational treatment of subjective or opinion-oriented language ( early work includes Wiebe and Rapaport ( 1988 ) , Hearst ( 1992 ) , <CITED HERE> , and Wiebe ( 1994 ) ; see Esuli ( 2006 ) for an active bibliography ) . Background val
Table look-up using an explicit translation lexicon is sufficient and preferable for many multilingual NLP applications , including `` crummy '' MT on the World Wide Web ( Church & Hovy , 1993 ) , certain machine-assisted translation tools ( e.g. ( Macklovitch , 1994 ; Melamed , 1996b ) ) , concordancing for bilingual lexicography ( <CITED HERE> ; Gale & Church , 1991 ) , computer-assisted language learning , corpus linguistics ( Melby . Background val
Our HDP extension is also inspired by the Bayesian model proposed by <CITED HERE> . Motivation train
The EDR has close ties to the named entity recognition ( NER ) and coreference resolution tasks , which have been the focus of several recent investigations ( Bikel et al. , 1997 ; Miller et al. , 1998 ; <CITED HERE> ; Mikheev et al. , 1999 ; Soon et al. , 2001 ; Ng and Cardie , 2002 ; Florian et al. , 2004 ) , and have been at the center of evaluations such as : MUC-6 , MUC-7 , and the CoNLL '02 and CoNLL '03 shared tasks . Background train
As has been previously observed and exploited in the NLP literature ( Pang and Lee , 2004 ; <CITED HERE> ; Barzilay and Lapata , 2005 ) , the above optimization function , unlike many others that have been proposed for graph or set partitioning , can be solved exactly in a provably efficient manner via methods for finding minimum cuts in graphs . CompareOrContrast train
To combine the phrasal matching scores obtained at each n-gram level , and optimize their relative weights , we trained a Support Vector Machine classifier , SVMlight ( <CITED HERE> ) , using each score as a feature . Uses train
Against the background of a growing interest in multilingual NLP , multilingual anaphora / coreference resolution has gained considerable momentum in recent years ( Aone and McKee 1993 ; Azzam , Humphreys , and Gaizauskas 1998 ; Harabagiu and Maiorano 2000 ; <CITED HERE> ; Mitkov 1999 ; Mitkov and Stys 1997 ; Mitkov , Belguith , and Stys 1998 ) . Background train
For an overview of systems designed to answer open-domain factoid questions , the TREC QA track overview papers are a good place to start ( <CITED HERE> ) . Background train
Another interesting extension is to broaden the definition of a `` word '' to include multi-word lexical units ( <CITED HERE> ) . Future train
For example , 10 million words of the American National Corpus ( <CITED HERE> ) will have manually corrected POS tags , a tenfold increase over the Penn Treebank ( Marcus et al. , 1993 ) , currently used for training POS taggers . Background train
From an IR view , a lot of specialized research has already been carried out for medical applications , with emphasis on the lexico-semantic aspects of dederivation and decomposition ( Pacak et al. , 1980 ; <CITED HERE> ; Wolff , 1984 ; Wingert , 1985 ; Dujols et al. , 1991 ; Baud et al. , 1998 ) . Background train
This approach has occasionally been taken , as in Kantrowitz and Bates ( 1992 ) and Danlos ( 1987 ) and , at least implicitly , in <CITED HERE> and Delin et al. ( 1994 ) ; however , under this approach , all of the flexibility and simplicity of modular design is lost . Background val
Typical examples are Bulgarian ( Simov et al. , 2005 ; Simov and Osenova , 2003 ) , Chinese ( <CITED HERE> ) , Danish ( Kromann , 2003 ) , and Swedish ( Nilsson et al. , 2005 ) . Background train
In this paper , we use the Constrained Latent Left-Linking Model ( CL3M ) described in <CITED HERE> in our experiments . Uses train
We use a standard split of 268 training documents , 68 development documents , and 106 testing documents ( <CITED HERE> ; Bengtson and Roth , 2008 ) . Uses train
Aside from the extraction of theory-neutral subcategorization lexicons , there has also been work in the automatic construction of lexical resources which comply with the principles of particular linguistic theories such as LTAG , CCG , and HPSG ( Chen and Vijay-Shanker 2000 ; <CITED HERE> ; Hockenmaier , Bierner , and Baldridge 2004 ; Nakanishi , Miyao , and Tsujii 2004 ) . Background train
An alternative representation based on <CITED HERE> is presented in Selkirk ( 1984 ) , which contends that prosody , including prosodic phrasing , is more properly represented as a grid instead of a tree . CompareOrContrast train
For the joint segmentation and POS-tagging task , we present a novel solution using the framework in this article , and show that it gives comparable accuracies to our previous work ( <CITED HERE>a ) , while being more than an order of magnitude faster . CompareOrContrast val
According to the data available from 1990 U.S. Census Bureau , only 90,000 different names are shared by 100 million people ( <CITED HERE> ) . Background train
Since then this idea has been applied to several tasks , including word sense disambiguation ( <CITED HERE> ) and named-entity recognition ( Cucerzan and Yarowsky 1999 ) . Background val
• History-based feature models for predicting the next parser action ( <CITED HERE> ) . Uses train
For each co-occurring pair of word types u and v , these likelihoods are initially set proportional to their co-occurrence frequency n ( u , v ) and inversely proportional to their marginal frequencies n ( u ) and n ( v ) , following ( <CITED HERE> ) . Uses train
However , rather than output this wrong translation directly , we use a post hoc validation and ( if required ) correction process based on <CITED HERE> . Uses train
Most approaches rely on VerbNet ( Kipper et al. , 2000 ) and FrameNet ( Baker et al. , 1998 ) to provide associations between verbs and semantic roles , that are then mapped onto the current instance , as shown by the systems competing in semantic role labelling competitions ( Carreras and Marquez , 2004 ; Carreras and Marquez , 2005 ) and also ( Gildea and Jurafsky , 2002 ; Pradhan et al. , 2005 ; <CITED HERE> ) . Background val
The powerful mechanism of lexical rules ( <CITED HERE> ) has been used in many natural language processing systems . Background train
The version proposed here combines a basic insight from Lewin ( 1990 ) with higher-order unification to give an analysis that has a strong resemblance to that proposed in <CITED HERE> , 1991 ) , with some differences that are commented on below . CompareOrContrast train
<CITED HERE> presented a perceptron-based algorithm for learning the phrase-translation parameters in a statistical machine translation system . CompareOrContrast train
NLG has to do more than select a distinguishing description ( i.e. , one that unambiguously denotes its referent ; <CITED HERE> ) : The selected expression should also be felicitous . Background train
<CITED HERE> , 1997 ) conducts some small experiments using his METLA system to show the viability of this approach for English → French and English → Urdu . Background train
This idea of preserving properties can be considered an instance of the well-known frame problem in AI ( <CITED HERE> ) , and we will therefore refer to the specifications left implicit by the linguist as the frame specification , or simply frame , of a lexical rule . CompareOrContrast train
All experiments have been performed using MaltParser ( <CITED HERE> ) , version 0.4 , which is made available together with the suite of programs used for pre- and post-processing . Uses val
One area of current interest concerns the left-to-right arrangement of premodifying adjectives within an NP ( e.g. , Shaw and Hatzivassiloglou 1999 ; <CITED HERE> ) . Background train
Better results would be expected by combining the PCFG-LA parser with discriminative reranking approaches ( <CITED HERE> ; Huang , 2008 ) for self training . Future train
Some researchers ( <CITED HERE> ; Nguyen and Cao , 2008 ) have explored the use of Wikipedia information to improve the disambiguation process . Background train
Others have applied the NLP technologies of near-duplicate detection and topic-based text categorization to politically oriented text ( <CITED HERE> ; Purpura and Hillard , 2006 ) . Background val
We chose to follow <CITED HERE> and split the sentences evenly to facilitate further comparison . Uses train
That is , a document that contains terms a1 , a2 and a3 may be ranked higher than a document which contains terms a1 and b.f. However , the second document is more likely to be relevant since correct translations of the query terms are more likely to co-occur ( <CITED HERE> ) . Background train
The last years have seen considerable advances in the field of anaphora resolution , but a number of outstanding issues either remain unsolved or need more attention and , as a consequence , represent major challenges to the further development of the field ( <CITED HERE>a ) . Future train
Using an accumulator passing technique ( <CITED HERE> ) , we ensure that upon execution of a call to the interaction predicate q_1 a new lexical entry is derived as the result of successive application of a number of lexical rules . Uses train
However , most strategies are based on `` internal '' or `` external methods '' ( <CITED HERE> ) , i.e. methods that rely on the form of terms or on the information gathered from contexts . CompareOrContrast train
With the exception of ( Fung , 1995b ) , previous methods for automatically constructing statistical translation models begin by looking at word cooccurrence frequencies in bitexts ( Gale & Church , 1991 ; Kumano & Hirakawa , 1994 ; Fung , 1995a ; <CITED HERE> ) . Background train
A substring in the sentence that corresponds to a node in the representation tree is denoted by assigning the interval of the substring to SNODE of 2 These definitions are based on the discussion in ( <CITED HERE> ) and Boitet & Zaharin ( 1988 ) . Uses train
The Web People Search task , as defined in the first WePS evaluation campaign ( <CITED HERE> ) , consists of grouping search results for a given name according to the different people that share it . Background train
Although this is only true in cases where y occurs in an upward monotone context ( <CITED HERE> ) , in practice genuine contradictions between y-values sharing a meronym relationship are extremely rare . Motivation train
We use the same data setting as <CITED HERE> , which differs slightly from that of Xue and Palmer ( 2005 ) . Uses train
Erk ( 2007 ) compared a number of techniques for creating similar-word sets and found that both the Jaccard coefficient and <CITED HERE>a ) 's information-theoretic metric work best . Background train
We previously showed that incorporating this intuition into a Bayesian prior can help train a CCG supertagger ( <CITED HERE> ) . Extends val
To turn this likelihood into a certainty , one can add a test at the end of the algorithm , which adds a type-related property if none is present yet ( cf. <CITED HERE> ) . Background train
For descriptions of SMT systems see for example ( Germann et al. , 2001 ; <CITED HERE> ; Tillmann and Ney , 2002 ; Vogel et al. , 2000 ; Wang and Waibel , 1997 ) . Background train
Such tools make it easy to run most current approaches to statistical markup , chunking , normalization , segmentation , alignment , and noisy-channel decoding , including classic models for speech recognition ( <CITED HERE> ) and machine translation ( Knight and Al-Onaizan , 1998 ) . Background train
It is analogous to the step in other translation model induction algorithms that sets all probabilities below a certain threshold to negligible values ( Brown et al. , 1990 ; Dagan et al. , 1993 ; <CITED HERE> ) . CompareOrContrast train
The ConTroll grammar development system as described in ( <CITED HERE>b ) implements the above mentioned techniques for compiling an HPSG theory into typed feature grammars . Background train
After calculating the raw score of each sentence , we use a modified version of the Adaptive Greedy Algorithm by <CITED HERE> to penalize redundant sentences in cohesive clusters . Uses train
If each word 's translation is treated as a sense tag ( <CITED HERE> ) , then `` translational '' collocations have the unique property that the collocate and the word sense are one and the same ! Uses train
The values of a vector correspond to the presence or absence of each ( lemmatized ) corpus word in the document in question ( after removing stop-words and words with very low frequency ) . The predictive model is a Decision Graph ( <CITED HERE> ) , which , like Snob , is based on the MML principle . Uses train
Therefore , in order to be able to incorporate long-range dependencies in our models , we chose to adopt a re-ranking approach ( <CITED HERE> ) , which selects from likely assignments generated by a model which makes stronger independence assumptions . Uses val
However , it is possible to think about constraining linguistic or logical predicates by simulating physical experiences ( cf. <CITED HERE> ) . Background train
One , the VOYAGER domain ( <CITED HERE> ) , answers questions about places of interest in an urban area , in our case , the vicinity of MIT and Harvard University . Uses train
Our task is closer to the work of <CITED HERE> , who looked at the problem of intellectual attribution in scientific texts . CompareOrContrast val
In the seminal work by <CITED HERE> , similarity judgments were obtained from 51 test subjects on 65 noun pairs written on paper cards . Background train
( <CITED HERE> ; Blunsom et al. , 2008 ) used maximum likelihood estimation to learn weights for MT. ( Och , 2003 ; Moore and Quirk , 2008 ; Zhao and Chen , 2009 ; Galley and Quirk , 2011 ) employed an evaluation metric as a loss function and directly optimized it . CompareOrContrast val
<CITED HERE> , p. 14 ) writes `` it would be perverse not to take as a working assumption that language is a relatively efficient and accurate encoding of the information it conveys . '' CompareOrContrast train
See also ( <CITED HERE> ; Naish , 1986 ) . Background train
Michiels ( 1982 ) and <CITED HERE> provide a more detailed analysis of the information encoded by the LDOCE grammar codes and discuss their efficacy as a system of linguistic description . Background val
We parsed the 3 GB AQUAINT corpus ( Voorhees , 2002 ) using Minipar ( <CITED HERE>b ) , and collected verb-object and verb-subject frequencies , building an empirical MI model from this data . Uses train
More specifically , we use LIBSVM ( <CITED HERE> ) with a quadratic kernel $K(x_i, x_j) = (\gamma x_i^T x_j + r)^2$ and the built-in one-versus-all strategy for multi-class classification . Uses train
It is wasteful to compute $t_i$ as suggested earlier , by minimizing $(\epsilon \times x_i) \circ f \circ (y_i \times \epsilon)$ , since then the real work is done by an $\epsilon$-closure step ( <CITED HERE> ) that implements the all-pairs version of algebraic path , whereas all we need is the single-source version . Background train
positional features that have been employed by high-performing resolvers such as <CITED HERE> . we can see , the baseline achieves an F-measure of 57.0 and a resolution accuracy of 48.4 . CompareOrContrast train
Such a component would serve as the first stage of a clinical question answering system ( Demner-Fushman and Lin , 2005 ) or summarization system ( <CITED HERE> ) . Future train
Briscoe and Carroll ( 1997 ) predefine 163 verbal subcategorization frames , obtained by manually merging the classes exemplified in the COMLEX ( MacLeod , Grishman , and Meyers 1994 ) and ANLT ( <CITED HERE> ) dictionaries and adding around 30 frames found by manual inspection . Background train
The idea resurfaced forcefully at several points in the more recent history of linguistic research ( Tesnière , 1959 ; <CITED HERE> ; Fillmore , 1968 ) . Background train
It is therefore no surprise that early attempts at response automation were knowledge-driven ( Barr and Tessler 1995 ; Watson 1997 ; <CITED HERE> ) . Background val
For example , the suite of LT tools ( Mikheev et al. , 1999 ; <CITED HERE> ) perform tokenization , tagging and chunking on XML marked-up text directly . Background train
It is only recently that the web name ambiguity has been approached as a separate problem and defined as an NLP task Web People Search on its own ( Artiles et al. , 2005 ; <CITED HERE> ) . Uses train
To retrieve translation examples for a test sentence , ( <CITED HERE> ) defined a metric based on the combination of edit distance and TF-IDF ( Manning and Schütze , 1999 ) as follows : Uses train
By using the EM algorithm ( <CITED HERE> ) , they can guarantee convergence towards the globally optimum parameter set . Background train
• use of low level knowledge from the speech recognition phase , • use of high level knowledge about the domain in particular and the dialogue task in general , • a `` continue '' facility and an `` auto-loop '' facility as described by <CITED HERE> , • a `` conditioning '' facility as described by Fink et al. ( 1985 ) , • implementation of new types of paraphrasing , • checking a larger environment in the expectation acquisition algorithm when deciding if an incoming sentence is the same or similar to one already seen , and • examining inter-speaker dialogue patterns . Future train
Previous work has developed various approaches for grounded semantics mainly for the reference resolution task , i.e. , identifying visual objects in the environment given language descriptions ( Dhande , 2003 ; Gorniak and Roy , 2004 ; Tenbrink and Moratz , 2003 ; Siebert and Schlangen , 2008 ; <CITED HERE> ) . Extends val
In most recent research , NEs ( person , location and organisations ) are extracted from the text and used as a source of evidence to calculate the similarity between documents - see for instance ( Blume , 2005 ; Chen and Martin , 2007 ; <CITED HERE> ; Kalashnikov et al. , 2007 ) . Background train
To provide the required configurability in the static version of the code we will use policy templates ( <CITED HERE> ) , and for the dynamic version we will use configuration classes . Uses train
<CITED HERE> demonstrates a technique for segmenting Arabic text and uses it as a morphological processing step in machine translation . Uses train
In informal experiments described elsewhere ( <CITED HERE> ) , I found that the G2 statistic suggested by Dunning ( 1993 ) slightly outperforms φ2 . Extends train
• language learning ( Green 1979 ; Mori and Moeser 1983 ; Morgan , Meier , and Newport 1989 ) • monolingual grammar induction ( Juola 1998 ) • grammar optimization ( <CITED HERE> ) • insights into universal grammar ( Juola 1998 ) • machine translation ( Juola 1994 , 1997 ; Veale and Way 1997 ; Gough , Way , and Hearne 2002 ) Background train
A recent study by <CITED HERE> also investigates the task of training parsers to improve MT reordering . CompareOrContrast train
Due to their remarkable ability to incorporate context structure information and long distance reordering into the translation process , tree-based translation models have shown promising progress in improving translation quality ( Liu et al. , 2006 , 2009 ; Quirk et al. , 2005 ; <CITED HERE> , 2006 ; Marcu et al. , 2006 ; Shen et al. , 2008 ; Zhang et al. , 2011b ) . Background train
<CITED HERE> show that the question of whether the application criterion of lexical rules should be a subsumption or a unification test is an important question deserving of more attention . Background val
Latent Dirichlet Allocation ( <CITED HERE> ) , or LDA , is an unsupervised Bayesian probabilistic model of text documents . Background train
Regarding future work , there are many research lines that may be followed : i ) Capturing more features by employing external knowledge such as ontological , lexical resource or WordNet-based features ( Basili et al. , 2005a ; Basili et al. , 2005b ; Bloehdorn et al. , 2006 ; Bloehdorn and Moschitti , 2007 ) or shallow semantic trees ( Giuglea and Moschitti , 2004 ; <CITED HERE> ; Moschitti and Bejan , 2004 ; Moschitti et al. , 2007 ; Moschitti , 2008 ; Moschitti et al. , 2008 ) . Future train
Third , the paradigm of evidence-based medicine ( <CITED HERE> ) provides a task-based model of the clinical information-seeking process . Background train
<CITED HERE> introduced factored SMT . CompareOrContrast train
Both systems are built around the maximum-entropy technique ( <CITED HERE> ) . Uses train
We chose the adjectives as follows : we first compiled a list of all the polysemous adjectives mentioned in the lexical semantics literature ( Vendler , 1968 ; <CITED HERE> ) . Uses train
Most web-derived corpora have exploited raw text or HTML pages , so efforts have focussed on boilerplate removal and cleanup of these formats with tools like Hyppia-BTE , Tidy and Parcels3 ( <CITED HERE> ) . Background train
Undesirable consequences of this fact have been termed `` label bias '' ( <CITED HERE> ) . Background train
For the full parser , we use the one developed by Michael Collins ( <CITED HERE> ; Collins , 1997 ) -- one of the most accurate full parsers around . Uses train
Some researchers , however , including <CITED HERE> , train on predicted feature values instead . CompareOrContrast train
Their computational significance arises from the issue of their storage in lexical resources like WordNet ( <CITED HERE> ) and raises questions such as how to store morphologically complex words in a lexical resource like WordNet while keeping storage and access efficiency in mind . Background train
Fortunately , there exists a compact PCFG-reduction of DOP1 that generates the same trees with the same probabilities , as shown by <CITED HERE> , 2002 ) . Background train
<CITED HERE> claims that prosodic phrase boundaries will co-occur with grammatical functions such as subject , predicate , modifier , and adjunct . Background train
We rephrase the method of <CITED HERE> as follows : First , we construct the approximating finite automaton according to the unparameterized RTN method above . Uses train
The analysis is reminiscent of the treatment of coordination in the Collins parser ( <CITED HERE> ) . CompareOrContrast val
The work is carried out in order to assist terminographers in the enrichment of a dictionary on computing that includes collocational information ( <CITED HERE> ) . Motivation train
CommandTalk ( <CITED HERE> ) , Circuit Fix-It Shop ( Smith , 1997 ) and TRAINS-96 ( Traum and Allen , 1994 ; Traum and Andersen , 1999 ) are spoken language systems but they interface to simulation or help facilities rather than semi-autonomous agents . Background train
The annotation procedure is dependent on locating the head daughter , for which an amended version of <CITED HERE> is used . Uses val
<CITED HERE> ; Oliva 1994 ; Frank 1994 ; Opalka 1995 ; Sanfilippo 1995 ) . CompareOrContrast train
For instance , <CITED HERE> recently wrote : `` To our knowledge , learning algorithms , although promising , have not ( yet ) reached the level of rule sets developed by humans '' ( p. 520 ) . Background train
Much previous work looks at the impact of using source side information ( i.e. , feature functions on the aligned English ) , such as those of Avramidis and Koehn ( 2008 ) , <CITED HERE> and others . CompareOrContrast train
We use an in-house statistical tagger ( based on ( <CITED HERE> ) ) to tag the text in which the unknown word occurs . Uses train
Based on a computational grammar that associates natural language expressions with both a syntactic and a semantic representation , a paraphrastic gram ` As we shall briefly discuss in section 4 , the grammar is developed with the help of a meta-grammar ( <CITED HERE> ) thus ensuring an additional level of abstraction . Uses train
There are very few reported attempts at corpus-based automation of help-desk responses ( Carmel , Shtalhaim , and Soffer 2000 ; <CITED HERE> ; Bickel and Scheffer 2004 ; Malik , Subramaniam , and Kaushik 2007 ) . CompareOrContrast train
Both tasks are performed with a statistical framework : the mention detection system is similar to the one presented in ( <CITED HERE> ) and the coreference resolution system is similar to the one described in ( Luo et al. , 2004 ) . CompareOrContrast train
The disambiguation of person names in Web results is usually compared to two other Natural Language Processing tasks : Word Sense Disambiguation ( WSD ) ( Agirre and Edmonds , 2006 ) and Cross-document Coreference ( CDC ) ( <CITED HERE> ) . Background train
Our own work ( <CITED HERE> ) extends the first idea to paraphrase fragment extraction on monolingual parallel and comparable corpora . Extends train
They proved to be useful in a number of NLP applications such as natural language generation ( <CITED HERE> ) , multidocument summarization ( McKeown et al. , 2002 ) , automatic evaluation of MT ( Denkowski and Lavie , 2010 ) , and TE ( Dinu and Wang , 2009 ) . Motivation train
Current state-of-the-art statistical parsers ( <CITED HERE> ; Charniak 2000 ) are all trained on large annotated corpora such as the Penn Treebank ( Marcus , Santorini , and Marcinkiewicz 1993 ) . Background train
Although the parser only derives projective graphs , the fact that graphs are labeled allows non-projective dependencies to be captured using the pseudoprojective approach of <CITED HERE> . Background train
Riehemann 1993 ; Oliva 1994 ; <CITED HERE> ; Opalka 1995 ; Sanfilippo 1995 ) . CompareOrContrast train
The idea resurfaced forcefully at several points in the more recent history of linguistic research ( Tesnière , 1959 ; Gruber , 1965 ; <CITED HERE> ) . Background train
A detailed introduction to the SBD problem can be found in <CITED HERE> . Background train
Task properties Determining whether or not a speaker supports a proposal falls within the realm of sentiment analysis , an extremely active research area devoted to the computational treatment of subjective or opinion-oriented language ( early work includes Wiebe and Rapaport ( 1988 ) , Hearst ( 1992 ) , Sack ( 1994 ) , and <CITED HERE> ; see Esuli ( 2006 ) for an active bibliography ) . Background train
The BEETLE II system architecture is designed to overcome these limitations ( <CITED HERE> ) . Background train
For the evaluation of the results we use the BLEU score ( <CITED HERE> ) . Uses train
There are very few reported attempts at corpus-based automation of help-desk responses ( Carmel , Shtalhaim , and Soffer 2000 ; Lapalme and Kosseim 2003 ; <CITED HERE> ; Malik , Subramaniam , and Kaushik 2007 ) . CompareOrContrast train
For an introduction to maximum entropy modeling and training procedures , the reader is referred to the corresponding literature , for instance ( <CITED HERE> ) or ( Ratnaparkhi , 1997 ) . Background train
This design idea was adopted from TANKA ( <CITED HERE>b ) . Uses train
The problem of handling ill-formed input has been studied by <CITED HERE> , Granger ( 1983 ) , Jensen et al. ( 1983 ) , Kwasny and Sondheimer ( 1981 ) , Riesbeck and Schank ( 1976 ) , Thompson ( 1980 ) , Weischedel and Black ( 1980 ) , and Weischedel and Sondheimer ( 1983 ) . CompareOrContrast train
In addition to its explanatory capacity , this symbolic acquisition technique has obtained good results for other acquisition tasks when compared to existing statistical techniques ( <CITED HERE> ) . Motivation val
The context of a current token ti is clearly one of the most important features in predicting whether ti is a mention or not ( <CITED HERE> ) . Background train
Features using the word context ( left and right tokens ) have been shown to be very helpful in coreference resolution ( <CITED HERE> ) . Uses train
Here , the PET and GR kernel perform similarly : this is different from the results of ( <CITED HERE> ) where GR performed much worse than PET for ACE data . CompareOrContrast val
In modern syntactic theories ( e.g. , lexical-functional grammar [ LFG ] [ Kaplan and Bresnan 1982 ; Bresnan 2001 ; <CITED HERE> ] , head-driven phrase structure grammar [ HPSG ] [ Pollard and Sag 1994 ] , tree-adjoining grammar [ TAG ] [ Joshi 1988 ] , and combinatory categorial grammar [ CCG ] [ Ades and Steedman 1982 ] ) , the lexicon is the central repository for much morphological , syntactic , and semantic information . Background train
Some works abstract perception via the usage of symbolic logic representations ( Chen et al. , 2010 ; Chen and Mooney , 2011 ; <CITED HERE> ; Artzi and Zettlemoyer , 2013 ) , while others choose to employ concepts elicited from psycholinguistic and cognition studies . Background val
This is where robust syntactic systems like SATZ ( Palmer and Hearst 1997 ) or the POS tagger reported in <CITED HERE> , which do not heavily rely on word capitalization and are not sensitive to document length , have an advantage . CompareOrContrast train
Our work is inspired by the latent left-linking model in <CITED HERE> and the ILP formulation from Chang et al. ( 2011 ) . Background val
For example , <CITED HERE> proves that Chinese numerals such as wu zhao zhao zhao zhao zhao wu zhao zhao zhao zhao wu zhao zhao zhao wu zhao zhao wu zhao , for the number 5000000000000000005000000000000005000000000005000000005000 , are not context-free , which implies that Chinese is not a context-free language and thus might parse in exponential worst-case time . Background train
<CITED HERE> also note that the applicability of paraphrases is strongly influenced by context . Background train
Promising features for classification include part of speech , frequency of co-occurrence , relative word position , and translational entropy ( <CITED HERE> ) . Future train
For descriptions of SMT systems see for example ( <CITED HERE> ; Och et al. , 1999 ; Tillmann and Ney , 2002 ; Vogel et al. , 2000 ; Wang and Waibel , 1997 ) . Background train
like information extraction ( <CITED HERE> ) and textual entailment ( Berant et al. , 2010 ) . Background train
In a final processing stage , we generalize over the marker lexicon following a process found in <CITED HERE> . Uses val
One approach to this problem consists in defining , within the Cut-free atomic-id space , normal form derivations in which the succession of rule application is regulated ( <CITED HERE> , Hepple 1990 , Hendriks 1993 ) . Background train
The SPR uses rules automatically learned from training data , using techniques similar to ( <CITED HERE> ; Freund et al. , 1998 ) . CompareOrContrast train
<CITED HERE> have previously examined the task of categorizing sentences in medical abstracts using supervised discriminative machine learning techniques . CompareOrContrast train
For complementing this database and for converse constructions , the LADL tables ( <CITED HERE> ) can furthermore be resorted to , which list detailed syntactico-semantic descriptions for 5 000 verbs and 25 000 verbal expressions . Future train
These operations are not domain-specific and are similar to those of previous aggregation components ( Rambow and Korelsky ,1992 ; <CITED HERE> ; Danlos , 2000 ) , although the various MERGE operations are , to our knowledge , novel in this form . Background train
Following construction of the marker lexicon , the ( source , target ) chunks are generalized further using a methodology based on <CITED HERE> to permit a limited form of insertion in the translation process . Uses train
ment ( Sarkar and Wintner , 1999 ; <CITED HERE> ; Makino et al. , 1998 ) . Background train
The ability to explicitly identify these sections in unstructured text could play an important role in applications such as document summarization ( Teufel and Moens , 2000 ) , information retrieval ( Tbahriti et al. , 2005 ) , information extraction ( <CITED HERE> ) , and question answering . Background train
Recent developments in linguistics , and especially on grammatical theory -- for example , Generalised Phrase Structure Grammar ( GPSG ) ( Gazdar et al. , 1985 ) , Lexical Functional Grammar ( LFG ) ( Kaplan and Bresnan , 1982 ) -- and on natural language parsing frameworks -- for example , Functional Unification Grammar ( FUG ) ( Kay , 1984a ) , PATR-II ( <CITED HERE> ) -- make it feasible to consider the implementation of efficient systems for the syntactic analysis of substantial fragments of natural language . Background train
The latter question is tackled by application-specific evaluation , where a measure is tested within the framework of a certain application , e.g. word sense disambiguation ( <CITED HERE> ) or malapropism detection ( Budanitsky and Hirst , 2006 ) . Background train
<CITED HERE> argue for application-specific evaluation of similarity measures , because measures are always used for some task . Background train
The first direct application of parse forest in translation is our previous work ( <CITED HERE> ) which translates a packed forest from a parser ; it is also the base system in our experiments ( see below ) . Extends train
This interpretation of the signature is sometimes referred to as closed world ( Gerdemann and <CITED HERE> ; Gerdemann 1995 ) . Background train
In their Gaijin system , <CITED HERE> give a result of 63 % accurate translations obtained for English → German on a test set of 791 sentences from CorelDRAW manuals . CompareOrContrast train
This method can be generalized , inspired by <CITED HERE> , who derive N-gram probabilities from stochastic context-free grammars . Background train
The system was trained on the Penn Treebank ( Marcus et al. , 1993 ) WSJ Sections 2-21 and tested on Section 23 ( Table 1 ) , same as used by Magerman ( 1995 ) , <CITED HERE> , and Ratnaparkhi ( 1997 ) , and became a common testbed . CompareOrContrast val
The Nash arbitration plan , for example , would allow a doubly graded description whenever the product of the Values for the referent r exceeds that of all distractors ( Nash 1950 ; cf. <CITED HERE> ; Thorisson 1994 , for other plans ) . Background train
The paraphrase dictionary that we use was generated for us by Chris Callison-Burch , using the technique described in <CITED HERE> , which exploits a parallel corpus and methods developed for statistical machine translation . Uses train
To name a few examples , <CITED HERE> and Socher et al. ( 2013 ) show how semantic information from text can be used to improve zero-shot classification ( i.e. , classifying never-before-seen objects ) , and Motwani and Mooney ( 2012 ) show that verb clusters can be used to improve activity recognition in videos . Background val
This approach has now gained wide usage , as exemplified by the work of <CITED HERE> , 1999 ) , Charniak ( 1996 , 1997 ) , Johnson ( 1998 ) , Chiang ( 2000 ) , and many others . Motivation train
For example , such a schema can serve as a means to represent translation examples , or find structural correspondences for the purpose of transfer grammar learning ( Menezes & Richardson , 2001 ) , ( Aramaki et al. , 2001 ) , ( Watanabe et al. , 2000 ) , ( Meyers et al. , 2000 ) , ( Matsumoto et al. , 1993 ) , ( Kaji et al. , 1992 ) , and example-based machine translation EBMT3 ( Sato & Nagao , 1990 ) , ( Sato , 1991 ) , ( <CITED HERE> ) , ( Al-Adhaileh & Tang , 1999 ) . Background train
Increasingly , corpus researchers are tapping the Web to overcome the sparse data problem ( <CITED HERE> ) . Background val
As such it resembles the parser of the grammar development system Attribute Language Engine ( ALE ) of ( <CITED HERE> ) . CompareOrContrast train
This is , for example , the case for all proposals working with verbal lexical entries that raise the arguments of a verbal complement ( Hinrichs and Nakazawa 1989 ) that also use lexical rules such as the Complement Extraction Lexical Rule ( Pollard and Sag 1994 ) or the Complement Cliticization Lexical Rule ( <CITED HERE> ) to operate on those raised elements . Background train
Research that is more similar in goal to that outlined in this paper is Vosse ( <CITED HERE> ) . CompareOrContrast train
<CITED HERE> pointed out that little attention had been paid in the named-entity recognition field to the discourse properties of proper names . Uses train
Table 5 shows our mapping from publication type and MeSH headings to evidence grades based on principles defined in the Strength of Recommendations Taxonomy ( <CITED HERE> ) . Uses train
While we have observed reasonable results with both G2 and Fisher 's exact test , we have not yet discussed how these results compare to the results that can be obtained with a technique commonly used in corpus linguistics based on the mutual information ( MI ) measure ( <CITED HERE> ) : Uses val
Following <CITED HERE> , we consider an anaphoric reference , NPi , correctly resolved if NPi and its closest antecedent are in the same coreference chain in the resulting partition . Uses train
From an IR view , a lot of specialized research has already been carried out for medical applications , with emphasis on the lexico-semantic aspects of dederivation and decomposition ( <CITED HERE> ; Norton and Pacak , 1983 ; Wolff , 1984 ; Wingert , 1985 ; Dujols et al. , 1991 ; Baud et al. , 1998 ) . Background train
The Gsearch system ( <CITED HERE> ) also selects sentences by syntactic criteria from large on-line text collections . Background train
Thus , over the past few years , along with advances in the use of learning and statistical methods for acquisition of full parsers ( Collins , 1997 ; Charniak , 1997a ; Charniak , 1997b ; Ratnaparkhi , 1997 ) , significant progress has been made on the use of statistical learning methods to recognize shallow parsing patterns syntactic phrases or words that participate in a syntactic relationship ( Church , 1988 ; Ramshaw and Marcus , 1995 ; Argamon et al. , 1998 ; <CITED HERE> ; Munoz et al. , 1999 ; Punyakanok and Roth , 2001 ; Buchholz et al. , 1999 ; Tjong Kim Sang and Buchholz , 2000 ) . Background train
See also the work of <CITED HERE> , which considers computer-based pronunciation by analogy but does not mention the possible application to text-to-speech synthesis . Background train
The resulting list of POS-tagged lemmas is weighted using the SMART ` ltc ' 8 tf.idf-weighting scheme ( <CITED HERE> ) . Uses train
The psycholinguistic studies of Martin ( 1970 ) , <CITED HERE> , Hillinger et al. ( 1976 ) , Grosjean et al. ( 1979 ) , Dommergues and Grosjean ( 1983 ) , and Gee and Grosjean ( 1983 ) , responding to the idea of readjusted syntax as the source of prosodic phrasing , show that grammatical structure , even if readjusted , is not in itself a reliable predictor of prosodic phrasing : mismatches between syntax and prosody occur often and systematically , and can be related to specific nonsyntactic factors such as length and word frequency . Background train
Some methods are based on likelihood ( Och and Ney , 2002 ; Blunsom et al. , 2008 ) , error rate ( <CITED HERE> ; Zhao and Chen , 2009 ; Pauls et al. , 2009 ; Galley and Quirk , 2011 ) , margin ( Watanabe et al. , 2007 ; Chiang et al. , 2008 ) and ranking ( Hopkins and May , 2011 ) , and among which minimum error rate training ( MERT ) ( Och , 2003 ) is the most popular one . Background val
Numerous previous pseudodisambiguation evaluations only include arguments that occur between 30 and 3000 times ( Erk , 2007 ; <CITED HERE> ; Rooth et al. , 1999 ) . CompareOrContrast train
• cross-language information retrieval ( e.g. , McCarley 1999 ) , • multilingual document filtering ( e.g. , Oard 1997 ) , • computer-assisted language learning ( e.g. , <CITED HERE> ) , • certain machine-assisted translation tools ( e.g. , Macklovitch 1994 ; Melamed 1996a ) , • concordancing for bilingual lexicography ( e.g. , Catizone , Russell , and Warwick 1989 ; Gale and Church 1991 ) , Background train
One would think that the type information ti , which is more specific than that 16 A linguistic example based on the signature given by <CITED HERE> would be a lexical rule deriving predicative signs from nonpredicative ones , i.e. , changing the PRD value of substantive signs from -- to - F , much like the lexical rule for NPs given by Pollard and Sag ( 1994 , p. 360 , fn . Background train
• language learning ( Green 1979 ; <CITED HERE> ; Morgan , Meier , and Newport 1989 ) • monolingual grammar induction ( Juola 1998 ) • grammar optimization ( Juola 1994 ) • insights into universal grammar ( Juola 1998 ) • machine translation ( Juola 1994 , 1997 ; Veale and Way 1997 ; Gough , Way , and Hearne 2002 ) Background train
Notable early papers on graph-based semisupervised learning include <CITED HERE> , Bansal et al. ( 2002 ) , Kondor and Lafferty ( 2002 ) , and Joachims ( 2003 ) . Background train
As a result , researchers have re-adopted the once-popular knowledge-rich approach , investigating a variety of semantic knowledge sources for common noun resolution , such as the semantic relations between two NPs ( e.g. , Ji et al. ( 2005 ) ) , their semantic similarity as computed using WordNet ( e.g. , Poesio et al. ( 2004 ) ) or Wikipedia ( Ponzetto and Strube , 2006 ) , and the contextual role played by an NP ( see <CITED HERE> ) . Background train
An example of psycholinguistically oriented research work can be found in <CITED HERE> . Background val
Alternatively , we may think of user-centered comparative studies ( <CITED HERE> ) . Future train
<CITED HERE> , p. 112 ) , for example , bemoans the fact that his `` theory lacks a world knowledge component , a mental ` encyclopedia , ' which could be invoked to generate inferences ... '' . Background train
However , more recent work ( Cahill et al. 2002 ; Cahill , McCarthy , et al. 2004 ) has presented efforts in evolving and scaling up annotation techniques to the Penn-II Treebank ( <CITED HERE> ) , containing more than 1,000,000 words and 49,000 sentences . Background train
The current system learns finite state flowcharts whereas typical learning systems usually acquire coefficient values as in <CITED HERE> , assertional statements as in Michalski ( 1980 ) , or semantic nets as in Winston ( 1975 ) . CompareOrContrast train
It also shows the structural identity to bilingual grammars as used in ( <CITED HERE> ) . Uses val
Relationships between the unlabeled items <CITED HERE> consider sequential relations between different types of emails ( e.g. , between requests and satisfactions thereof ) to classify messages , and thus also explicitly exploit the structure of conversations . Background train
100000 word stems of German ( <CITED HERE> ) . Uses train
The computational treatment of lexical rules proposed can be seen as an extension to the principled method discussed by Gotz and <CITED HERE> , 1996 , 1997b ) for encoding the main building block of HPSG grammars -- the implicative constraints -- as a logic program . Extends train
Identical to the standard perceptron proof , e.g. , <CITED HERE> , by inserting in loss-separability for normal separability . Background train
The goal of the JAVOX toolkit is to speech-enable traditional desktop applications -- this is similar to the goals of the MELISSA project ( <CITED HERE> ) . CompareOrContrast train
The simplest strategy for ordering adjectives is what <CITED HERE> call the direct evidence method . Background train
The numeral ( whether it is implicit , as in ( 3 ) , or explicit ) can be construed as allowing the reader to draw inferences about the standards employed ( Kyburg and Morreau 2000 ; <CITED HERE> ) : ( 3 ) , for example , implies a standard that counts 10 cm as large and 8 cm as not large . Background train
Such systems extract information from some types of syntactic units ( clauses in ( <CITED HERE> ; Gildea and Jurafsky , 2002 ; Hull and Gomez , 1996 ) ; noun phrases in ( Hull and Gomez , 1996 ; Rosario et al. , 2002 ) ) . Background val
`` petty conversational implicature '' ( <CITED HERE> ) , or the metarules of Section 5.2 ? Background train
Unlike other POS taggers , this POS tagger ( <CITED HERE> ) was also trained to disambiguate sentence boundaries . Uses train
On the other hand , experiments indicate that mental representation and processing of morphologically complex words are not quite language independent ( <CITED HERE> ) . Background val
ASARES has been previously applied to the acquisition of word pairs sharing semantic relations defined in the Generative Lexicon framework ( <CITED HERE> ) and called qualia relations ( Bouillon et al. , 2001 ) . Background train
People are much more likely to consult such evaluative statements than the actual text of a bill or law under discussion , given the dense nature of legislative language and the fact that ( U.S. ) bills often reach several hundred pages in length ( <CITED HERE> ) . Background train
For English , there is for instance the 15 year old Hewlett-Packard test suite , a simple text file listing test sentences and grouping them according to linguistic phenomena ( Flickinger et al. , 1987 ) ; and more recently , the much more sophisticated TSNLP ( Test Suite for Natural Language Processing ) which includes some 9500 test items for English , French and German , each of them being annotated with syntactic and application related information ( <CITED HERE> ) . Background train
Its significance is reflected both in the growing interest in annotation software for word sense tagging ( <CITED HERE> ) and in the long-standing use of part-of-speech taggers , parsers and morphological analysers for data from English and many other languages . Background train
Each of these Values has equal status , so the notion of a basic-level Value can not play a role ( cf. <CITED HERE> ) . Background train
Previous work on Chinese SRL mainly focused on how to transplant the machine learning methods which have been successful with English , such as Sun and Jurafsky ( 2004 ) , Xue and Palmer ( 2005 ) and <CITED HERE> . Background val
Accuracy is not the best measure to assess segmentation quality , therefore we also conducted experiments using the WindowDiff measure as proposed by <CITED HERE> . Uses train
The system is in the form of an agenda-driven chart-based parser whose foundation is similar to previous formalizations of Chomsky 's Minimalist Program ( Stabler , 1997 ; Harkema , 2000 ; <CITED HERE> ) . CompareOrContrast train
Optimizing for dependency arc length is particularly important as parsers tend to do worse on longer dependencies ( <CITED HERE> ) and these dependencies are typically the most meaningful for downstream tasks , e.g. , main verb dependencies for tasks Motivation train
Whereas <CITED HERE> dealt only with an English corpus , the current work shows that this methodology is applicable to a wide range of languages and corpora . CompareOrContrast train
We also experiment with a CCG parser ( <CITED HERE> ) , requiring that the contexts surrounding the original phrase and paraphrase are assigned Uses train
Their kernel is also very time consuming and in their more general sparse setting it requires O ( mn3 ) time and O ( mn2 ) space , where m and n are the number of nodes of the two trees ( m > = n ) ( <CITED HERE> ) . Future train
Other molecular biology databases We also included several model organism databases or nomenclature databases in the construction of the dictionary , i.e. , mouse -- Mouse Genome Database ( MGD ) [ 18 ] , fly -- FlyBase [ 19 ] , yeast -- Saccharomyces Genome Database ( SGD ) [ 20 ] , rat -- Rat Genome Database ( RGD ) [ 21 ] , worm -- WormBase [ 22 ] , Human Nomenclature Database ( HUGO ) [ 23 ] , Online Mendelian Inheritance in Man ( OMIM ) [ 24 ] , and Enzyme Nomenclature Database ( ECNUM ) <CITED HERE> . Uses train
ones , DIRT ( Lin and Pantel , 2001 ) , VerbOcean ( <CITED HERE> ) , FrameNet ( Baker et al. , 1998 ) , and Wikipedia ( Mehdad et al. , 2010 ; Kouylekov et al. , 2009 ) . Background train
include decision tree learning and Bayesian learning , nearest neighbor learning , and artificial neural networks , early such works may be found in ( <CITED HERE> ) , ( Creecy and Masand , 1992 ) and ( Wiene and Pedersen , 1995 ) , respectively . Background val
In other words AJAX is a web development technique for creating interactive web applications using a combination of XHTML and CSS , Document Object Model ( or DOM ) , the XMLHTTPRequest object ( <CITED HERE> ) . Background train
For these or for a specific domain , basic synonymic dictionaries can be complemented using learning methods based on distributional similarity ( <CITED HERE> ; Lin , 1998 ) . Future train
converted to numerical features using the standard technique of binarization , and we split values of the FEATS field into its atomic components . For some languages , we divide the training data into smaller sets , based on some feature s ( normally the CPOS or POS of the next input token ) , which may reduce training times without a significant loss in accuracy ( <CITED HERE> ) . Background val
• language learning ( Green 1979 ; Mori and Moeser 1983 ; Morgan , Meier , and Newport 1989 ) • monolingual grammar induction ( <CITED HERE> ) • grammar optimization ( Juola 1994 ) • insights into universal grammar ( Juola 1998 ) • machine translation ( Juola 1994 , 1997 ; Veale and Way 1997 ; Gough , Way , and Hearne 2002 ) Background train
Linguistic preprocessing of text documents is carried out by re-using smes , an information extraction core system for real-world German text processing ( <CITED HERE> ) . Uses val
Regarding future work , there are many research lines that may be followed : i ) Capturing more features by employing external knowledge such as ontological , lexical resource or WordNet-based features ( Basili et al. , 2005a ; Basili et al. , 2005b ; Bloehdorn et al. , 2006 ; Bloehdorn and Moschitti , 2007 ) or shallow semantic trees ( Giuglea and Moschitti , 2004 ; Giuglea and Moschitti , 2006 ; Moschitti and Bejan , 2004 ; Moschitti et al. , 2007 ; Moschitti , 2008 ; <CITED HERE> ) . Future train
For English → Urdu , <CITED HERE> , page 213 ) notes that `` the system learned the original training corpus ... perfectly and could reproduce it without errors '' ; that is , it scored 100 % accuracy when tested against the training corpus . Background train
Much previous work looks at the impact of using source side information ( i.e. , feature functions on the aligned English ) , such as those of <CITED HERE> , Yeniterzi and Oflazer ( 2010 ) and others . CompareOrContrast val
Recent work ( <CITED HERE> ; Curran and Moens , 2002 ) has suggested that some tasks will benefit from using significantly more data . Background train
See <CITED HERE> for a variant of this approximation that constructs finite transducers rather than finite automata . Background train
Due to their remarkable ability to incorporate context structure information and long distance reordering into the translation process , tree-based translation models have shown promising progress in improving translation quality ( Liu et al. , 2006 , 2009 ; <CITED HERE> ; Galley et al. , 2004 , 2006 ; Marcu et al. , 2006 ; Shen et al. , 2008 ; Zhang et al. , 2011b ) . Background val
These include devices such as interleaving the components ( McDonald 1983 ; Appelt 1983 ) , backtracking on failure ( Appelt 1985 ; Nogier 1989 ) , allowing the linguistic component to interrogate the planner ( Mann 1983 ; Sondheimer and Nebel 1986 ) , and Hovy 's notion of restrictive ( i.e. , bottom-up ) planning ( <CITED HERE>a , 1988c ) . Background train
The shallow parser used is the SNoW-based CSCL parser ( Punyakanok and Roth , 2001 ; <CITED HERE> ) . Uses val
Other approaches use less deep linguistic resources ( e.g. , POS-tags <CITED HERE> ) or are ( almost ) knowledge-free ( e.g. , Koehn and Knight ( 2003 ) ) . CompareOrContrast train
Some well-known approaches include rule-based models ( <CITED HERE> ) , backed-off models ( Collins and Brooks 1995 ) , and a maximum-entropy model ( Ratnaparkhi 1998 ) . Background train
One approach to partial parsing was presented by <CITED HERE> , who extended a shallow-parsing technique to partial parsing . Background train
<CITED HERE> compared a predictive approach ( statistical translation ) , a retrieval approach based on a language-model , and a hybrid approach which combines statistical chunking and traditional retrieval . CompareOrContrast train
Indeed , such rich semantic links can be used to extend indices or reformulate queries ( similar to the work by <CITED HERE> with WordNet relations ) . CompareOrContrast train
In modern syntactic theories ( e.g. , lexical-functional grammar [ LFG ] [ Kaplan and Bresnan 1982 ; Bresnan 2001 ; Dalrymple 2001 ] , head-driven phrase structure grammar [ HPSG ] [ <CITED HERE> ] , tree-adjoining grammar [ TAG ] [ Joshi 1988 ] , and combinatory categorial grammar [ CCG ] [ Ades and Steedman 1982 ] ) , the lexicon is the central repository for much morphological , syntactic , and semantic information . Background train
Despite these arguments , most applied NLG systems use a pipelined architecture ; indeed , a pipeline was used in every one of the systems surveyed by <CITED HERE> and Paiva ( 1998 ) . Background val
Other similar approaches include those of Cicekli and Güvenir ( 1996 ) , McTait and Trujillo ( 1999 ) , <CITED HERE> , and Brown ( 2000 ) , inter alia . Background train
The EDR has close ties to the named entity recognition ( NER ) and coreference resolution tasks , which have been the focus of several recent investigations ( Bikel et al. , 1997 ; Miller et al. , 1998 ; Borthwick , 1999 ; Mikheev et al. , 1999 ; <CITED HERE> ; Ng and Cardie , 2002 ; Florian et al. , 2004 ) , and have been at the center of evaluations such as : MUC-6 , MUC-7 , and the CoNLL '02 and CoNLL '03 shared tasks . Background train
In addition to headwords , dictionary search through the pronunciation field is available ; Carter ( 1987 ) has merged information from the pronunciation and hyphenation fields , creating an enhanced phonological representation which allows access to entries by broad phonetic class and syllable structure ( <CITED HERE> ) . Background train
Following Hockenmaier , Bierner , and Baldridge ( 2002 ) , <CITED HERE> , and Miyao , Ninomiya , and Tsujii ( 2004 ) , we extract a reference lexicon from Sections 02 -- 21 of the WSJ . Uses train
The only disambiguation metric that we used in our previous work ( <CITED HERE>b ) was the shape-based metric , according to which the `` best '' trees are those that are skewed to the right . Extends train
The need for information systems to support physicians at the point of care has been well studied ( <CITED HERE> ; Gorman et al. , 1994 ; Ely et al. , 2005 ) . Background val
Every arc always has a definite direction , i.e. arcs are arrows ( <CITED HERE> ) . Background train
In the first experiment , we use an induction algorithm ( <CITED HERE>a ) based on the expectation-maximization ( EM ) principle that induces parsers for PLTIGs . Uses train
To create the baseline system , we use the open-source Joshua 4.0 system ( Ganitkevitch et al. , 2012 ) to build a hierarchical phrase-based ( HPB ) system , and a syntax-augmented MT ( SAMT ) system ( <CITED HERE> ) respectively . Uses val
For Sent-Pred we also experimented with grammatical and sentence-based syntactic features , such as number of syntactic phrases , grammatical mood , and grammatical person ( <CITED HERE> ) , but the simple binary bag-of-lemmas representation yielded similar results . Uses val
At present , the system uses a heuristic matching algorithm to classify relations into the appropriate category , though in the future we may consider a classifier similar to <CITED HERE> . Future train
It is interesting to compare this analysis with that described in Dalrymple , Shieber , and Pereira ( 1991 ) and <CITED HERE> , 1991 ) . CompareOrContrast train
The importance of including nonheadwords has become uncontroversial ( e.g. Collins 1999 ; <CITED HERE> ; Goodman 1998 ) . Background train
Some methods of semantic relation analysis rely on predefined templates filled with information from processed texts ( <CITED HERE> ) . Background train
The example used to illustrate the power of ATNs ( <CITED HERE> ) , `` John was believed to have been shot , '' also parses correctly , because the [ object ] node following the verb `` believed '' acts as both an absorber and a ( re ) generator . CompareOrContrast train
Problems such as these have motivated research on more abstract , dependency-based parser evaluation ( e.g. , Lin 1995 ; Carroll , Briscoe , and Sanfilippo 1998 ; Carroll et al. 2002 ; Clark and Hockenmaier 2002 ; King et al. 2003 ; <CITED HERE> ; Kaplan et al. 2004 ; Miyao and Tsujii 2004 ) . Motivation train
LiLFeS is one of the fastest inference engines for processing feature structure logic , and efficient HPSG parsers have already been built on this system ( Nishida et al. , 1999 ; <CITED HERE> ) . Background train
As in ( <CITED HERE> ) , we used unsupervised training data which is automatically segmented to discover previously unseen stems . Uses train
To address this limitation , our previous work ( <CITED HERE> ) has initiated an investigation on the problem of conversation entailment . Extends train
∗ A brief version of this work , with some additional material , first appeared as ( <CITED HERE>a ) . Extends train
They also proposed two major categories of meta-learning approaches for recommender systems , merging and ensemble , each subdivided into the more specific subclasses suggested by <CITED HERE> as follows . Background train
Here , I adopt the model proposed by <CITED HERE> and decompose lexical verbs into verbalizing heads and verbal roots . Uses val
This heuristic is called soft union ( <CITED HERE> ) . Uses train
This approach has its roots in Fillmore 's Case Grammar ( 1968 ) , and serves as the foundation for two current large-scale semantic annotation projects : FrameNet ( <CITED HERE> ) and PropBank ( Kingsbury et al. , 2002 ) . Background train
Many investigators ( e.g. Allen 1976 ; Elowitz et al. 1976 ; <CITED HERE> ; Cahn 1988 ) have suggested that the poor prosody of synthetic speech , in comparison with natural speech , is the primary factor leading to difficulties in the comprehension of fluent synthetic speech . Motivation train
The ability to explicitly identify these sections in unstructured text could play an important role in applications such as document summarization ( Teufel and Moens , 2000 ) , information retrieval ( <CITED HERE> ) , information extraction ( Mizuta et al. , 2005 ) , and question answering . Background train
This semantics was constructed ( <CITED HERE>a , 1987b ) as a formal framework for default and commonsense reasoning . Uses val
There is a general consensus among theoretical linguists that the proper representation of verbal argument structure is event structure -- representations grounded in a theory of events that decompose semantic roles in terms of primitive predicates representing concepts such as causality and inchoativity ( <CITED HERE> ; Jackendoff , 1983 ; Pustejovsky , 1991b ; Rappaport Hovav and Levin , 1998 ) . Background train
Similarly , the notion of R + M-abduction is spiritually related to the `` abductive inference '' of Reggia ( 1985 ) , the `` diagnosis from first principles '' of Reiter ( 1987 ) , `` explainability '' of Poole ( 1988 ) , and the subset principle of <CITED HERE> . CompareOrContrast train
The EDR has close ties to the named entity recognition ( NER ) and coreference resolution tasks , which have been the focus of several recent investigations ( Bikel et al. , 1997 ; Miller et al. , 1998 ; Borthwick , 1999 ; <CITED HERE> ; Soon et al. , 2001 ; Ng and Cardie , 2002 ; Florian et al. , 2004 ) , and have been at the center of evaluations such as : MUC-6 , MUC-7 , and the CoNLL '02 and CoNLL '03 shared tasks . Background train
Representative systems are described in Boisen et al. ( 1989 ) , De Mattia and Giachin ( 1989 ) , <CITED HERE> , Niemann ( 1990 ) , and Young ( 1989 ) . Background train
R98 ( , , , , „ ) uses a variant of Kozima 's semantic similarity measure ( <CITED HERE> ) to compute block similarity . Extends train
<CITED HERE> , Meral et al. ( 2007 ) , Murphy ( 2001 ) , Murphy and Vogel ( 2007 ) and Topkara et al. ( 2006a ) all belong to the syntactic transformation category . Background val
We used a publicly available tagger ( <CITED HERE> ) to tag the words and then used these in the input to the system . Uses train
Some works abstract perception via the usage of symbolic logic representations ( Chen et al. , 2010 ; <CITED HERE> ; Matuszek et al. , 2012 ; Artzi and Zettlemoyer , 2013 ) , while others choose to employ concepts elicited from psycholinguistic and cognition studies . Background train
Multilingual lexical databases aligned with the English WordNet ( e.g. MultiWordNet ( <CITED HERE> ) ) have been created for several languages , with different degrees of coverage . Background train
An off-the-shelf speech recognition device , a Nippon Electric Corporation DP-200 , was added to an existing natural language processing system , the Natural Language Computer ( NLC ) ( <CITED HERE> , Biermann and Ballard 1980 ) . Background train
For future work , we might investigate how machine learning algorithms , which are specifically designed for the problem of domain adaptation ( Blitzer et al. , 2007 ; <CITED HERE> ) , perform in comparison to our approach . Future train
But their importance has grown far beyond machine translation : for instance , transferring annotations between languages ( Yarowsky and Ngai 2001 ; <CITED HERE> ; Ganchev , Gillenwater , and Taskar 2009 ) ; discovery of paraphrases ( Bannard and Callison-Burch 2005 ) ; and joint unsupervised POS and parser induction across languages ( Snyder and Barzilay 2008 ) . Motivation train
That is , if the current hypothesis is unable to label a candidate or is uncertain about it , then the candidate might be a good training example ( <CITED HERE> ) . Background train
Other milestones of recent research include the deployment of probabilistic and machine learning techniques ( Aone and Bennett 1995 ; Kehler 1997 ; Ge , Hale , and Charniak 1998 ; Cardie and Wagstaff 1999 ) ; the continuing interest in centering , used either in original or in revised form ( Abracos and Lopes 1994 ; Strube and Hahn 1996 ; Hahn and Strube 1997 ; <CITED HERE> ) ; and proposals related to the evaluation methodology in anaphora resolution ( Mitkov 1998a , 2001b ) . Background val
For instance , the derived morphological forms are believed to be represented as a whole , whereas the representation of the inflected forms follows the morphemic model ( <CITED HERE> ) . Background train
We perceive that these results can be extended to other language models that properly embed bilexical context-free grammars , as for instance the more general history-based models used in ( Ratnaparkhi , 1997 ) and ( <CITED HERE> ) . Future train
Another paper ( <CITED HERE> ) describes the detailed analysis on the factor of the difference of parsing performance . Background train
The original and the obtained grammar generated exactly the same number of derivation trees in the parsing experiment with 457 sentences from the ATIS corpus ( <CITED HERE> ) ( the average length is 6.32 words ) . Uses train
The use of running tallies and percentages is based on the assumption that these features are likely to produce generalized predictors ( <CITED HERE> ) . Motivation train
The Longman lexicographers have developed a grammar coding system capable of representing in compact form a nontrivial amount of information , usually to be found only in large descriptive grammars of English ( such as <CITED HERE> ) . Background train
The paper by <CITED HERE> presents additional , more sophisticated models that we do not use in this article . CompareOrContrast train
Empirical evidence has been brought forward that inflectional and/or derivational stemmers augmented by dictionaries indeed perform substantially better than those without access to such lexical repositories ( Krovetz , 1993 ; Kraaij and Pohlmann , 1996 ; <CITED HERE> ) . Background train
Although there are other discussions of the paragraph as a central element of discourse ( e.g. Chafe 1979 , Halliday and Hasan 1976 , Longacre 1979 , <CITED HERE> ) , all of them share a certain limitation in their formal techniques for analyzing paragraph structure . CompareOrContrast train
( Details of how the average-expert model performs can be found in our prior work ( <CITED HERE> ) . ) Extends train
Self-training should also benefit other discriminatively trained parsers with latent annotations ( <CITED HERE> ) , although training would be much slower compared to using generative models , as in our case . Future train
We introduce here a clearly defined and replicable split of the <CITED HERE> data , so that future investigations can accurately and correctly compare against the results presented here . Uses train
In most cases , the accuracy of parsers degrades when run on out-of-domain data ( <CITED HERE> ; McClosky et al. , 2006 ; Blitzer et al. , 2006 ; Petrov et al. , 2010 ) . Background train
<CITED HERE> relies on morphosyntactic cues in the untagged Brown corpus as indicators of six predefined subcategorization frames . Background train
To build the above s2t system , we first use the parse tree , which is generated by parsing the English side of the bilingual data with the Berkeley parser ( <CITED HERE> ) . Uses val
One obvious approach to this problem is to employ parser reranking ( <CITED HERE> ) . Background train
Ideally , to distinguish between raising and equi verbs , a number of syntactic criteria should be employed ( <CITED HERE>:460 ff . ) Future train
The table also presents the closest comparable experimental results reported by <CITED HERE> . McKnight and Srinivasan ( henceforth , M&S ) created a test collection consisting of 37,151 RCTs from approximately 12 million MEDLINE abstracts dated between 1976 and 2001 . CompareOrContrast train
How this mismatched perceptual basis affects referential communication in situated dialogue was investigated in our previous work ( <CITED HERE> ) . Extends val
<CITED HERE> furthered this work by showing that a bimodal topic model , consisting of both text and feature norms , outperformed models using only one modality on the prediction of association norms , word substitution errors , and semantic interference tasks . Background train
A cooccurrence based stemmer ( <CITED HERE> ) was used to stem Spanish words . Uses train
The best performance on the WSJ corpus was achieved by a combination of the SATZ system ( Palmer and Hearst 1997 ) with the Alembic system ( <CITED HERE> ) : a 0.5 % error rate . CompareOrContrast train
In a number of proposals , lexical generalizations are captured using lexical underspecification ( <CITED HERE> ; Krieger and Nerbonne 1992 ; CompareOrContrast val
Although a grid may be more descriptively suitable for some aspects of prosody ( for example , <CITED HERE> use the grid representation for their implementation of stress assignment in compound nominals ) , we are not aware of any evidence for or against a grid representation of discourse-neutral phrasing . CompareOrContrast train
The bottom panel of table 1 lists the results for the chosen lexicalized model ( SSN-Freq > 200 ) and five recent statistical parsers ( Ratnaparkhi , 1999 ; Collins , 1999 ; <CITED HERE> ; Collins , 2000 ; Bod , 2001 ) . CompareOrContrast train
Other factors , such as the role of focus ( Grosz 1977 , 1978 ; <CITED HERE> ) or quantifier scoping ( Webber 1983 ) must play a role , too . Background train
linguistic in nature , rather than dealing with superficial properties of the text , e.g. the amount of white space between words ( <CITED HERE> ) . CompareOrContrast train
The second version ( RM ) concerns the Resource Management task ( <CITED HERE> ) that has been popular within the DARPA community in recent years . Uses train
Another line of research that is correlated with ours is recognition of agreement/disagreement ( <CITED HERE> ; Yin et al. , 2012 ; Abbott et al. , 2011 ; Andreas et al. , 2012 ; Galley et al. , 2004 ; Hillard et al. , 2003 ) and classification of stances ( Walker et al. , 2012 ; Somasundaran and Wiebe , 2010 ) in online forums . CompareOrContrast train
The studies presented by <CITED HERE> and Johnson ( 2007 ) differed in the number of states that they used . CompareOrContrast train
A number of alignment techniques have been proposed , varying from statistical methods ( <CITED HERE> ; Gale and Church , 1991 ) to lexical methods ( Kay and Roscheisen , 1993 ; Chen , 1993 ) . Background train
We would like to use features that look at wide context on the input side , which is inexpensive ( <CITED HERE> ) . Future train
Much of the earlier work in anaphora resolution heavily exploited domain and linguistic knowledge ( Sidner 1979 ; Carter 1987 ; Rich and LuperFoy 1988 ; <CITED HERE> ) , which was difficult both to represent and to process , and which required considerable human input . Background train
Experiments on Chinese SRL ( Xue and Palmer 2005 , <CITED HERE> ) reassured these findings . Background train
This is then generalized , following a methodology based on <CITED HERE> , to generate the `` generalized marker lexicon . '' Uses train
A number of speech understanding systems have been developed during the past fifteen years ( Barnett et al. 1980 , Dixon and Martin 1979 , Erman et al. 1980 , Haton and Pierrel 1976 , Lea 1980 , Lowerre and Reddy 1980 , Medress 1980 , <CITED HERE> , Walker 1978 , and Wolf and Woods 1980 ) . CompareOrContrast train
Our proposed method is based on the automatically acquired paraphrase dictionary described in <CITED HERE> , in which the application of paraphrases from the dictionary encodes secret bits . Uses train
For example , frequent words are translated less consistently than rare words ( <CITED HERE> ) . Background train
Over the last decade there has been a lot of interest in developing tutorial dialogue systems that understand student explanations ( Jordan et al. , 2006 ; Graesser et al. , 1999 ; Aleven et al. , 2001 ; Buckley and Wolska , 2007 ; Nielsen et al. , 2008 ; VanLehn et al. , 2007 ) , because high percentages of self-explanation and student contentful talk are known to be correlated with better learning in human-human tutoring ( Chi et al. , 1994 ; Litman et al. , 2009 ; Purandare and Litman , 2008 ; <CITED HERE> ) . Background train
These knowledge sources were effectively used to build a state-of-the-art WSD program in one of our prior works ( <CITED HERE> ) . Extends train
Another line of research approaches grounded language knowledge by augmenting distributional approaches of word meaning with perceptual information ( Andrews et al. , 2009 ; Steyvers , 2010 ; Feng and Lapata , 2010b ; <CITED HERE> ; Silberer and Lapata , 2012 ; Johns and Jones , 2012 ; Bruni et al. , 2012a ; Bruni et al. , 2012b ; Silberer et al. , 2013 ) . Background val
Due to advances in statistical syntactic parsing techniques ( <CITED HERE> ; Charniak , 2001 ) , attention has recently shifted towards the harder question of analyzing the meaning of natural language sentences . Background val
This is where robust syntactic systems like SATZ ( <CITED HERE> ) or the POS tagger reported in Mikheev ( 2000 ) , which do not heavily rely on word capitalization and are not sensitive to document length , have an advantage . CompareOrContrast train
category relationships from the weak supervision : the tag dictionary and raw corpus ( <CITED HERE> ; Garrette et al. , 2015 ) . This procedure attempts to automatically estimate the frequency of each word/tag combination by dividing the number of raw-corpus occurrences of each word in the dictionary evenly across all of its associated tags . Uses train
WIT has been implemented in Common Lisp and C on UNIX , and we have built several experimental and demonstration dialogue systems using it , including a meeting room reservation system ( Nakano et al. , 1999b ) , a video-recording programming system , a schedule management system ( Nakano et al. , 1999a ) , and a weather information system ( <CITED HERE> ) . Extends train
Lexical functional grammar ( <CITED HERE> ; Bresnan 2001 ; Dalrymple 2001 ) is a member of the family of constraint-based grammars . Background val
<CITED HERE> present an approach to learn previously unknown frames for Czech from the Prague Dependency Bank ( Hajic Background train
A companion paper describes the evaluation process and results in further detail ( <CITED HERE> ) . Extends train
SNoW ( Carleson et al. , 1999 ; <CITED HERE> ) is a multi-class classifier that is specifically tailored for learning in domains in which the potential number of information sources ( features ) taking part in decisions is very large , of which NLP is a principal example . Uses train
Hence , enumerating morphological variants in a semi-automatically generated lexicon , such as proposed for French ( <CITED HERE> ) , turns out to be infeasible , at least for German and related languages . Background train
It is analogous to the step in other translation model induction algorithms that sets all probabilities below a certain threshold to negligible values ( <CITED HERE> ; Dagan et al. , 1993 ; Chen , 1996 ) . CompareOrContrast train
They proved to be useful in a number of NLP applications such as natural language generation ( Iordanskaja et al. , 1991 ) , multidocument summarization ( McKeown et al. , 2002 ) , automatic evaluation of MT ( Denkowski and Lavie , 2010 ) , and TE ( <CITED HERE> ) . Motivation val
The results , which partly confirm those obtained on a smaller dataset in <CITED HERE> , must be seen in light of the fact that our gesture annotation scheme comprises more fine-grained categories than most of the studies mentioned earlier for both head movements and face expressions . CompareOrContrast train
We employ the idea of ultraconservative update ( <CITED HERE> ; Crammer et al. , 2006 ) to propose two incremental methods for local training in Algorithm 2 as follows . Uses train
It has also been shown to be useful in joint inference of text with visual attributes obtained using visual classifiers ( <CITED HERE> ) . Background train
<CITED HERE> reported a correlation of r = .69 . CompareOrContrast train
A more subtle example is weighted FSAs that approximate PCFGs ( Nederhof , 2000 ; <CITED HERE> ) , or to extend the idea , weighted FSTs that approximate joint or conditional synchronous PCFGs built for translation . Background train
As has been previously observed and exploited in the NLP literature ( Pang and Lee , 2004 ; Agarwal and Bhattacharyya , 2005 ; <CITED HERE> ) , the above optimization function , unlike many others that have been proposed for graph or set partitioning , can be solved exactly in a provably efficient manner via methods for finding minimum cuts in graphs . CompareOrContrast train
WIT features an incremental understanding method ( <CITED HERE>b ) that makes it possible to build a robust and real-time system . Uses train
This choice is inspired by recent work on learning syntactic categories ( <CITED HERE> ) , which successfully utilized such language models to represent word window contexts of target words . Motivation train
As for work on Arabic ( MSA ) , results have been reported on the PATB ( Kulick , Gabbard , and Marcus 2006 ; <CITED HERE> ; Green and Manning 2010 ) , the Prague Dependency Treebank ( PADT ) ( Buchholz and Marsi 2006 ; Nivre 2008 ) and the CATiB ( Habash and Roth 2009 ) . Background train
Other studies which view IR as a query generation process include Maron and Kuhns , 1960 ; Hiemstra and Kraaij , 1999 ; <CITED HERE> ; Miller et al. , 1999 . CompareOrContrast train
In Section 5 , we discuss the difficulties associated with such user studies , and describe a human-based evaluation we conducted for a small subset of the responses generated by our system ( <CITED HERE>b ) . Uses train
<CITED HERE> considers the second verb V2 as an aspectual complex comparable to the auxiliaries . Background train
We could just as easily use other symmetric `` association '' measures , such as φ² ( <CITED HERE> ) or the Dice coefficient ( Smadja , 1992 ) . CompareOrContrast val
In this situation , <CITED HERE>b , 293 ) recommend `` evaluating the expectations using only a single , probable alignment . '' Motivation val
The use of the web as a corpus for teaching and research on language has been proposed a number of times ( <CITED HERE> ; Robb , 2003 ; Rundell , 2000 ; Fletcher , 2001 , 2004b ) and received a special issue of the journal Computational Linguistics ( Kilgarriff and Grefenstette , 2003 ) . Background val
• Learnability ( Zernik and Dyer 1987 ) • Text generation ( Hovy 1988 ; Milosavljevic , Tulloch , and Dale 1996 ) • Speech generation ( <CITED HERE> ) • Localization ( Schäler 1996 ) Background train
Following our previous work on stance classification ( <CITED HERE>c ) , we employ three types of features computed based on the frame-semantic parse of each sentence in a post obtained from SEMAFOR ( Das et al. , 2010 ) . Extends train
However , the literature on Linguistic Steganography , in which linguistic properties of a text are modified to hide information , is small compared with other media ( <CITED HERE> ) . Background train
This has been reported for other languages , too , dependent on the generality of the chosen approach ( Jäppinen and Niemistö , 1988 ; Choueka , 1990 ; Popovic and Willett , 1992 ; Ekmekçioglu et al. , 1995 ; Hedlund et al. , 2001 ; <CITED HERE> ) . Background train
We offer a theorem that highlights the broad applicability of these modeling techniques . If f ( input , output ) is a weighted regular relation , then the following statements are equivalent : ( 1 ) f is a joint probabilistic relation ; ( 2 ) f can be computed by a Markovian FST that halts with probability 1 ; ( 3 ) f can be expressed as a probabilistic regexp , i.e. , a regexp built up from atomic expressions a : b ( for a ∈ Σ ∪ { ε } , b ∈ Δ ∪ { ε } ) using concatenation , probabilistic union +_p , and probabilistic closure *_p . For defining conditional relations , a good regexp language is unknown to us , but they can be defined in several other ways : ( 1 ) via FSTs as in Fig. 1c , ( 2 ) by compilation of weighted rewrite rules ( <CITED HERE> ) , ( 3 ) by compilation of decision trees ( Sproat and Riley , 1996 ) , ( 4 ) as a relation that performs contextual left-to-right replacement of input substrings by a smaller conditional relation ( Gerdemann and van Noord , 1999 ) , ( 5 ) by conditionalization of a joint relation as discussed below . Background train
We use the <CITED HERE> CCG parser to analyse the sentence before and after paraphrasing . Uses train
For example , consider a relational description ( cf. , <CITED HERE> ) involving a gradable adjective , as in the dog in the large shed . Background train
Previous versions of our work , as described in <CITED HERE> , also assume that phrasing is dependent on predicate-argument structure . Extends train
like information extraction ( Yates and Etzioni , 2009 ) and textual entailment ( <CITED HERE> ) . Background train
The inclusion of the coreference task in the Sixth and Seventh Message Understanding Conferences ( MUC-6 and MUC-7 ) gave a considerable impetus to the development of coreference resolution algorithms and systems , such as those described in <CITED HERE> , Gaizauskas and Humphreys ( 1996 ) , and Kameyama ( 1997 ) . Background train
Lee et al. ( 2012 ) model entity coreference and event coreference jointly ; <CITED HERE> consider joint coreference and entity-linking . Background train
However , learning-based resolvers have not been able to benefit from having an SC agreement feature , presumably because the method used to compute the SC of an NP is too simplistic : while the SC of a proper name is computed fairly accurately using a named entity ( NE ) recognizer , many resolvers simply assign to a common noun the first ( i.e. , most frequent ) WordNet sense as its SC ( e.g. , <CITED HERE> , Markert and Nissim ( 2005 ) ) . Background train
This includes work on generalized expectation ( Mann and McCallum , 2010 ) , posterior regularization ( Ganchev et al. , 2010 ) and constraint driven learning ( <CITED HERE> ; Chang et al. , 2010 ) . CompareOrContrast train
The final machine is a trigram language model , specifically a Kneser-Ney ( <CITED HERE> ) based backoff language model . Uses train
Hovy has described another text planner that builds similar plans ( <CITED HERE>b ) . Background train
In particular , since we treat each individual speech within a debate as a single `` document '' , we are considering a version of document-level sentiment-polarity classification , namely , automatically distinguishing between positive and negative documents ( Das and Chen , 2001 ; <CITED HERE> ; Turney , 2002 ; Dave et al. , 2003 ) . Background val
transition-based dependency parsing framework ( <CITED HERE> ) using an arc-eager transition strategy and are trained using the perceptron algorithm as in Zhang and Clark ( 2008 ) with a beam size of 8 . Uses train
Just as easily , we can model link types that coincide with entries in an on-line bilingual dictionary separately from those that do not ( cf. <CITED HERE> ) . Uses train
As an alternative , we rely on PubMed to retrieve an initial set of hits that we then postprocess in greater detail -- this is the standard pipeline architecture commonly employed in other question-answering systems ( Voorhees and Tice 1999 ; <CITED HERE> ) . CompareOrContrast train
( Och and Ney , 2002 ; Blunsom et al. , 2008 ) used maximum likelihood estimation to learn weights for MT. ( <CITED HERE> ; Moore and Quirk , 2008 ; Zhao and Chen , 2009 ; Galley and Quirk , 2011 ) employed an evaluation metric as a loss function and directly optimized it . CompareOrContrast val
We also compare the results with the output generated by the statistical translation system GIZA++ / ISI ReWrite Decoder ( Al-Onaizan et al. , 1999 ; <CITED HERE> ; Germann et al. , 2001 ) , trained on the same parallel corpus . CompareOrContrast train
Some efforts have tackled tasks such as automatic image caption generation ( Feng and Lapata , 2010a ; Ordonez et al. , 2011 ) , text illustration ( <CITED HERE> ) , or automatic location identification of Twitter users ( Eisenstein et al. , 2010 ; Wing and Baldridge , 2011 ; Roller et al. , 2012 ) . Background train
Since we are not generating from the model , this does not introduce difficulties ( <CITED HERE> ) . Motivation train
The extraction procedure consists of three steps : First , the bracketing of the trees in the Penn Treebank is corrected and extended based on the approaches of Magerman ( 1994 ) and <CITED HERE> . Background train
As they are required to enable test subjects to distinguish between senses , we use artificial glosses composed from synonyms and hypernyms as a surrogate , e.g. for brother : `` brother , male sibling '' vs. `` brother , comrade , friend '' ( <CITED HERE> ) . Uses train
Over the last decade there has been a lot of interest in developing tutorial dialogue systems that understand student explanations ( Jordan et al. , 2006 ; Graesser et al. , 1999 ; <CITED HERE> ; Buckley and Wolska , 2007 ; Nielsen et al. , 2008 ; VanLehn et al. , 2007 ) , because high percentages of self-explanation and student contentful talk are known to be correlated with better learning in human-human tutoring ( Chi et al. , 1994 ; Litman et al. , 2009 ; Purandare and Litman , 2008 ; Steinhauser et al. , 2007 ) . Background val
<CITED HERE> argues that there are cases , albeit exceptional ones , in which constraints on syntactic category are an issue in subcategorization . Motivation train
Corpus frequency : ( <CITED HERE> ) differentiates between misspellings and neologisms ( new words ) in terms of their frequency . Uses train
We use an in-house developed hierarchical phrase-based translation ( <CITED HERE> ) as our baseline system , and we denote it as In-Hiero . Uses train
The types of sentences accepted are essentially those accepted by the original NLC grammar , imperative sentences with nested noun groups and conjunctions ( <CITED HERE> ) . Background train
The task we used to compare different generalisation techniques is similar to that used by Pereira et al. ( 1993 ) and <CITED HERE> . CompareOrContrast train
One of the proposed methods to extract paraphrases relies on a pivot-based approach using phrase alignments in a bilingual parallel corpus ( <CITED HERE> ) . Background val
The Nash arbitration plan , for example , would allow a doubly graded description whenever the product of the Values for the referent r exceeds that of all distractors ( <CITED HERE> ; cf. Gorniak and Roy 2003 ; Thorisson 1994 , for other plans ) . Background train
<CITED HERE> extracts word co-occurrence probabilities from unlabelled text collected from a web crawler . Background train
Notable early papers on graph-based semisupervised learning include Blum and Chawla ( 2001 ) , <CITED HERE> , Kondor and Lafferty ( 2002 ) , and Joachims ( 2003 ) . Background val
Due to this inherent ambiguity , manual annotations usually distinguish between sure correspondences for unambiguous translations , and possible , for ambiguous translations ( <CITED HERE> ) . Background train
Thus for instance , ( Copestake and Flickinger , 2000 ; Copestake et al. , 2001 ) describes a Head Driven Phrase Structure Grammar ( HPSG ) which supports the parallel construction of a phrase structure ( or derived ) tree and of a semantic representation and ( <CITED HERE> ) show how to equip Lexical Functional grammar ( LFG ) with a glue semantics . Background train
The contextual interpreter then uses a reference resolution approach similar to Byron ( 2002 ) , and an ontology mapping mechanism ( <CITED HERE>a ) to produce a domain-specific semantic representation of the student 's output . Uses train
only the available five relative scopings of the quantifiers are produced ( <CITED HERE> , 47 ) , but without the need for a free variable constraint -- the HOU algorithm will not produce any solutions in which a previously bound variable becomes free ; • the equivalences are reversible , and thus the above sentences can be generated from scoped logical forms ; • partial scopings are permitted ( see Reyle [ 1996 ] ) ; • scoping can be freely interleaved with other types of reference resolution ; • unscoped or partially scoped forms are available for inference or for generation at every stage . Background train
In the transducers produced by the training method described in this paper , the source and target positions are in the set { -1 , 0 , 1 } , though we have also used handcoded transducers ( <CITED HERE> ) and automatically trained transducers ( Alshawi and Douglas 2000 ) with a larger range of positions . Uses train
Similar approaches are being explored for parsing ( Steedman , <CITED HERE> ; Hwa et al. 2003 ) . Background train
In particular , since we treat each individual speech within a debate as a single `` document '' , we are considering a version of document-level sentiment-polarity classification , namely , automatically distinguishing between positive and negative documents ( Das and Chen , 2001 ; Pang et al. , 2002 ; Turney , 2002 ; <CITED HERE> ) . Background train
would be chunked as follows ( Tjong Kim <CITED HERE> ) : [ NP He ] [ VP reckons ] [ NP the current account deficit ] [ VP will narrow ] [ PP Background train
The flexible architecture we have presented enables interesting future research : ( i ) a straightforward improvement is the use of lexical similarity to reduce data sparseness , e.g. ( <CITED HERE> ; Basili et al. , 2006 ; Bloehdorn et al. , 2006 ) . Background train
An alternative strategy to step ( 4 ) is to perform a database lookup based on the ambiguous query and summarize the results ( <CITED HERE> ) , which we leave for future work . Future train
Such approaches have been tried recently in restricted cases ( <CITED HERE> ; Eisner , 2001b ; Lafferty et al. , 2001 ) . Background train
Typed feature grammars can be used as the basis for implementations of Head-driven Phrase Structure Grammar ( HPSG ; Pollard and Sag , 1994 ) as discussed in ( <CITED HERE>a ) and ( Meurers and Minnen , 1997 ) . Extends val
It is therefore no surprise that early attempts at response automation were knowledge-driven ( Barr and Tessler 1995 ; <CITED HERE> ; Delic and Lahaix 1998 ) . Background val
A more subtle example is weighted FSAs that approximate PCFGs ( <CITED HERE> ; Mohri and Nederhof , 2001 ) , or to extend the idea , weighted FSTs that approximate joint or conditional synchronous PCFGs built for translation . Background train
ones , DIRT ( Lin and Pantel , 2001 ) , VerbOcean ( Chklovski and Pantel , 2004 ) , FrameNet ( Baker et al. , 1998 ) , and Wikipedia ( <CITED HERE> ; Kouylekov et al. , 2009 ) . Background train
The Ruby on <CITED HERE> framework permits us to quickly develop web applications without rewriting common functions and classes . Uses train
We further add rules for combining with punctuation to the left and right and allow for the merge rule X → X X of <CITED HERE> . Uses train
Politically-oriented text Sentiment analysis has specifically been proposed as a key enabling technology in eRulemaking , allowing the automatic analysis of the opinions that people submit ( Shulman et al. , 2005 ; <CITED HERE> ; Kwon et al. , 2006 ) . Background train
<CITED HERE> has made some preliminary attempt on the idea of hierarchical semantic Background train
<CITED HERE> pointed out that many relations between words in a text are non-classical ( i.e. other than typical taxonomic relations like synonymy or hypernymy ) and therefore not covered by semantic similarity . Background train
Some of the intuitions we associate with this notion have been very well expressed by <CITED HERE> , pp. 7-8 ) : ... Semantics is constrained by our models of ourselves and our worlds . Background val
In 2009 , the second WePS campaign showed similar trends regarding the use of NE features ( <CITED HERE> ) . Background train
The head words can be automatically extracted using a heuristic table lookup in the manner described by <CITED HERE> . Uses train
Reiter describes a pipelined modular approach as a consensus architecture underlying most recent work in generation ( <CITED HERE> ) . Background train
For example , the suite of LT tools ( <CITED HERE> ; Grover et al. , 2000 ) perform tokenization , tagging and chunking on XML marked-up text directly . Background train
Liu et al. ( 2005 ) , <CITED HERE> , Murphy ( 2001 ) , Murphy and Vogel ( 2007 ) and Topkara et al. ( 2006a ) all belong to the syntactic transformation category . Background train
Brockmann and Lapata ( 2003 ) have showed that WordNet-based approaches do not always outperform simple frequency-based models , and a number of techniques have been recently proposed which may offer ideas for refining our current unsupervised approach ( <CITED HERE> ; Bergsma et al. , 2008 ) . Future val
The RenTAL system is implemented in LiLFeS ( <CITED HERE> ) 2 . Uses train
In the future , we hope to evaluate the automatic annotations and extracted lexicon against Propbank ( <CITED HERE> ) . Future train
Each set of translations is stored separately , and for each set the `` marker hypothesis '' ( <CITED HERE> ) is used to segment the phrasal lexicon into a `` marker lexicon . '' Uses train
One way to increase the precision of the mapping process is to impose some linguistic constraints on the sequences such as simple noun-phrase constraints ( Gaussier , 1995 ; <CITED HERE> ; hua Chen and Chen , 94 ; Fung , 1995 ; Evans and Zhai , 1996 ) . Uses train
Although this study falls under the general topic of discourse modeling , our work differs from previous attempts to characterize text in terms of domain-independent rhetorical elements ( <CITED HERE> ; Marcu and Echihabi , 2002 ) . CompareOrContrast train
`` Coherence , '' as outlined above , can be understood as a declarative ( or static ) version of marker passing ( Hirst 1987 ; <CITED HERE> ) , with one difference : the activation spreads to theories that share a predicate , not through the IS-A hierarchy , and is limited to elementary facts about predicates appearing in the text . CompareOrContrast val
Task properties Determining whether or not a speaker supports a proposal falls within the realm of sentiment analysis , an extremely active research area devoted to the computational treatment of subjective or opinion-oriented language ( early work includes <CITED HERE> , Hearst ( 1992 ) , Sack ( 1994 ) , and Wiebe ( 1994 ) ; see Esuli ( 2006 ) for an active bibliography ) . Background val
First , it has been noted that in many natural language applications it is sufficient to use shallow parsing information ; information such as noun phrases ( NPs ) and other syntactic sequences have been found useful in many large-scale language processing applications including information extraction and text summarization ( <CITED HERE> ; Appelt et al. , 1993 ) . Background train
In addition , a fully flexible access system allows the retrieval of dictionary entries on the basis of constraints specifying any combination of phonetic , lexical , syntactic , and semantic information ( <CITED HERE> ) . Background val
<CITED HERE> developed a way of incorporating standard n-grams into the cache model , using mixtures of language models and also exponentially decaying the weight for the cache prediction depending on the recency of the word 's last Extends train
Riehemann 1993 ; Oliva 1994 ; Frank 1994 ; Opalka 1995 ; <CITED HERE> ) . CompareOrContrast val
Machine learning methods should be interchangeable : Transformation-based learning ( TBL ) ( <CITED HERE> ) and Memory-based learning ( MBL ) ( Daelemans et al. , 2002 ) have been applied to many different problems , so a single interchangeable component should be used to represent each method . Motivation train
Recently , several alternative , often quite sophisticated approaches to collective classification have been proposed ( Neville and Jensen , 2000 ; Lafferty et al. , 2001 ; <CITED HERE> ; Taskar et al. , 2002 ; Taskar et al. , 2003 ; Taskar et al. , 2004 ; McCallum and Wellner , 2004 ) . Background train
We tested the classification of verbs into semantic types using a verb list of 139 pre-classified items drawn from the lists published in <CITED HERE> and Stockwell et al. ( 1973 ) . Uses train
The combination of likelihood and prior modeling , HMMs , and Viterbi decoding is fundamentally the same as the standard probabilistic approaches to speech recognition ( Bahl , Jelinek , and Mercer 1983 ) and tagging ( <CITED HERE> ) . CompareOrContrast train
Cross-lingual Textual Entailment ( CLTE ) has been proposed by ( Mehdad et al. , 2010 ) as an extension of Textual Entailment ( <CITED HERE> ) that consists in deciding , given two texts T and H in different languages , if the meaning of H can be inferred from the meaning of T . Background train
The EDR has close ties to the named entity recognition ( NER ) and coreference resolution tasks , which have been the focus of several recent investigations ( Bikel et al. , 1997 ; <CITED HERE> ; Borthwick , 1999 ; Mikheev et al. , 1999 ; Soon et al. , 2001 ; Ng and Cardie , 2002 ; Florian et al. , 2004 ) , and have been at the center of evaluations such as : MUC-6 , MUC-7 , and the CoNLL '02 and CoNLL '03 shared tasks . Background train
Other solutions such as complete caching of the corpora are not typically adopted due to legal concerns over copyright and redistribution of web data , issues considered at length by <CITED HERE>a ) . Background val
This includes work on generalized expectation ( <CITED HERE> ) , posterior regularization ( Ganchev et al. , 2010 ) and constraint driven learning ( Chang et al. , 2007 ; Chang et al. , 2010 ) . Background train
A very similar formulation , for another grammar transformation , is given in <CITED HERE> . CompareOrContrast train
We run GIZA++ ( <CITED HERE> ) on the training corpus in both directions ( Koehn et al. , 2003 ) to obtain the word alignment for each sentence pair . Uses train
Other attempts to address efficiency include the fast Transformation Based Learning ( TBL ) Toolkit ( <CITED HERE> ) which dramatically speeds up training TBL systems , and the translation of TBL rules into finite state machines for very fast tagging ( Roche and Schabes , 1997 ) . Background train
Consequently , fusion has been applied to a wide variety of pattern recognition and decision theoretic problems -- using a plethora of theories , techniques , and tools -- including some applications in computational linguistics ( e.g. , <CITED HERE> ; van Halteren , Zavrel , and Daelemans 1998 ) and speech technology ( e.g. , Bowles and Damper 1989 ; Romary and Pierrel 1989 ) . Background train
This observation has led some researchers , e.g. , <CITED HERE> , to claim a direct mapping between the syntactic phrase and the prosodic phrase . Background train
These features are carefully designed to reduce the data sparseness problem and some of them are inspired by previous work ( He et al. , 2008 ; <CITED HERE> ; Marton and Resnik , 2008 ; Chiang et al. , 2009 ; Setiawan et al. , 2009 ; Shen et al. , 2009 ; Xiong et al. , 2009 ) : 1 . Motivation val
In our previous work ( <CITED HERE> ) , conversation entailment is formulated as the following : given a conversation segment D which is represented by a set of clauses D = d1 ∧ ... ∧ dm , and a hypothesis H represented by another set of clauses H = h1 ∧ ... ∧ hn , the prediction on whether D entails H is determined by the product of probabilities that each hypothesis clause hj is entailed from all the conversation segment clauses d1 ... dm as follows . Extends train
Notable early papers on graph-based semisupervised learning include Blum and Chawla ( 2001 ) , Bansal et al. ( 2002 ) , <CITED HERE> , and Joachims ( 2003 ) . Background train
In addition , we consider several types of lexical features ( LexF ) inspired by previous work on agreement and disagreement ( <CITED HERE> ; Misra and Walker , 2013 ) . Motivation train
<CITED HERE> compared two retrieval approaches ( TF.IDF and query expansion ) and two predictive approaches ( statistical translation and latent variable models ) . Background train
Others provide automatic mappings of natural language instructions to executable actions , such as interpreting navigation directions ( Chen and Mooney , 2011 ) or robot commands ( Tellex et al. , 2011 ; <CITED HERE> ) . Background train
The dialogue state is represented by a cumulative answer analysis which tracks , over multiple turns , the correct , incorrect , and not-yet-mentioned parts 1Other factors such as student confidence could be considered as well ( <CITED HERE> ) . Future train
As noted above , it is well documented ( <CITED HERE> ) that subcategorization frames ( and their frequencies ) vary across domains . Motivation train
<CITED HERE> explore a number of related approaches to the extraction of a lexicalized TAG from the Penn-II Treebank with the aim of constructing a statistical model for parsing . Background train
At the same time , we believe our method has advantages over the approach developed initially at IBM ( <CITED HERE> ; Brown et al. 1993 ) for training translation systems automatically . CompareOrContrast train
Building on the work of Ruch et al. ( 2003 ) in the same domain , we present a generative approach that attempts to directly model the discourse structure of MEDLINE abstracts using Hidden Markov Models ( HMMs ) ; cf. ( <CITED HERE> ) . Background train
The basic Python reflection has already been implemented and used for large scale experiments with POS tagging , using pyMPI ( a message passing interface library for Python ) to coordinate experiments across a cluster of over 100 machines ( Curran and Clark , 2003 ; <CITED HERE> ) . Background train
As an alternative , we rely on PubMed to retrieve an initial set of hits that we then postprocess in greater detail -- this is the standard pipeline architecture commonly employed in other question-answering systems ( <CITED HERE> ; Hirschman and Gaizauskas 2001 ) . CompareOrContrast train
The language grounding problem has received significant attention in recent years , owed in part to the wide availability of data sets ( e.g. Flickr , Von Ahn ( 2006 ) ) , computing power , improved computer vision models ( Oliva and Torralba , 2001 ; Lowe , 2004 ; Farhadi et al. , 2009 ; Parikh and Grauman , 2011 ) and neurological evidence of ties between the language , perceptual and motor systems in the brain ( Pulvermüller et al. , 2005 ; Tettamanti et al. , 2005 ; <CITED HERE> ) . Background train
Although in this paper we take modus ponens as the main rule of inference , in general one can consider deductive closures with respect to weaker , nonstandard logics ( cf. <CITED HERE> ; Frisch 1987 ; Patel-Schneider 1985 ) . CompareOrContrast train
As <CITED HERE> points out , given that no situations were envisaged where the information from the tape would be altered once installed in secondary storage , this simple and conventional access strategy is perfectly adequate . Background train
Other factors , such as the role of focus ( <CITED HERE> , 1978 ; Sidner 1983 ) or quantifier scoping ( Webber 1983 ) must play a role , too . Background train
This paper describes an approach for sharing resources in various grammar formalisms such as Feature-Based Lexicalized Tree Adjoining Grammar ( FB-LTAG ) ( <CITED HERE> ; Vijay-Shanker and Joshi , 1988 ) and Head-Driven Phrase Structure Grammar ( HPSG ) ( Pollard and Sag , 1994 ) by a method of grammar conversion . Background train
This is because the binary structure has been verified to be very effective for tree-based translation ( <CITED HERE> ; Zhang et al. , 2011a ) . Motivation train
There is a general consensus among theoretical linguists that the proper representation of verbal argument structure is event structure -- representations grounded in a theory of events that decompose semantic roles in terms of primitive predicates representing concepts such as causality and inchoativity ( Dowty , 1979 ; <CITED HERE> ; Pustejovsky , 1991b ; Rappaport Hovav and Levin , 1998 ) . Background train
measure the standard intrinsic parser metrics unlabeled attachment score ( UAS ) and labeled attachment score ( LAS ) ( <CITED HERE> ) . Uses train
The EDR has close ties to the named entity recognition ( NER ) and coreference resolution tasks , which have been the focus of several recent investigations ( <CITED HERE> ; Miller et al. , 1998 ; Borthwick , 1999 ; Mikheev et al. , 1999 ; Soon et al. , 2001 ; Ng and Cardie , 2002 ; Florian et al. , 2004 ) , and have been at the center of evaluations such as : MUC-6 , MUC-7 , and the CoNLL '02 and CoNLL '03 shared tasks . Background train
Accordingly , we convert examples such as ( 27 ) into their generalized equivalents , as in ( 28 ) : ( 28 ) <DET> good man : bon homme That is , where <CITED HERE> substitutes variables for various words in his templates , we replace certain lexical items with their marker tag . CompareOrContrast train
How it is done is beyond the scope of this paper but is explained in detail in <CITED HERE> . Background train
These tools use a highly optimised GIS implementation and provide sophisticated Gaussian smoothing ( <CITED HERE> ) . Uses train
Also relevant is work on the general problems of dialog-act tagging ( Stolcke et al. , 2000 ) , citation analysis ( Lehnert et al. , 1990 ) , and computational rhetorical analysis ( Marcu , 2000 ; <CITED HERE> ) . Background train
The inference rules that were necessary to convert one list of properties into another do not sit comfortably within the received NLG pipeline model ( e.g. , <CITED HERE> ) . Background train
<CITED HERE> has developed an agenda-driven chart parser for the feature-driven formalism described above ; please refer to his paper for a description of the parsing algorithm . Extends val
An example of psycholinguistically oriented research work can be found in <CITED HERE> . Background train
In addition to a referring function , noun phrases ( NP ) can also serve communicative goals such as providing new information about the referent and expressing the speaker 's emotional attitude towards the referent ( Appelt , 1985 ; <CITED HERE> ) . Background train
Some well-known approaches include rule-based models ( Brill and Resnik 1994 ) , backed-off models ( Collins and Brooks 1995 ) , and a maximum-entropy model ( <CITED HERE> ) . Background train
There is some literature on procedure acquisition such as the LISP synthesis work described in <CITED HERE> and the PROLOG synthesis method of Shapiro ( 1982 ) . CompareOrContrast val
Against the background of a growing interest in multilingual NLP , multilingual anaphora / coreference resolution has gained considerable momentum in recent years ( Aone and McKee 1993 ; Azzam , Humphreys , and Gaizauskas 1998 ; <CITED HERE> ; Mitkov and Barbu 2000 ; Mitkov 1999 ; Mitkov and Stys 1997 ; Mitkov , Belguith , and Stys 1998 ) . Background train
The list of semantic relations with which we work is based on extensive literature study ( <CITED HERE>a ) . Uses train
The research described below is taking place in the context of three collaborative projects ( <CITED HERE> ; Russell et al. , 1986 ; Phillips and Thompson , 1986 ) to develop a general-purpose , wide coverage morphological and syntactic analyser for English . Background train
In most cases , the accuracy of parsers degrades when run on out-of-domain data ( Gildea , 2001 ; McClosky et al. , 2006 ; Blitzer et al. , 2006 ; <CITED HERE> ) . Background train
This paper presents experiments with generative content models for analyzing the discourse structure of medical abstracts , which has been confirmed to follow the four-section pattern discussed above ( <CITED HERE> ) . Background train
Following <CITED HERE> , one approach for achieving this objective consists of applying supervised learning , where a winning method is selected for each case in the training set , all the training cases are labeled accordingly , and then the system is trained to predict a winner for unseen cases . CompareOrContrast train
Thus , over the past few years , along with advances in the use of learning and statistical methods for acquisition of full parsers ( Collins , 1997 ; Charniak , 1997a ; Charniak , 1997b ; Ratnaparkhi , 1997 ) , significant progress has been made on the use of statistical learning methods to recognize shallow parsing patterns -- syntactic phrases or words that participate in a syntactic relationship ( Church , 1988 ; Ramshaw and Marcus , 1995 ; <CITED HERE> ; Cardie and Pierce , 1998 ; Munoz et al. , 1999 ; Punyakanok and Roth , 2001 ; Buchholz et al. , 1999 ; Tjong Kim Sang and Buchholz , 2000 ) . Background val
Due to their remarkable ability to incorporate context structure information and long distance reordering into the translation process , tree-based translation models have shown promising progress in improving translation quality ( <CITED HERE> , 2009 ; Quirk et al. , 2005 ; Galley et al. , 2004 , 2006 ; Marcu et al. , 2006 ; Shen et al. , 2008 ; Zhang et al. , 2011b ) . Background train
<CITED HERE> reports that non-projective and pseudo-projective algorithms outperform the `` eager '' projective algorithm in MaltParser , but our training data did not contain any non-projective dependencies . CompareOrContrast train
In particular , boosting ( Schapire , 1999 ; <CITED HERE> ) offers the possibility of achieving high accuracy from a collection of classifiers which individually perform quite poorly . Future train
For MT the most commonly used heuristic is called grow diagonal final ( <CITED HERE> ) . CompareOrContrast train
ECM-F is an entity-constrained mention F-measure ( cf. ( <CITED HERE> ) for how ECM-F is computed ) , and ACE-Value is the official ACE evaluation metric . Uses val
In order to obtain semantic representations of each word , we apply our previous strategy ( <CITED HERE> ) . Extends train
<CITED HERE> annotated a larger set of word pairs ( 353 ) , too . Background train
A more flexible approach is used by <CITED HERE> , where users can specify boundary values for attributes like rainfall , specifying , for example , rain counts as moderate above 7 mm/h , as heavy above 20 mm/h , and so on . Background train
Furthermore , a number of performance features , largely based on the PARADISE dialogue evaluation scheme ( <CITED HERE> ) , were automatically logged , derived , or manually annotated . Uses train
<CITED HERE> propose to generalize the direct evidence method so that it can apply to unseen pairs of adjectives by computing the transitive closure of the ordering relation . Background train
We evaluated our translations with IBM 's BLEU evaluation metric ( <CITED HERE> ) , using the same evaluation method and reference retranslations that were used for evaluation at HLT Workshop 2002 at CLSP ( Hajič et al. , 2002 ) . Uses train
Other attempts to address efficiency include the fast Transformation Based Learning ( TBL ) Toolkit ( Ngai and Florian , 2001 ) which dramatically speeds up training TBL systems , and the translation of TBL rules into finite state machines for very fast tagging ( <CITED HERE> ) . Background train
PR is closely related to the work of <CITED HERE> , 2008 ) , who concurrently developed the idea of using penalties based on posterior expectations of features to guide semi-supervised learning . Background train
The best results on most of our data were obtained using Hidden Naive Bayes ( HNB ) ( <CITED HERE> ) . Uses train
This is , for example , the case for all proposals working with verbal lexical entries that raise the arguments of a verbal complement ( Hinrichs and Nakazawa 1989 ) that also use lexical rules such as the Complement Extraction Lexical Rule ( <CITED HERE> ) or the Complement Cliticization Lexical Rule ( Miller and Sag 1993 ) to operate on those raised elements . Background train
A number of proposals in the 1990s deliberately limited the extent to which they relied on domain and/or linguistic knowledge and reported promising results in knowledge-poor operational environments ( Dagan and Itai 1990 , 1991 ; Lappin and Leass 1994 ; Nasukawa 1994 ; Kennedy and Boguraev 1996 ; Williams , Harvey , and Preston 1996 ; Baldwin 1997 ; <CITED HERE> , 1998b ) . Background train
ASARES has been previously applied to the acquisition of word pairs sharing semantic relations defined in the Generative Lexicon framework ( Pustejovsky , 1995 ) and called qualia relations ( <CITED HERE> ) . Background train
mlSystem ruleFeats + atomFeats We augment mlSystem ruleFeats with more features from our previous work ( Markert et al. , 2012 ; <CITED HERE>a ; Hou et al. , 2013b ) on bridging anaphora recognition and antecedent selection . Extends train
The description of the EAGLE workbench for linguistic engineering ( <CITED HERE> ) mentions a case normalization module that uses a heuristic in which a capitalized word in an ambiguous position should be rewritten without capitalization if it is found lower-cased in the same document . CompareOrContrast val
An exception is <CITED HERE> , who experimented with determining the political orientation of websites essentially by classifying the concatenation of all the documents found on that site . Background val
Until now , translation models have been evaluated either subjectively ( e.g. White and O'Connell 1993 ) or using relative metrics , such as perplexity with respect to other models ( <CITED HERE>b ) . CompareOrContrast val
The three preprocessing steps ( tokenization , POS-tagging , lemmatization ) are performed using TreeTagger ( <CITED HERE> ) . Uses train
Many statistical parsers ( Ratnaparkhi , 1999 ; <CITED HERE> ; Charniak , 2001 ) are based on a history-based probability model ( Black et al. , 1993 ) , where the probability of each decision in a parse is conditioned on the previous decisions in the parse . Background train
In informal experiments described elsewhere ( Melamed 1995 ) , I found that the G2 statistic suggested by <CITED HERE> slightly outperforms φ2 . Background val
For right-branching structures , the leftcorner ancestor is the parent , conditioning on which has been found to be beneficial ( Johnson , 1998 ) , as has conditioning on the left-corner child ( <CITED HERE> ) . Background train
This strategy is certainly the right one to start out with , since anaphora is always the more typical direction of reference in English prose ( <CITED HERE> , p. 329 ) . Motivation train
This indicates that parse trees are usually not the optimal choice for training tree-based translation models ( <CITED HERE> ) . Background val
In <CITED HERE> and Krotov et al. ( 1998 ) , it was observed that treebank grammars ( CFGs extracted from treebanks ) are very large and grow with the size of the treebank . Background val
For example , the forward-backward algorithm ( <CITED HERE> ) trains only Hidden Markov Models , while ( Ristad and Yianilos , 1996 ) trains only stochastic edit distance . Background train
ment ( Sarkar and Wintner , 1999 ; Doran et al. , 2000 ; <CITED HERE> ) . Background train
<CITED HERE> argues that , aside from missing domain-specific complementation trends , dictionaries produced by hand will tend to lag behind real language use because of their static nature . Motivation train
Finally , the Natural Language Toolkit ( NLTK ) is a package of NLP components implemented in Python ( <CITED HERE> ) . Background train
Politically-oriented text Sentiment analysis has specifically been proposed as a key enabling technology in eRulemaking , allowing the automatic analysis of the opinions that people submit ( <CITED HERE> ; Cardie et al. , 2006 ; Kwon et al. , 2006 ) . Background train
Burkett and Klein ( 2008 ) and <CITED HERE> focused on joint parsing and alignment . CompareOrContrast train
But the general outlines are reasonably clear , and we can adapt some of the UDRS ( <CITED HERE> ) work to our own framework . Uses train
Kübler , McDonald , and <CITED HERE> describe a `` typical '' MaltParser model configuration of attributes and features . Starting with it , in a series of initial controlled experiments , we settled on using buf [ 0-1 ] + stk [ 0-1 ] for word-forms , and buf [ 0-3 ] + stk [ 0-2 ] for POS tags . Uses train
For this evaluation , we randomly selected 50 abstracts with disorders indexed as the main topic from abstracts retrieved using PubMed on the five clinical questions described in <CITED HERE> . Uses train
The need for information systems to support physicians at the point of care has been well studied ( Covell et al. , 1985 ; <CITED HERE> ; Ely et al. , 2005 ) . Background train
This approach is taken in computational syntactic grammars ( e.g. <CITED HERE> ) ; the number of unlikely parses is severely reduced whenever possible , but no attempt is made to define only the so-called grammatical strings of a language . Background train
Previous work on Chinese SRL mainly focused on how to transplant the machine learning methods which has been successful with English , such as Sun and Jurafsky ( 2004 ) , <CITED HERE> and Xue ( 2008 ) . Background train
For example , the interaction of lexical rules is explored at run-time , even though the possible interaction can be determined at compile-time given the information available in the lexical rules and the base lexical entries . Based on the research results reported in <CITED HERE> , 1996 ) , we propose a new computational treatment of lexical rules that overcomes these shortcomings and results in a more efficient processing of lexical rules as used in HPSG . Motivation train
`` Coherence , '' as outlined above , can be understood as a declarative ( or static ) version of marker passing ( <CITED HERE> ; Charniak 1983 ) , with one difference : the activation spreads to theories that share a predicate , not through the IS-A hierarchy , and is limited to elementary facts about predicates appearing in the text . CompareOrContrast train
On small data sets all of the Bayesian estimators strongly outperform EM ( and , to a lesser extent , VB ) with respect to all of our evaluation measures , confirming the results reported in <CITED HERE> . CompareOrContrast train
In addition , the advantages of using linguistically annotated data over raw data are well documented ( <CITED HERE> ; Granger and Rayson , 1998 ) . Background train
For the A * algorithm ( <CITED HERE> ) as applied to speech recognition , the actual path score is typically augmented with an estimated score for the unseen portion . Uses train
Robust natural language understanding in Atlas-Andes is provided by Rosé 's CARMEL system ( Rosé 2000 ) ; it uses the spelling correction algorithm devised by <CITED HERE> . Uses train
The framework represents a generalization of several predecessor NLG systems based on Meaning-Text Theory : FoG ( Kittredge and Polguere , 1991 ) , LFS ( Iordanskaja et al. , 1992 ) , and JOYCE ( Rambow and <CITED HERE> ) . Extends train
Nevertheless , <CITED HERE> , page 23 ) observes that `` a slightly more general mapping , where two adjacent terminal symbols can be merged into a single lexical item ( for example , a word and its case-marking ) , can capture this sort of result quite handily . '' Background train
Other similar approaches include those of Cicekli and Güvenir ( 1996 ) , McTait and Trujillo ( 1999 ) , Carl ( 1999 ) , and <CITED HERE> , inter alia . Background train
( Watanabe et al. , 2007 ; Chiang et al. , 2008 ; <CITED HERE> ) proposed other optimization objectives by introducing a margin-based and ranking-based indirect loss functions . CompareOrContrast train
<CITED HERE> utilized a transformation-based method to learn a sequence of monolingual tree transformations for translation . CompareOrContrast train
Baseline language model : For P0 we used a trigram with modified kneser-ney smoothing [ Chen and <CITED HERE> ] , which is still considered one of the best smoothing methods for n-gram language models . Uses train
Therefore , we preprocess Ontonote-5.0 to derive mention heads using Collins head rules ( <CITED HERE> ) with gold constituency parsing information and gold named entity information . Uses train
Our most accurate single grammar achieves an F score of 91.6 on the WSJ test set , rivaling discriminative reranking approaches ( <CITED HERE> ) and products of latent variable grammars ( Petrov , 2010 ) , despite being a single generative PCFG . CompareOrContrast train
We have presented an ensemble approach to word sense disambiguation ( <CITED HERE> ) where multiple Naive Bayesian classifiers , each based on co -- occurrence features from varying sized windows of context , is shown to perform well on the widely studied nouns interest and line . Background train
Over the last decade there has been a lot of interest in developing tutorial dialogue systems that understand student explanations ( Jordan et al. , 2006 ; <CITED HERE> ; Aleven et al. , 2001 ; Buckley and Wolska , 2007 ; Nielsen et al. , 2008 ; VanLehn et al. , 2007 ) , because high percentages of self-explanation and student contentful talk are known to be correlated with better learning in human-human tutoring ( Chi et al. , 1994 ; Litman et al. , 2009 ; Purandare and Litman , 2008 ; Steinhauser et al. , 2007 ) . Background train
We shall see this in the next example : two sentences , regarded as a fragment of paragraph , are a variation on a theme by <CITED HERE> . Extends train
Nevertheless , the full document text is present in most systems , sometimes as the only feature ( <CITED HERE> ) and sometimes in combination with others see for instance ( Chen and Martin , 2007 ; Popescu and Magnini , 2007 ) - . Background train
The recognizer for these systems is the SUMMIT system ( <CITED HERE> ) , which uses a segmental-based framework and includes an auditory model in the front-end processing . Uses train
Some recent GRE algorithms have done away with the separation between content determination and linguistic realization , interleaving the two processes instead ( <CITED HERE> ; Krahmer and Theune 2002 ) . CompareOrContrast train
To sample from our proposal distribution , we use a blocked Gibbs sampler based on the one proposed by Goodman ( 1998 ) and used by <CITED HERE> that samples entire parse trees . Uses train
Semantic construction proceeds from the derived tree ( <CITED HERE> ) rather than -- as is more common in TAG -- from the derivation tree . Background train
It allows the construction of a non-TAL ( <CITED HERE> ) , ( Harbusch & Poller , 2000 ) . Background val
Following previous work ( e.g. , <CITED HERE> and Ponzetto and Strube ( 2006 ) ) , we generate training instances as follows : a positive instance is created for each anaphoric NP , NPj , and its closest antecedent , NPi ; and a negative instance is created for NPj paired with each of the intervening NPs , NPi +1 , NPi +2 , ... , NPj_1 . Uses train
TF-IDF ( term frequency-inverse document frequency ) is one of the widely used feature selection techniques in information retrieval ( <CITED HERE> ) . Background train
The gap mechanism resembles the Hold register idea of ATNs ( <CITED HERE> ) and the treatment of bounded domination metavariables in lexical functional grammars ( LFGs ) ( Bresnan 1982 , p. 235 ff . ) CompareOrContrast train
To solve these scaling issues , we implement Online Variational Bayesian Inference ( Hoffman et al. , 2010 ; <CITED HERE> ) for our models . Uses val
Typical examples are Bulgarian ( Simov et al. , 2005 ; <CITED HERE> ) , Chinese ( Chen et al. , 2003 ) , Danish ( Kromann , 2003 ) , and Swedish ( Nilsson et al. , 2005 ) . Background train
This appeared to solve the problem , and the results presented later for the average degree of generalisation do not show an over-generalisation compared with those given in <CITED HERE> . CompareOrContrast train
These observations and this line of reasoning has not escaped the attention of theoretical linguists : <CITED HERE> propose that argument structure is , in fact , encoded syntactically . Background train
inter-document references in the form of hyperlinks ( <CITED HERE> ) . Background train
While IA is generally thought to be consistent with findings on human language production ( <CITED HERE> ; Levelt 1989 ; Pechmann 1989 ; Sonnenschein 1982 ) , the hypothesis that incrementality is a good model of human GRE seems unfalsifiable until a preference order is specified for the properties on which it operates . Background train
Arabic has two kinds of plurals : broken plurals and sound plurals ( Wightwick and Gaafar , 1998 ; <CITED HERE> ) . Background val
Secondly , as ( <CITED HERE> ) show , marginalizing out the different segmentations during decoding leads to improved performance . Future train
Nivre ( 2008 ) reports experiments on Arabic parsing using his MaltParser ( <CITED HERE> ) , trained on the PADT . Background train
More recently , ( Sebastiani , 2002 ) has performed a good survey of document categorization ; recent works can also be found in ( <CITED HERE> ) , ( Crammer and Singer , 2003 ) , and ( Lewis et al. , 2004 ) . Background train
The grammar conversion from LTAG to HPSG ( <CITED HERE> ) is the core portion of the RenTAL system . Background train
This includes work on question answering ( <CITED HERE> ) , sentiment analysis ( Nakagawa et al. , 2010 ) , MT reordering ( Xu et al. , 2009 ) , and many other tasks . Background val
Our knowledge extractors rely extensively on MetaMap ( <CITED HERE> ) , a system for identifying segments of text that correspond to concepts in the UMLS Metathesaurus . Uses train
We study the cases where a 9Recall that even the <CITED HERE> system , built on the world 's largest corpus , achieves only 34 % recall ( Table 1 ) ( with only 48 % of positives and 27 % of all pairs previously observed , but see Footnote 5 ) . CompareOrContrast train
Inspired by ( Blunsom et al. , 2009 ) and ( <CITED HERE> ) , we define P ( str | frag ) as follows : where csw is the number of words in the source string . Motivation train
Inspired by ( <CITED HERE> ) and ( Cohn and Blunsom , 2009 ) , we define P ( str | frag ) as follows : where csw is the number of words in the source string . Motivation train
The TNT POS tagger ( <CITED HERE> ) has also been designed to train and run very quickly , tagging between 30,000 and 60,000 words per second . Background train
Some previous works ( Bannard and Callison-Burch , 2005 ; <CITED HERE> ; Kouylekov et al. , 2009 ) indicate , as main limitations of the mentioned resources , their limited coverage , their low precision , and the fact that they are mostly suitable to capture relations mainly between single words . Background val
Similarly , <CITED HERE> report that the use of a subset of Spanish morphological features ( number for adjectives , determiners , nouns , pronouns , and verbs ; and mode for verbs ) outperforms other combinations . Background train
To solve these scaling issues , we implement Online Variational Bayesian Inference ( <CITED HERE> ; Hoffman et al. , 2012 ) for our models . Uses train
Other studies which view IR as a query generation process include <CITED HERE> ; Hiemstra and Kraaij , 1999 ; Ponte and Croft , 1998 ; Miller et al , 1999 . CompareOrContrast train
As a result , researchers have re-adopted the once-popular knowledge-rich approach , investigating a variety of semantic knowledge sources for common noun resolution , such as the semantic relations between two NPs ( e.g. , Ji et al. ( 2005 ) ) , their semantic similarity as computed using WordNet ( e.g. , Poesio et al. ( 2004 ) ) or Wikipedia ( <CITED HERE> ) , and the contextual role played by an NP ( see Bean and Riloff ( 2004 ) ) . Background train
There have been many studies on parsing techniques ( <CITED HERE> ; Flickinger et al. , 2000 ) , ones on disambiguation models ( Chiang , 2000 ; Kanayama et al. , 2000 ) , and ones on programming/grammar-development environ - Background train
An example of this is the estimation of maximum entropy models , from simple iterative estimation algorithms used by Ratnaparkhi ( 1998 ) that converge very slowly , to complex techniques from the optimisation literature that converge much more rapidly ( <CITED HERE> ) . Background train
For example , such schema can serve as a mean to represent translation examples , or find structural correspondences for the purpose of transfer grammar learning ( Menezes & Richardson , 2001 ) , ( Aramaki et al. , 2001 ) , ( Watanabe et al. , 2000 ) , ( Meyers et al. , 2000 ) , ( Matsumoto et al. , 1993 ) , ( Kaji et al. , 1992 ) , and example-base machine translation EBMT ( Sato & Nagao , 1990 ) , ( <CITED HERE> ) , ( Richardson et al. , 2001 ) , ( Al-Adhaileh & Tang , 1999 ) . Background train
Later works , such as <CITED HERE>a ) , Bolshakov ( 2004 ) , Taskiran et al. ( 2006 ) and Topkara et al. ( 2006b ) , further made use of part-of-speech taggers and electronic dictionaries , such as WordNet and VerbNet , to increase the robustness of the method . Background train
results are based on a corpus of movie subtitles ( Tiedemann 2007 ) , and are consequently shorter sentences , whereas the En → Es results are based on a corpus of parliamentary proceedings ( <CITED HERE> ) . Uses train
We work with a semi-technical text on meteorological phenomena ( <CITED HERE> ) , meant for primary school students . Uses train
This Principle of Finitism is also assumed by Johnson-Laird ( 1983 ) , Jackendoff ( 1983 ) , <CITED HERE> , and implicitly or explicitly by almost all researchers in computational linguistics . CompareOrContrast train
The candidate feature templates include : Voice from <CITED HERE> . Uses train
Over the past decade , researchers at IBM have developed a series of increasingly sophisticated statistical models for machine translation ( Brown et al. , 1988 ; Brown et al. , 1990 ; <CITED HERE>a ) . Background train
The PICO framework ( <CITED HERE> ) for capturing well-formulated clinical queries ( described in Section 2 ) can serve as the basis of a knowledge representation that bridges the needs of clinicians and analytical capabilities of a system . Background train
The Chinese PropBank has labeled the predicate-argument structures of sentences from the Chinese TreeBank ( <CITED HERE> ) . Uses val
Furthermore , manually selected word pairs are often biased towards highly related pairs ( <CITED HERE> ) , because human annotators tend to select only highly related pairs connected by relations they are aware of . Background train
Disjunctive feature descriptions are also possible ; WIT incorporates an efficient method for handling disjunctions ( <CITED HERE> ) . Uses train
Our strategy is based on the approach presented by <CITED HERE> . Uses train
Indeed , contrary to the more classical statistical methods ( Mutual Information , Loglike ... , see below ) used for collocation acquisition ( see ( <CITED HERE> ) for a review ) , these patterns allow : Background train
A further complication is that different speakers can regard very different values as prototypical , making it difficult to assess which of two objects is greener even on one dimension ( <CITED HERE> , pages 10 -- 12 ) . Background train
The M step then treats c as fixed , observed data and adjusts θ until the predicted vector of total feature counts equals c , using Improved Iterative Scaling ( Della <CITED HERE> ; Chen and Uses train
Both kinds of annotation were carried out using ANVIL ( <CITED HERE> ) . Uses train
The article classifier is a discriminative model that draws on the state-of-the-art approach described in <CITED HERE> . Uses train
The language grounding problem has received significant attention in recent years , owed in part to the wide availability of data sets ( e.g. Flickr , Von Ahn ( 2006 ) ) , computing power , improved computer vision models ( <CITED HERE> ; Lowe , 2004 ; Farhadi et al. , 2009 ; Parikh and Grauman , 2011 ) and neurological evidence of ties between the language , perceptual and motor systems in the brain ( Pulvermüller et al. , 2005 ; Tettamanti et al. , 2005 ; Aziz-Zadeh et al. , 2006 ) . Background val
<CITED HERE> , p. 294 ) studied , among other simple text normalization techniques , the effect of case normalization for different words and showed that `` sometimes case variants refer to the same thing ( hurricane and Hurricane ) , sometimes they refer to different things ( continental and Continental ) and sometimes they do n't refer to much of anything ( e.g. , anytime and Anytime ) . '' Background train
We follow the notation convention of <CITED HERE> . Uses train
Later works , such as Atallah et al. ( 2001a ) , Bolshakov ( 2004 ) , Taskiran et al. ( 2006 ) and <CITED HERE>b ) , further made use of part-of-speech taggers and electronic dictionaries , such as WordNet and VerbNet , to increase the robustness of the method . Background train
<CITED HERE> did not report inter-subject correlation for their larger dataset . CompareOrContrast train
This is a similar conclusion to our previous work in <CITED HERE> . CompareOrContrast train
We then use the program Snob ( <CITED HERE> ; Wallace 2005 ) to cluster these experiences . Uses train
The priorities are used for disambiguating interpretation in the incremental understanding method ( <CITED HERE>b ) . Uses train
A number of alignment techniques have been proposed , varying from statistical methods ( Brown et al. , 1991 ; <CITED HERE> ) to lexical methods ( Kay and Roscheisen , 1993 ; Chen , 1993 ) . Background train
<CITED HERE> present an illustrative first-order fragment along these lines and are able to supply a coherent formal semantics for the CLF-QLFs themselves , using a technique essentially equivalent to supervaluations : a QLF is true iff all its possible RQLFs are , false iff they are all false , and undefined otherwise . Background val
We use the open-source Moses toolkit ( <CITED HERE> ) to build a phrase-based SMT system trained on mostly MSA data ( 64M words on the Arabic side ) obtained from several LDC corpora including some limited DA data . Uses train
This approach is taken , for example , in LKB ( <CITED HERE> ) where lexical rules are introduced on a par with phrase structure rules and the parser makes no distinction between lexical and nonlexical rules ( Copestake 1993 , 31 ) . CompareOrContrast train
This confirms that although Kozima 's approach ( <CITED HERE> ) is computationally expensive , it does produce more precise segmentation . CompareOrContrast train
<CITED HERE> recently described a hybrid method for finding abbreviations and their definitions . Background train
More specifically , the notion of the phrasal lexicon ( used first by <CITED HERE> ) has been used successfully in a number of areas : Background train
<CITED HERE> run a finite-state NP parser on a POS-tagged corpus to calculate the relative frequency of the same six subcategorization verb classes . Background train
This section , which elaborates on preliminary results reported in <CITED HERE> , describes extraction algorithms for population , problems , interventions , outcomes , and the strength of evidence . Extends train
It would seem therefore that the iteration of the PT operation to form a closure is needed ( cf. <CITED HERE>b ) . CompareOrContrast train
<CITED HERE> asked subjects to identify the target of a vague description in a visual scene . Background train
Our most accurate single grammar achieves an F score of 91.6 on the WSJ test set , rivaling discriminative reranking approaches ( Charniak and Johnson , 2005 ) and products of latent variable grammars ( <CITED HERE> ) , despite being a single generative PCFG . CompareOrContrast train
Our recovery policy is modeled on the TargetedHelp ( <CITED HERE> ) policy used in task-oriented dialogue . Extends train
It has been argued that , in an incremental approach , gradable properties should be given a low preference ranking because they are difficult to process ( <CITED HERE> ) . CompareOrContrast train
` See ( <CITED HERE> ) for a discussion of the appropriateness of TIG for HPSG and a comparison with other feature logic approaches designed for HPSG . Background train
Specifically , we used Decision Graphs ( <CITED HERE> ) for Doc-Pred , and SVMs ( Vapnik 1998 ) for Sent-Pred . Additionally , we used unigrams for clustering documents and sentences , and unigrams and bigrams for predicting document clusters and sentence clusters ( Sections 3.1.2 and 3.2.2 ) . Uses val
There are many plausible representations , such as pairs of trees from synchronous tree adjoining grammars ( Abeille et al. 1990 ; Shieber 1994 ; <CITED HERE> ) , lexical conceptual structures ( Dorr 1992 ) and WordNet synsets ( Fellbaum 1998 ; Vossen 1998 ) . Background train
ones , DIRT ( Lin and Pantel , 2001 ) , VerbOcean ( Chklovski and Pantel , 2004 ) , FrameNet ( <CITED HERE> ) , and Wikipedia ( Mehdad et al. , 2010 ; Kouylekov et al. , 2009 ) . Background train
In the latter case , we can also take care of transferring the value of z. However , as discussed by <CITED HERE> , creating several instances of lexical rules can be avoided . Motivation train
It maximizes the probability of getting the entire DA sequence correct , but it does not necessarily find the DA sequence that has the most DA labels correct ( <CITED HERE> ) . Background train
A substring in the sentence that corresponds to a node in the representation tree is denoted by assigning the interval of the substring to SNODE of 2 These definitions are based on the discussion in ( Tang , 1994 ) and <CITED HERE> . Uses train
We found that the oldest system ( <CITED HERE> ) yielded the best prototypes , and that using these prototypes gave state-of-the-art performance on WSJ , as well as improvements on nearly all of the non-English corpora . Background train
Other definitions of predicates may be found in ( <CITED HERE> ) . Background train
For the sake of completeness , we report in this section also the results obtained adopting the `` basic solution '' proposed by ( <CITED HERE> ) . CompareOrContrast train
The representations used by Danlos ( 2000 ) , Gardent and Webber ( 1998 ) , or <CITED HERE> are similar , but do not ( always ) explicitly represent the clause combining operations as labeled nodes . Background train
Since earlier versions of the SNoW based CSCL were used only to identify single phrases ( <CITED HERE> ; Munoz et al. , 1999 ) and never to identify a collection of several phrases at the same time , as we do here , we also trained and tested it under the exact conditions of CoNLL-2000 ( Tjong Kim Sang and Buchholz , 2000 ) to compare it to other shallow parsers . Extends train
If differences in meaning between senses are very fine-grained , distinguishing between them is hard even for humans ( <CITED HERE> ) . Pairs containing such words are not suitable for evaluation . Background train
The application of domain models and deep semantic knowledge to question answering has been explored by a variety of researchers ( e.g. , Jacquemart and Zweigenbaum 2003 , <CITED HERE> ) , and was also the focus of recent workshops on question answering in restricted domains at ACL 2004 and AAAI 2005 . Background train
Griffiths et al. ( 2007 ) helped pave the path for cognitive-linguistic multimodal research , showing that Latent Dirichlet Allocation outperformed Latent Semantic Analysis ( <CITED HERE> ) in the prediction of association norms . Background train
Although not the first to employ a generative approach to directly model content , the seminal work of <CITED HERE> is a noteworthy point of reference and comparison . CompareOrContrast train
Others include selectional preferences , transitivity ( <CITED HERE> ) , mutual exclusion , symmetry , etc. . Background train
• cross-language information retrieval ( e.g. , <CITED HERE> ) , • multilingual document filtering ( e.g. , Oard 1997 ) , • computer-assisted language learning ( e.g. , Nerbonne et al. 1997 ) , • certain machine-assisted translation tools ( e.g. , Macklovitch 1994 ; Melamed 1996a ) , • concordancing for bilingual lexicography ( e.g. , Catizone , Russell , and Warwick 1989 ; Gale and Church 1991 ) , Background val
For example , ( <CITED HERE> ) discusses the evaluation of two different text categorization strategies with several variations of their feature spaces . Background train
As stated before , the experiments are run in the ACE '04 framework ( <CITED HERE> ) where the system will identify mentions and will label them ( cf. Section 4 ) with a type ( person , organization , etc ) , a sub-type ( OrgCommercial , OrgGovernmental , etc ) , a mention level ( named , nominal , etc ) , and a class ( specific , generic , etc ) . Uses train
Thus , the second class of SBD systems employs machine learning techniques such as decision tree classifiers ( Riley 1989 ) , neural networks ( Palmer and Hearst 1994 ) , and maximum-entropy modeling ( <CITED HERE> ) . Background train
or quotation of messages in emails or postings ( see <CITED HERE> but cf. Agrawal et al. ( 2003 ) ) . Background train
The first work to do this with topic models is <CITED HERE>b ) . Background train
The language chosen for semantic representation is a flat semantics along the line of ( Bos , 1995 ; Copestake et al. , 1999 ; <CITED HERE> ) . CompareOrContrast train
Tetreault 's contribution features comparative evaluation involving the author 's own centering-based pronoun resolution algorithm called the Left-Right Centering algorithm ( LRC ) as well as three other pronoun resolution methods : Hobbs 's naive algorithm ( <CITED HERE> ) , BFP ( Brennan , Friedman , and Pollard 1987 ) , and Strube 's S-list approach ( Strube 1998 ) . Background train
For future work , we might investigate how machine learning algorithms , which are specifically designed for the problem of domain adaptation ( <CITED HERE> ; Jiang and Zhai , 2007 ) , perform in comparison to our approach . Future train
The χ2 statistic is performing at least as well as G2 , throwing doubt on the claim by <CITED HERE> that the G2 statistic is better suited for use in corpus-based NLP . CompareOrContrast train
Provided with the candidate fragment elements , we previously ( <CITED HERE> ) used a chunker to finalize the output fragments , in order to follow the linguistic definition of a ( para - ) phrase . Extends train
There is a rich literature on organization and lexical access of morphologically complex words where experiments have been conducted mainly for derivational suffixed words of English , Hebrew , Italian , French , Dutch , and few other languages ( Marslen-Wilson et al. , 2008 ; Frost et al. , 1997 ; Grainger , et al. , 1991 ; <CITED HERE> ) . Background train
This method of incorporating dictionary information seems simpler than the method proposed by Brown et al. for their models ( <CITED HERE>b ) . CompareOrContrast train
One important example is the constituent-context model ( CCM ) of <CITED HERE> , which was specifically designed to capture the linguistic observation made by Radford ( 1988 ) that there are regularities to the contexts in which constituents appear . Background train
Japanese ( <CITED HERE> ) , despite a very high accuracy , is different in that attachment score drops from 98 % to 85 % , as we go from length 1 to 2 , which may have something to do with the data consisting of transcribed speech with very short utterances . CompareOrContrast train
7 We ignore the rare `` false idafa '' construction ( <CITED HERE> , p. 102 ) . Background train
Various approaches for computing semantic relatedness of words or concepts have been proposed , e.g. dictionary-based ( Lesk , 1986 ) , ontology-based ( <CITED HERE> ; Leacock and Chodorow , 1998 ) , information-based ( Resnik , 1995 ; Jiang and Conrath , 1997 ) or distributional ( Weeds and Weir , 2005 ) . Background train
Some methods are based on likelihood ( Och and Ney , 2002 ; Blunsom et al. , 2008 ) , error rate ( Och , 2003 ; Zhao and Chen , 2009 ; Pauls et al. , 2009 ; Galley and Quirk , 2011 ) , margin ( Watanabe et al. , 2007 ; Chiang et al. , 2008 ) and ranking ( <CITED HERE> ) , and among which minimum error rate training ( MERT ) ( Och , 2003 ) is the most popular one . Background train
We follow <CITED HERE> in allowing a small set of generic , linguistically-plausible unary and binary grammar rules . Uses train
( 7 ) NEIGHBOR : Research in lexical semantics suggests that the SC of an NP can be inferred from its distributionally similar NPs ( see <CITED HERE>a ) ) . Motivation val
Discrepancies in length throw constituents off balance , and so prosodic phrasing will cross constituent boundaries in order to give the phrases similar lengths ; this is the case in Chickens were eating || the remaining green vegetables , where the subject-predicate boundary finds no prosodic correspondent . The most explicit version of this approach is the analysis presented in <CITED HERE> ( henceforth G&G ) . CompareOrContrast train
Our approach to the problem is more compatible with the empirical evidence we presented in our prior work ( <CITED HERE> ) where we analyzed the output of Chinese to English machine translation and found that there is no correlation between sentence length and MT quality . CompareOrContrast train
For all experiments reported in this section we used the syntactic dependency parser MaltParser v1.3 ( <CITED HERE> , 2008 ; Kübler , McDonald , and Nivre 2009 ) , a transition-based parser with an input buffer and a stack , which uses SVM classifiers Uses train
Against the background of a growing interest in multilingual NLP , multilingual anaphora / coreference resolution has gained considerable momentum in recent years ( Aone and McKee 1993 ; Azzam , Humphreys , and Gaizauskas 1998 ; Harabagiu and Maiorano 2000 ; Mitkov and Barbu 2000 ; Mitkov 1999 ; <CITED HERE> ; Mitkov , Belguith , and Stys 1998 ) . Background train
We use the same data setting with Xue ( 2008 ) , however a bit different from <CITED HERE> . CompareOrContrast train
They proved to be useful in a number of NLP applications such as natural language generation ( Iordanskaja et al. , 1991 ) , multidocument summarization ( McKeown et al. , 2002 ) , automatic evaluation of MT ( <CITED HERE> ) , and TE ( Dinu and Wang , 2009 ) . Motivation train
Moreover , a sandbox is a temporary view of a document itself i.e. a sandbox can not cause a change in the history ( <CITED HERE> ) . Background train
Regarding future work , there are many research line that may be followed : i ) Capturing more features by employing external knowledge such as ontological , lexical resource or WordNet-based features ( <CITED HERE>a ; Basili et al. , 2005b ; Bloehdorn et al. , 2006 ; Bloehdorn and Moschitti , 2007 ) or shallow semantic trees , ( Giuglea and Moschitti , 2004 ; Giuglea and Moschitti , 2006 ; Moschitti and Bejan , 2004 ; Moschitti et al. , 2007 ; Moschitti , 2008 ; Moschitti et al. , 2008 ) . Future val
Another line of research approaches grounded language knowledge by augmenting distributional approaches of word meaning with perceptual information ( Andrews et al. , 2009 ; Steyvers , 2010 ; Feng and Lapata , 2010b ; Bruni et al. , 2011 ; Silberer and Lapata , 2012 ; <CITED HERE> ; Bruni et al. , 2012a ; Bruni et al. , 2012b ; Silberer et al. , 2013 ) . Background train
Future research should apply the work of Blunsom et al. ( 2008 ) and <CITED HERE> , who marginalize over derivations to find the most probable translation rather than the most probable derivation , to these multi-nonterminal grammars . Future train
We have since improved the interface by incorporating a capability in the recognizer to propose additional solutions in turn once the first one fails to parse ( <CITED HERE> ) To produce these `` N-best '' alternatives , we make use of a standard A * search algorithm ( Hart 1968 , Jelinek 1976 ) . Uses train
<CITED HERE> Background train
OT therefore holds out the promise of simplifying grammars , by factoring all complex phenomena into simple surface-level constraints that partially mask one another . Whether this is always possible under an appropriate definition of `` simple constraints '' ( e.g. , <CITED HERE>b ) is of course an empirical question . Background train
Consider , for example , the lexical rule in Figure 2 , which encodes a passive lexical rule like the one presented by Pollard and Sag ( 1987 , 215 ) in terms of the setup of <CITED HERE> , ch . Background train
This result is consistent with other works using this model with these features ( <CITED HERE> ; Silberer and Lapata , 2012 ) . CompareOrContrast train
<CITED HERE> reported a correlation of r = .9026 . The results are not directly comparable , because he only used noun-noun pairs , words instead of concepts , a much smaller dataset , and measured semantic similarity instead of semantic relatedness . CompareOrContrast test
A similar observation for surface word frequency was also made by ( Bertram et al. , 2000 ; <CITED HERE> ; Burani et al. , 1987 ; Burani et al. , 1984 ; Schreuder et al. , 1997 ; Taft , 1975 ; Taft , 2004 ) , where it has been claimed that words having low surface frequency tend to decompose . Background test
But their importance has grown far beyond machine translation : for instance , transferring annotations between languages ( <CITED HERE> ; Hwa et al. 2005 ; Ganchev , Gillenwater , and Taskar 2009 ) ; discovery of paraphrases ( Bannard and Callison-Burch 2005 ) ; and joint unsupervised POS and parser induction across languages ( Snyder and Barzilay 2008 ) . Motivation test
Previous sentiment-analysis work in different domains has considered inter-document similarity ( Agarwal and Bhattacharyya , 2005 ; Pang and Lee , 2005 ; <CITED HERE> ) or explicit Background test
However , the method we are currently using in the ATIS domain ( <CITED HERE> ) represents our most promising approach to this problem . Uses test
Henceforth the collaborative traits of blogs and wikis ( <CITED HERE> ) emphasize annotation , comment , and strong editing . Background test
The ICA system ( <CITED HERE> ) aims to reduce the training time by introducing independence assumptions on the training samples that dramatically reduce the training time with the possible downside of sacrificing performance . Background test
To this end , several toolkits for building spoken dialogue systems have been developed ( Barnett and Singh , 1997 ; <CITED HERE> ) . Background test
Thus , over the past few years , along with advances in the use of learning and statistical methods for acquisition of full parsers ( Collins , 1997 ; Charniak , 1997a ; Charniak , 1997b ; Ratnaparkhi , 1997 ) , significant progress has been made on the use of statistical learning methods to recognize shallow parsing patterns -- syntactic phrases or words that participate in a syntactic relationship ( Church , 1988 ; Ramshaw and Marcus , 1995 ; Argamon et al. , 1998 ; Cardie and Pierce , 1998 ; Munoz et al. , 1999 ; Punyakanok and Roth , 2001 ; <CITED HERE> ; Tjong Kim Sang and Buchholz , 2000 ) . Background test
Task properties Determining whether or not a speaker supports a proposal falls within the realm of sentiment analysis , an extremely active research area devoted to the computational treatment of subjective or opinion-oriented language ( early work includes Wiebe and Rapaport ( 1988 ) , <CITED HERE> , Sack ( 1994 ) , and Wiebe ( 1994 ) ; see Esuli ( 2006 ) for an active bibliography ) . Background test
Various approaches for computing semantic relatedness of words or concepts have been proposed , e.g. dictionary-based ( Lesk , 1986 ) , ontology-based ( Wu and Palmer , 1994 ; Leacock and Chodorow , 1998 ) , information-based ( <CITED HERE> ; Jiang and Conrath , 1997 ) or distributional ( Weeds and Weir , 2005 ) . Background test
Both tasks are performed with a statistical framework : the mention detection system is similar to the one presented in ( Florian et al. , 2004 ) and the coreference resolution system is similar to the one described in ( <CITED HERE> ) . CompareOrContrast test
The advantage of tuning similarity to the application of interest has been shown previously by <CITED HERE> . CompareOrContrast test
Although there are other discussions of the paragraph as a central element of discourse ( e.g. <CITED HERE> , Halliday and Hasan 1976 , Longacre 1979 , Haberlandt et al. 1980 ) , all of them share a certain limitation in their formal techniques for analyzing paragraph structure . CompareOrContrast test
Thus , over the past few years , along with advances in the use of learning and statistical methods for acquisition of full parsers ( Collins , 1997 ; Charniak , 1997a ; Charniak , 1997b ; <CITED HERE> ) , significant progress has been made on the use of statistical learning methods to recognize shallow parsing patterns -- syntactic phrases or words that participate in a syntactic relationship ( Church , 1988 ; Ramshaw and Marcus , 1995 ; Argamon et al. , 1998 ; Cardie and Pierce , 1998 ; Munoz et al. , 1999 ; Punyakanok and Roth , 2001 ; Buchholz et al. , 1999 ; Tjong Kim Sang and Buchholz , 2000 ) . Background test
We experiment with four learners commonly employed in language learning : Decision List ( DL ) : We use the DL learner as described in Collins and Singer ( 1999 ) , motivated by its success in the related tasks of word sense disambiguation ( <CITED HERE> ) and NE classification ( Collins and Singer , 1999 ) . Motivation test
A central technique is to define a joint relation as a noisy-channel model , by composing a joint relation with a cascade of one or more conditional relations as in Fig. 1 ( <CITED HERE> ; Knight and Graehl , 1998 ) . Background test
We use the same set of binary features as in previous work on this dataset ( Pang et al. , 2002 ; <CITED HERE> ; Zaidan et al. , 2007 ) . Uses test
Our classification framework , directly inspired by <CITED HERE> , integrates both perspectives , optimizing its labeling of speech segments based on both individual speech-segment classification scores and preferences for groups of speech segments to receive the same label . Uses test
As for work on Arabic ( MSA ) , results have been reported on the PATB ( Kulick , Gabbard , and Marcus 2006 ; Diab 2007 ; <CITED HERE> ) , the Prague Dependency Treebank ( PADT ) ( Buchholz and Marsi 2006 ; Nivre 2008 ) and the CATiB ( Habash and Roth 2009 ) . Background test
For instance , <CITED HERE> report that the SATZ system ( decision tree variant ) was trained on a set of about 800 labeled periods , which corresponds to a corpus of about 16,000 words . CompareOrContrast test
One possible direction is to consider linguistically motivated approaches , such as the extraction of syntactic phrase tables as proposed by ( <CITED HERE> ) . Future test
Later works , such as Atallah et al. ( 2001a ) , <CITED HERE> , Taskiran et al. ( 2006 ) and Topkara et al. ( 2006b ) , further made use of part-of-speech taggers and electronic dictionaries , such as WordNet and VerbNet , to increase the robustness of the method . Background test
A number of speech understanding systems have been developed during the past fifteen years ( Barnett et al. 1980 , Dixon and Martin 1979 , Erman et al. 1980 , <CITED HERE> , Lea 1980 , Lowerre and Reddy 1980 , Medress 1980 , Reddy 1976 , Walker 1978 , and Wolf and Woods 1980 ) . CompareOrContrast test
The bottom panel of table 1 lists the results for the chosen lexicalized model ( SSN-Freq > 200 ) and five recent statistical parsers ( Ratnaparkhi , 1999 ; Collins , 1999 ; Charniak , 2000 ; <CITED HERE> ; Bod , 2001 ) . CompareOrContrast test
The basic Python reflection has already been implemented and used for large scale experiments with POS tagging , using pyMPI ( a message passing interface library for Python ) to coordinate experiments across a cluster of over 100 machines ( <CITED HERE> ; Clark et al. , 2003 ) . Background test
This imbalance foils thresholding strategies , clever as they might be ( Gale & Church , 1991 ; Wu & Xia , 1994 ; <CITED HERE> ) . Background test
Training was done on the Penn Treebank ( <CITED HERE> ) Wall Street Journal data , sections 02-21 . Uses test
We performed Latent Semantic Analysis ( LSA ) over Wikipedia using the jLSI tool ( <CITED HERE> ) to measure the relatedness between words in the dataset . Uses test
For example , our previous work ( <CITED HERE> ; Nakov and Ng , 2012 ) experimented with various techniques for combining a small bi-text for a resource-poor language ( Indonesian or Spanish , pretending that Spanish is resource-poor ) with a much larger bi-text for a related resource-rich language ( Malay or Portuguese ) ; the target language of all bi-texts was English . CompareOrContrast test
Various approaches for computing semantic relatedness of words or concepts have been proposed , e.g. dictionary-based ( <CITED HERE> ) , ontology-based ( Wu and Palmer , 1994 ; Leacock and Chodorow , 1998 ) , information-based ( Resnik , 1995 ; Jiang and Conrath , 1997 ) or distributional ( Weeds and Weir , 2005 ) . Background test
Another line of research approaches grounded language knowledge by augmenting distributional approaches of word meaning with perceptual information ( Andrews et al. , 2009 ; Steyvers , 2010 ; Feng and Lapata , 2010b ; Bruni et al. , 2011 ; Silberer and Lapata , 2012 ; Johns and Jones , 2012 ; <CITED HERE>a ; Bruni et al. , 2012b ; Silberer et al. , 2013 ) . Background test
<CITED HERE> replicated the experiment of Rubenstein and Goodenough with the original 65 word pairs translated into German . Background test
One approach to this more general problem , taken by the ` Nitrogen ' generator ( <CITED HERE>a ; Langkilde and Knight , 1998b ) , takes advantage of standard statistical techniques by generating a lattice of all possible strings given a semantic representation as input and selecting the most likely output using a bigram language model . Uses test
where mk is one mention in entity e , and the basic model building block PL ( L = 1 | e , mk , m ) is an exponential or maximum entropy model ( <CITED HERE> ) . Uses test
We also employed sequence-based measures using the ROUGE tool set ( <CITED HERE> ) , with similar results to those obtained with the word-by-word measures . Uses test
Second , using continuous distributions allows us to leverage a variety of tools ( e.g. , LDA ) that have been shown to be successful in other fields , such as speech recognition ( <CITED HERE> ) . Background test
In this section , we validate the contribution of key tag sets and morphological features -- and combinations thereof -- using a different parser : the Easy-First Parser ( <CITED HERE> ) . Uses test
The typical solution to the redundancy problem is to group verbs according to their argument realization patterns ( <CITED HERE> ) , possibly arranged in an inheritance hierarchy . CompareOrContrast test
Later , <CITED HERE> , 1982 ) proposed a knowledge base in which information about language and the world would be encoded , and he emphasized the need for using `` salience '' in choosing facts from this knowledge base . Background test
Another technique is automatic discovery of translations from parallel or non-parallel corpora ( <CITED HERE> ) . Background test
ASARES is presented in detail in ( <CITED HERE> ) . Uses test
Opposition ( called `` adversative '' or `` contrary-to-expectation '' by <CITED HERE> ; cf. also Quirk et al. 1972 , p. 672 ) . Background test
A number of applications have relied on distributional analysis ( <CITED HERE> ) in order to build classes of semantically related terms . Background test
Previous work with MaltParser in Russian , Turkish , and Hindi showed gains with CASE but not with agreement features ( Eryigit , Nivre , and Oflazer 2008 ; Nivre , Boguslavsky , and Iomdin 2008 ; <CITED HERE> ) . CompareOrContrast test
Consider , for example , the lexical rule in Figure 2 , which encodes a passive lexical rule like the one presented by <CITED HERE> , 215 ) in terms of the setup of Pollard and Sag ( 1994 , ch . CompareOrContrast test
Two applications that , like help-desk , deal with question -- answer pairs are : summarization of e-mail threads ( Dalli , Xia , and Wilks 2004 ; <CITED HERE> ) , and answer extraction in FAQs ( Frequently Asked Questions ) ( Berger and Mittal 2000 ; CompareOrContrast test
The language grounding problem has received significant attention in recent years , owed in part to the wide availability of data sets ( e.g. Flickr , Von Ahn ( 2006 ) ) , computing power , improved computer vision models ( Oliva and Torralba , 2001 ; Lowe , 2004 ; <CITED HERE> ; Parikh and Grauman , 2011 ) and neurological evidence of ties between the language , perceptual and motor systems in the brain ( Pulvermüller et al. , 2005 ; Tettamanti et al. , 2005 ; Aziz-Zadeh et al. , 2006 ) . Background test
In addition , we find that the Bayesian SCFG grammar can not even significantly outperform the heuristic SCFG grammar ( <CITED HERE> ) . CompareOrContrast test
There are several grammars developed in the FB-LTAG formalism , including the XTAG English grammar , a large-scale grammar for English ( The XTAG Research <CITED HERE> ) . Background test
Although the approach may have potential , the shifting of complex accounting into the unification algorithm is at variance with the findings of <CITED HERE> , who report large speed-ups from the elimination of disjunction processing during unification . CompareOrContrast test
For the task of unsupervised dependency parsing , <CITED HERE> add a constraint of the form `` the average length of dependencies should be X '' to capture the locality of syntax ( at least half of the dependencies are between adjacent words ) , using a scheme they call structural annealing . Background test
The speech and language processing architecture is based on that of the SRI CommandTalk system ( <CITED HERE> ; Stent et al. , 1999 ) . Uses test
Second , in line with the findings of ( <CITED HERE> ) , the results obtained over the MT-derived corpus are equal to those we achieve over the original RTE3 dataset ( i.e. 63.50 % ) . CompareOrContrast test
Therefore , inter-subject correlation is lower than the results obtained by <CITED HERE> . CompareOrContrast test
There is a general consensus among theoretical linguists that the proper representation of verbal argument structure is event structure -- representations grounded in a theory of events that decompose semantic roles in terms of primitive predicates representing concepts such as causality and inchoativity ( Dowty , 1979 ; Jackendoff , 1983 ; Pustejovsky , 1991b ; Rappaport <CITED HERE> ) . Background test
For example , some similar measures have been used in stylistic experiments in information retrieval on the basis of a robust parser built for information retrieval purposes ( <CITED HERE> ) . Background test
The resulting training procedure is analogous to the one presented in ( Brown et al. , 1993 ) and ( <CITED HERE> ) . CompareOrContrast test
successfully parses , or until a quitting criterion is reached , such as an upper bound on N. Whereas in the loosely coupled system the parser acts as a filter only on completed candidate solutions ( <CITED HERE> ) , the tightly coupled system allows the parser to discard partial theories that have no way of continuing . Uses test
<CITED HERE> substituted the non-terminal X in hierarchical phrase-based model by extended syntactic categories . CompareOrContrast test
Much of the earlier work in anaphora resolution heavily exploited domain and linguistic knowledge ( <CITED HERE> ; Carter 1987 ; Rich and LuperFoy 1988 ; Carbonell and Brown 1988 ) , which was difficult both to represent and to process , and which required considerable human input . Background test
The paradigm is `` write many , read many '' ( <CITED HERE> ) . Background test
The Praat tool was used ( <CITED HERE> ) . Uses test
The reader is asked to focus on any reasonable size measurement , for example , the maximal horizontal or vertical distance , or some combination of dimensions ( <CITED HERE> ; also Section 8.1 of the present article ) . Background test
The implementation has been inspired by experience in extracting information from very large corpora ( Curran and Moens , 2002 ) and performing experiments on maximum entropy sequence tagging ( <CITED HERE> ; Clark et al. , 2003 ) . Motivation test
Default parameters were used , although experimentation with different parameter settings is an important direction for future work ( Daelemans and Hoste , 2002 ; <CITED HERE> ) . Future test
Our work is inspired by the latent left-linking model in <CITED HERE> and the ILP formulation from Chang et al. ( 2011 ) . Uses test
Furthermore , the availability of rich ontological resources , in the form of the Unified Medical Language System ( UMLS ) ( Lindberg et al. , 1993 ) , and the availability of software that leverages this knowledge -- MetaMap ( Aronson , 2001 ) for concept identification and SemRep ( <CITED HERE> ) for relation extraction -- provide a foundation for studying the role of semantics in various tasks . Background test
The names given to the components vary ; they have been called `` strategic '' and `` tactical '' components ( e.g. , McKeown 1985 ; Thompson 1977 ; Danlos 1987 ) , `` planning '' and `` realization '' ( e.g. , McDonald 1983 ; <CITED HERE>a ) , or simply `` what to say '' versus `` how to say it '' ( e.g. , Danlos 1987 ; Reithinger 1990 ) . Background test
Over the last decade there has been a lot of interest in developing tutorial dialogue systems that understand student explanations ( Jordan et al. , 2006 ; Graesser et al. , 1999 ; Aleven et al. , 2001 ; <CITED HERE> ; Nielsen et al. , 2008 ; VanLehn et al. , 2007 ) , because high percentages of self-explanation and student contentful talk are known to be correlated with better learning in human-human tutoring ( Chi et al. , 1994 ; Litman et al. , 2009 ; Purandare and Litman , 2008 ; Steinhauser et al. , 2007 ) . Background test
We use the TRIPS dialogue parser ( <CITED HERE> ) to parse the utterances . Uses test
In order to address these limitations in a practical way , we conducted a small user study where we asked four judges ( graduate students from the Faculty of Information Technology at Monash University ) to assess the responses generated by our system ( <CITED HERE>a ) . Uses test
The understanding module utilizes ISSS ( Incremental Significant-utterance Sequence Search ) ( <CITED HERE>b ) , which is an integrated parsing and discourse processing method . Uses test
We applied our system to the XTAG English grammar ( The XTAG Research <CITED HERE> ) , which is a large-scale FB-LTAG grammar for English . Uses test
After the extraction , pruning techniques ( <CITED HERE> ) can be applied to increase the precision of the extracted paraphrases . Background test
In this paper , we extend two classes of model adaptation methods ( i.e. , model interpolation and error-driven learning ) , which have been well studied in statistical language modeling for speech and natural language applications ( e.g. , <CITED HERE> ; Bellegarda , 2004 ; Gao et al. , 2006 ) , to ranking models for Web search applications . Background test
GATE goes beyond earlier systems by using a component-based infrastructure ( <CITED HERE> ) on top of which the GUI is built . Background test
Since sentences can refer to events described by other sentences , we may need also a quotation operator ; <CITED HERE> describes how first order logic can be augmented with such an operator . Background test
The system uses a knowledge base implemented in the KM representation language ( Clark and Porter , 1999 ; <CITED HERE> ) to represent the state of the world . Uses test
A possible future direction would be to compare the query string to retrieved results using a method similar to that of <CITED HERE> . Future test
description-level lexical rules ( DLRs ; <CITED HERE> ) . Background test
All EBMT systems , from the initial proposal by <CITED HERE> to the recent collection of Carl and Way ( 2003 ) , are premised on the availability of subsentential alignments derived from the input bitext . Background test
The necessity of this kind of merging of arguments has been recognized before : Charniak and McDermott ( 1985 ) call it abductive unification/matching , <CITED HERE> , 1979 ) refers to such operations using the terms knitting or petty conversational implicature . Background test
In a number of proposals , lexical generalizations are captured using lexical underspecification ( Kathol 1994 ; <CITED HERE> ; CompareOrContrast test
These keywords are potentially useful features because some of them are subclasses of the ACE SCs shown in the left column of Table 1 , while others appear to be correlated with these ACE SCs . ( 6 ) INDUCED CLASS : Since the first-sense heuristic used in the previous feature may not be accurate in capturing the SC of an NP , we employ a corpus-based method for inducing SCs that is motivated by research in lexical semantics ( e.g. , <CITED HERE> ) . Motivation test
Other psycholinguistic studies that confirm the validity of paragraph units can be found in <CITED HERE> and Haberlandt et al. ( 1980 ) . Background test
The bottom panel of table 1 lists the results for the chosen lexicalized model ( SSN-Freq > 200 ) and five recent statistical parsers ( Ratnaparkhi , 1999 ; Collins , 1999 ; Charniak , 2000 ; Collins , 2000 ; <CITED HERE> ) . CompareOrContrast test
Nevertheless , the full document text is present in most systems , sometimes as the only feature ( Sugiyama and Okumura , 2007 ) and sometimes in combination with others ( see , for instance , Chen and Martin , 2007 ; <CITED HERE> ) . Background test
In a similar vein to <CITED HERE> and Buchholz et al. ( 1999 ) , the method extends an existing flat shallow-parsing method to handle composite structures . Future test
As a result , researchers have re-adopted the once-popular knowledge-rich approach , investigating a variety of semantic knowledge sources for common noun resolution , such as the semantic relations between two NPs ( e.g. , <CITED HERE> ) , their semantic similarity as computed using WordNet ( e.g. , Poesio et al. ( 2004 ) ) or Wikipedia ( Ponzetto and Strube , 2006 ) , and the contextual role played by an NP ( see Bean and Riloff ( 2004 ) ) . Background test
We built a two-stage baseline system , using the perceptron segmentation model from our previous work ( <CITED HERE> ) and the perceptron POS tagging model from Collins ( 2002 ) . Extends test
Note that although our current system uses MeSH headings assigned by human indexers , manually assigned terms can be replaced with automatic processing if needed ( <CITED HERE> ) . Future test
Furthermore , medical terminology is characterized by a typical mix of Latin and Greek roots with the corresponding host language ( e.g. , German ) , often referred to as neo-classical compounding ( <CITED HERE> ) . Background test
Previously ( <CITED HERE> ) , we assessed the importance of various implicit argument feature groups by conducting feature ablation tests . Extends test
To model d ( FWi − 1 , S → T ) , d ( FWi +1 , S → T ) , i.e. whether Li , S → T and Ri , S → T extend beyond the neighboring function word phrase pairs , we utilize the pairwise dominance model of <CITED HERE> . Uses test
For instance , <CITED HERE> , p. 8 ) says that the sentence `` Reagan thinks bananas , '' which is otherwise strange , is in fact acceptable if it occurs as an answer to the question `` What is Kissinger 's favorite fruit ? '' Motivation test
Semantic Role labeling ( SRL ) was first defined in <CITED HERE> . Background test
The AJAX function lets communication work asynchronously between a client and a server through a set of messages based on the HTTP protocol and XML ( <CITED HERE> ) . Background test
The inclusion of the coreference task in the Sixth and Seventh Message Understanding Conferences ( MUC-6 and MUC-7 ) gave a considerable impetus to the development of coreference resolution algorithms and systems , such as those described in Baldwin et al. ( 1995 ) , Gaizauskas and Humphreys ( 1996 ) , and <CITED HERE> . Background test
The most detailed evaluation of link tokens to date was performed by ( <CITED HERE> ) , who trained Brown et al. 's Model 2 on 74 million words of the Canadian Hansards . CompareOrContrast test
Log-linear models have proved successful in a wide variety of applications , and are the inspiration behind one of the best current statistical parsers ( <CITED HERE> ) . CompareOrContrast test
While we have observed reasonable results with both G2 and Fisher 's exact test , we have not yet discussed how these results compare to the results that can be obtained with a technique commonly used in corpus linguistics based on the mutual information ( MI ) measure ( <CITED HERE> ) : Background test
Morphological alterations of a search term have a negative impact on the recall performance of an information retrieval ( IR ) system ( <CITED HERE> ; Jäppinen and Niemistö , 1988 ; Kraaij and Pohlmann , 1996 ) , since they preclude a direct match between the search term proper and its morphological variants in the documents to be retrieved . Background test
For shuffling paraphrases , French alternations are partially described in ( <CITED HERE> ) and a resource is available which describes alternation and the mapping verbs/alternations for roughly 1 700 verbs . Background test
A more recent approach , advocated by Rappaport Hovav and Levin ( 1998 ) , describes a basic set of event templates corresponding to Vendler 's event classes ( <CITED HERE> ) : ( 3 ) a. [ x ACT <MANNER> ] ( activity ) b. [ x <STATE> ] ( state ) c. [ BECOME [ x <STATE> ] ] ( achievement ) d. [ x CAUSE [ BECOME [ x <STATE> ] ] ] ( accomplishment ) Background test
<CITED HERE> combines lexical and dependency mappings to form his generalizations . Background test
Thus for instance , ( Copestake and Flickinger , 2000 ; <CITED HERE> ) describe a Head Driven Phrase Structure Grammar ( HPSG ) which supports the parallel construction of a phrase structure ( or derived ) tree and of a semantic representation and ( Dalrymple , 1999 ) shows how to equip Lexical Functional Grammar ( LFG ) with a glue semantics . Background test
The reordering models we describe follow our previous work using function word models for translation ( Setiawan et al. , 2007 ; <CITED HERE> ) . Extends test
And Collins ( 2000 ) argues for `` keeping track of counts of arbitrary fragments within parse trees '' , which has indeed been carried out in <CITED HERE> who use exactly the same set of ( all ) tree fragments as proposed in Bod ( 1992 ) . Motivation test
In our work , we gather sets of sentences , and assume ( but do not employ ) existing approaches for their organization ( Goldstein et al. 2000 ; Barzilay , Elhadad , and McKeown 2001 ; <CITED HERE> ) . Background test
criteria and data used in our experiments are based on the work of <CITED HERE> . Uses test
We present experiments on the two standard coreference resolution datasets , ACE-2004 ( NIST , 2004 ) and OntoNotes-5 .0 ( <CITED HERE> ) . Uses test
• Only qualitative observations of the responses were reported ( no formal evaluation was performed ) ( Lapalme and Kosseim 2003 ; <CITED HERE> ) . CompareOrContrast test
And subderivations headed by A1 with external nonterminals only at the leaves , internal nonterminals elsewhere , have probability 1/a1 ( <CITED HERE> ) . Background test
• Support vector machines for mapping histories to parser actions ( <CITED HERE> ) . Uses test
<CITED HERE> , 1998 ) developed a polynomial time PCFG-reduction of DOP1 whose size is linear in the size of the training set , thus converting the exponential number of subtrees to a compact grammar . Background test
<CITED HERE> avoids enumerating the various senses for adjectives like fast by exploiting the semantics of the nouns they modify . Background test
<CITED HERE> have conducted a study on dependency parsing for 21 languages using features that encode whether the values for certain attributes are equal or not for a node and its governor . Background test
Such approaches have been tried recently in restricted cases ( McCallum et al. , 2000 ; <CITED HERE>b ; Lafferty et al. , 2001 ) . Background test
The relation between discourse and prosodic phrasing has been examined in some detail by <CITED HERE> , who argues that each noun phrase in an utterance constitutes a separate prosodic phrase unless it is destressed because of reference to previous discourse . Background test
By contrast , Turkish ( Oflazer et al. , 2003 ; <CITED HERE> ) exhibits high root accuracy but consistently low attachment scores ( about 88 % for length 1 and 68 % for length 2 ) . CompareOrContrast test
The candidate examples that lead to the most disagreements among the different learners are considered to have the highest TUV ( Cohn , Atlas , and Ladner 1994 ; <CITED HERE> ) . Background test
Subsequently , we extracted the bilingual phrase table from the aligned corpora using the Moses toolkit ( <CITED HERE> ) . Uses test
Representative systems are described in Boisen et al. ( 1989 ) , De Mattia and Giachin ( 1989 ) , Niedermair ( 1989 ) , Niemann ( 1990 ) , and <CITED HERE> . Background test
Our rules for phonological word formation are adopted , for the most part , from G & G , <CITED HERE> , and the account of monosyllabic destressing in Selkirk ( 1984 ) . Uses test
As a generalization , <CITED HERE> notes that lexicons such as COMLEX tend to demonstrate high precision but low recall . Background test
Such systems extract information from some types of syntactic units ( clauses in ( Fillmore and Atkins , 1998 ; Gildea and Jurafsky , 2002 ; <CITED HERE> ) ; noun phrases in ( Hull and Gomez , 1996 ; Rosario et al. , 2002 ) ) . Background test
Various approaches for computing semantic relatedness of words or concepts have been proposed , e.g. dictionary-based ( Lesk , 1986 ) , ontology-based ( Wu and Palmer , 1994 ; Leacock and Chodorow , 1998 ) , information-based ( Resnik , 1995 ; <CITED HERE> ) or distributional ( Weeds and Weir , 2005 ) . Background test
Besides WordNet , the RTE literature documents the use of a variety of lexical information sources ( Bentivogli et al. , 2010 ; <CITED HERE> ) . Background test
The question answering system developed by <CITED HERE> belongs to the merging category of approaches , where the output of an individual method can be used as input to a different method ( this corresponds to Burke 's cascade sub-category ) . CompareOrContrast test
More recently , ( Sebastiani , 2002 ) has performed a good survey of document categorization ; recent works can also be found in ( Joachims , 2002 ) , ( <CITED HERE> ) , and ( Lewis et al. , 2004 ) . Background test
Discriminant analysis has been employed by researchers in automatic text genre detection ( <CITED HERE>b ; Karlgren and Cutting 1994 ) since it offers a simple and robust solution despite the fact that it presupposes normal distributions of the discriminating variables . Background test
This model has previously been shown to provide excellent performance on multiple tasks , including prediction of association norms , word substitution errors , semantic inferences , and word similarity ( <CITED HERE> ; Silberer and Lapata , 2012 ) . Extends test
In other words , existing treatments of gradables in GRE fail to take the `` efficiency of language '' into account ( <CITED HERE> ; see our Section 2 ) . Background test
Word alignments are used primarily for extracting minimal translation units for machine translation ( MT ) ( e.g. , phrases [ Koehn , Och , and Marcu 2003 ] and rules [ <CITED HERE> ; Chiang et al. 2005 ] ) as well as for Background test
Following <CITED HERE> , the IR system ranks documents according to the probability that a document D is relevant given the query Q , P ( D is R | Q ) . Uses test
In modern syntactic theories ( e.g. , lexical-functional grammar [ LFG ] [ Kaplan and Bresnan 1982 ; <CITED HERE> ; Dalrymple 2001 ] , head-driven phrase structure grammar [ HPSG ] [ Pollard and Sag 1994 ] , tree-adjoining grammar [ TAG ] [ Joshi 1988 ] , and combinatory categorial grammar [ CCG ] [ Ades and Steedman 1982 ] ) , the lexicon is the central repository for much morphological , syntactic , and semantic information . Background test
We have shown elsewhere ( Jensen and Binot 1988 ; <CITED HERE>a , 1987b ) that natural language programs , such as on-line grammars and dictionaries , can be used as referential levels for commonsense reasoning -- for example , to disambiguate PP attachment . Extends test
Thus rather than a single training procedure , we can actually partition the examples by predicate , and train a 1For a fixed verb , MI is proportional to <CITED HERE> 's conditional probability scores for pseudodisambiguation of ( v , n , n ′ ) triples : Pr ( v | n ) = Pr ( v , n ) / Pr ( n ) , which was shown to be a better measure of association than co-occurrence frequency f ( v , n ) . Motivation test