7 results on '"Zeyu Dai"'
Search Results
2. Weakly Supervised Subevent Knowledge Acquisition
- Author
-
Ruihong Huang, Maitreyi Ramaswamy, Bonan Min, Zeyu Dai, and Wenlin Yao
- Subjects
Computer science ,business.industry ,02 engineering and technology ,computer.software_genre ,Knowledge acquisition ,Knowledge base ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,Tuple ,business ,computer ,Classifier (UML) ,Natural language processing - Abstract
Subevents elaborate an event and widely exist in event descriptions. Subevent knowledge is useful for discourse analysis and event-centric applications. Acknowledging the scarcity of subevent knowledge, we propose a weakly supervised approach to extract subevent relation tuples from text and build the first large scale subevent knowledge base. We first obtain the initial set of event pairs that are likely to have the subevent relation, by exploiting two observations that 1) subevents are temporally contained by the parent event, and 2) the definitions of the parent event can be used to further guide the identification of subevents. Then, we collect rich weak supervision using the initial seed subevent pairs to train a contextual classifier using BERT and apply the classifier to identify new subevent pairs. The evaluation showed that the acquired subevent tuples (239K) are of high quality (90.1% accuracy) and cover a wide range of event types. The acquired subevent knowledge has been shown useful for discourse analysis and identifying a range of event-event relations.
- Published
- 2020
- Full Text
- View/download PDF
3. Coreference Aware Representation Learning for Neural Named Entity Recognition
- Author
-
Zeyu Dai, Ping Li, and Hongliang Fei
- Subjects
Coreference ,Artificial neural network ,business.industry ,Computer science ,02 engineering and technology ,010501 environmental sciences ,computer.software_genre ,01 natural sciences ,Task (project management) ,Named-entity recognition ,Component (UML) ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,Layer (object-oriented design) ,business ,computer ,Feature learning ,Word (computer architecture) ,Natural language processing ,0105 earth and related environmental sciences - Abstract
Recent neural network models have achieved state-of-the-art performance on the task of named entity recognition (NER). However, previous neural network models typically treat the input sentences as a linear sequence of words but ignore rich structural information, such as the coreference relations among non-adjacent words, phrases or entities. In this paper, we propose a novel approach to learn coreference-aware word representations for the NER task at the document level. In particular, we enrich the well-known neural architecture ``CNN-BiLSTM-CRF'' with a coreference layer on top of the BiLSTM layer to incorporate coreferential relations. Furthermore, we introduce the coreference regularization to ensure the coreferential entities to share similar representations and consistent predictions within the same coreference cluster. Our proposed model achieves new state-of-the-art performance on two NER benchmarks: CoNLL-2003 and OntoNotes v5.0. More importantly, we demonstrate that our framework does not rely on gold coreference knowledge, and can still work well even when the coreferential relations are generated by a third-party toolkit.
- Published
- 2019
- Full Text
- View/download PDF
4. A Regularization Approach for Incorporating Event Knowledge and Coreference Relations into Neural Discourse Parsing
- Author
-
Zeyu Dai and Ruihong Huang
- Subjects
Coreference ,Parsing ,Commonsense knowledge ,Artificial neural network ,business.industry ,Computer science ,02 engineering and technology ,010501 environmental sciences ,computer.software_genre ,01 natural sciences ,Regularization (mathematics) ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,computer ,Natural language processing ,0105 earth and related environmental sciences - Abstract
We argue that external commonsense knowledge and linguistic constraints need to be incorporated into neural network models for mitigating data sparsity issues and further improving the performance of discourse parsing. Realizing that external knowledge and linguistic constraints may not always apply in understanding a particular context, we propose a regularization approach that tightly integrates these constraints with contexts for deriving word representations. Meanwhile, it balances attentions over contexts and constraints through adding a regularization term into the objective function. Experiments show that our knowledge regularization approach outperforms all previous systems on the benchmark dataset PDTB for discourse parsing.
- Published
- 2019
- Full Text
- View/download PDF
5. Study and Comparison of Wind Power Correlation Using Two Types of Dataset
- Author
-
Yannan Luo, Zeyu Dai, and Yurong Wang
- Subjects
Wind power ,Computer science ,business.industry ,020209 energy ,02 engineering and technology ,computer.software_genre ,Wind speed ,Copula (probability theory) ,Correlation ,Goodness of fit ,0202 electrical engineering, electronic engineering, information engineering ,Power grid ,Data mining ,business ,computer - Abstract
With the integration of large-scale wind farms, stability and economy of power grid are greatly challenged. Considering the stochastic characteristics of wind and the coupling relationship of geographically distributed wind farms, this paper presents the comparison of analysis methods and results when using wind speed dataset and wind power output dataset in wind power spatial correlation research based on copula. Combined with several revised criteria, a novel method is proposed to select copula type and judge goodness of fit. In the case study, copula models and typical scenarios using the two datasets are highlighted and compared. Study results clarified the advantages and disadvantages of using different datasets.
- Published
- 2018
- Full Text
- View/download PDF
6. Building Context-aware Clause Representations for Situation Entity Type Classification
- Author
-
Ruihong Huang and Zeyu Dai
- Subjects
FOS: Computer and information sciences ,Computer Science - Computation and Language ,Interpretation (logic) ,Computer science ,business.industry ,Context (language use) ,02 engineering and technology ,Type (model theory) ,16. Peace & justice ,computer.software_genre ,TheoryofComputation_MATHEMATICALLOGICANDFORMALLANGUAGES ,Entity type ,Categorization ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,Paragraph ,business ,Computation and Language (cs.CL) ,computer ,Natural language processing - Abstract
Capabilities to categorize a clause based on the type of situation entity (e.g., events, states and generic statements) the clause introduces to the discourse can benefit many NLP applications. Observing that the situation entity type of a clause depends on discourse functions the clause plays in a paragraph and the interpretation of discourse functions depends heavily on paragraph-wide contexts, we propose to build context-aware clause representations for predicting situation entity types of clauses. Specifically, we propose a hierarchical recurrent neural network model to read a whole paragraph at a time and jointly learn representations for all the clauses in the paragraph by extensively modeling context influences and inter-dependencies of clauses. Experimental results show that our model achieves the state-of-the-art performance for clause-level situation entity classification on the genre-rich MASC+Wiki corpus, which approaches human-level performance., Comment: Accepted by EMNLP 2018
- Published
- 2018
- Full Text
- View/download PDF
7. Online Deception Detection Refueled by RealWorld Data Collection
- Author
-
Ruihong Huang, James Caverlee, Wenlin Yao, and Zeyu Dai
- Subjects
Data collection ,Information retrieval ,Computer science ,business.industry ,media_common.quotation_subject ,02 engineering and technology ,010501 environmental sciences ,Deception ,computer.software_genre ,01 natural sciences ,Bottleneck ,Writing style ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Quality (business) ,Product (category theory) ,Artificial intelligence ,business ,computer ,Real world data ,Social network analysis ,Natural language processing ,0105 earth and related environmental sciences ,media_common - Abstract
The lack of large realistic datasets presents a bottleneck in online deception detection studies. In this paper, we apply a data collection method based on social network analysis to quickly identify high quality deceptive and truthful online reviews1 from Amazon. The dataset contains more than 10,000 deceptive reviews and is diverse in product domains and reviewers. Using this dataset, we explore effective general features for online deception detection that perform well across domains. We demonstrate that with generalized features – advertising speak and writing complexity scores – deception detection performance can be further improved by adding additional deceptive reviews from assorted domains in training. Finally, reviewer level evaluation gives an interesting insight into different deceptive reviewers’ writing styles.
- Published
- 2017
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.