Results for 'Corpus-based speech synthesis Gaussian mixture models'

999 found
Order:
  1.  64
    Automatic phonetic segmentation of Hindi speech using hidden Markov model.Archana Balyan, S. S. Agrawal & Amita Dev - 2012 - AI and Society 27 (4):543-549.
    In this paper, we study the performance of baseline hidden Markov model (HMM) for segmentation of speech signals. It is applied on single-speaker segmentation task, using Hindi speech database. The automatic phoneme segmentation framework evolved imitates the human phoneme segmentation process. A set of 44 Hindi phonemes were chosen for the segmentation experiment, wherein we used continuous density hidden Markov model (CDHMM) with a mixture of Gaussian distribution. The left-to-right topology with no skip states has been (...)
    Direct download (4 more)  
     
    Export citation  
     
    Bookmark  
  2.  6
    Simulation of Tennis Match Scene Classification Algorithm Based on Adaptive Gaussian Mixture Model Parameter Estimation.Yuwei Wang & Mofei Wen - 2021 - Complexity 2021:1-12.
    This paper presents an in-depth analysis of tennis match scene classification using an adaptive Gaussian mixture model parameter estimation simulation algorithm. We divided the main components of semantic analysis into type of motion, distance of motion, speed of motion, and landing area of the tennis ball. Firstly, for the problem that both people and tennis balls in the video frames of tennis matches from the surveillance viewpoint are very small, we propose an adaptive Gaussian mixture model (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  3.  6
    End-to-End Speech Synthesis for Tibetan Multidialect.Xiaona Xu, Li Yang, Yue Zhao & Hui Wang - 2021 - Complexity 2021:1-8.
    The research on Tibetan speech synthesis technology has been mainly focusing on single dialect, and thus there is a lack of research on Tibetan multidialect speech synthesis technology. This paper presents an end-to-end Tibetan multidialect speech synthesis model to realize a speech synthesis system which can be used to synthesize different Tibetan dialects. Firstly, Wylie transliteration scheme is used to convert the Tibetan text into the corresponding Latin letters, which effectively reduces the (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  4.  12
    Damage Detection of Refractory Based on Principle Component Analysis and Gaussian Mixture Model.Changming Liu, Zhigang di ZhouWang, Dan Yang & Gangbing Song - 2018 - Complexity 2018:1-9.
    Acoustic emission technique is a common approach to identify the damage of the refractories; however, there is a complex problem since there are as many as fifteen involved parameters, which calls for effective data processing and classification algorithms to reduce the level of complexity. In this paper, experiments involving three-point bending tests of refractories were conducted and AE signals were collected. A new data processing method of merging the similar parameters in the description of the damage and reducing the dimension (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   2 citations  
  5.  5
    Strudel: A CorpusBased Semantic Model Based on Properties and Types.Marco Baroni, Eduard Barbu, Brian Murphy & Massimo Poesio - 2010 - Cognitive Science 34 (2):222-254.
    Computational models of meaning trained on naturally occurring text successfully model human performance on tasks involving simple similarity measures, but they characterize meaning in terms of undifferentiated bags of words or topical dimensions. This has led some to question their psychological plausibility (Murphy, 2002;Schunn, 1999). We present here a fully automatic method for extracting a structured and comprehensive set of concept descriptions directly from an English part‐of‐speech‐tagged corpus. Concepts are characterized by weighted properties, enriched with concept–property types (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   7 citations  
  6.  46
    Strudel: A CorpusBased Semantic Model Based on Properties and Types.Marco Baroni, Brian Murphy, Eduard Barbu & Massimo Poesio - 2010 - Cognitive Science 34 (2):222-254.
    Computational models of meaning trained on naturally occurring text successfully model human performance on tasks involving simple similarity measures, but they characterize meaning in terms of undifferentiated bags of words or topical dimensions. This has led some to question their psychological plausibility (Murphy, 2002;Schunn, 1999). We present here a fully automatic method for extracting a structured and comprehensive set of concept descriptions directly from an English part‐of‐speech‐tagged corpus. Concepts are characterized by weighted properties, enriched with concept–property types (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   6 citations  
  7.  13
    Robust Algorithms for a Multimodal Biometric System Using Palmprint and Speech.R. Raghavendra - 2011 - Journal of Intelligent Systems 20 (4):305-326.
    In this paper, we propose a person verification scheme using a novel combination of palmprint and speech. The crucial aspect of biometric based verification lies in its use of features in verification. Thus, in this paper, we propose two novel feature extraction methods for palmprint verification. The proposed methods are based on the Gaussian mixture model followed by subspace based approaches such as ICA I and ICA II, called independent component analysis I mixture (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  8.  38
    Speaker Identification Using Empirical Mode Decomposition-Based Voice Activity Detection Algorithm under Realistic Conditions.R. Kumaraswamy, V. Kamakshi Prasad, Nilabh Kumar Pathak & M. S. Rudramurthy - 2014 - Journal of Intelligent Systems 23 (4):405-421.
    Speaker recognition under mismatched conditions is a challenging task. Speech signal is nonlinear and nonstationary, and therefore, difficult to analyze under realistic conditions. Also, in real conditions, the nature of the noise present in speech data is not known a priori. In such cases, the performance of speaker identification or speaker verification degrades considerably under realistic conditions. Any SR system uses a voice activity detector as the front-end subsystem of the whole system. The performance of most VADs deteriorates (...)
    Direct download  
     
    Export citation  
     
    Bookmark  
  9.  14
    Speaker Verification Under Degraded Conditions Using Empirical Mode Decomposition Based Voice Activity Detection Algorithm.R. Kumaraswamy, V. Kamakshi Prasad & M. S. Rudramurthy - 2014 - Journal of Intelligent Systems 23 (4):359-378.
    The performance of most of the state-of-the-art speaker recognition systems deteriorates under degraded conditions, owing to mismatch between the training and testing sessions. This study focuses on the front end of the speaker verification system to reduce the mismatch between training and testing. An adaptive voice activity detection algorithm using zero-frequency filter assisted peaking resonator was integrated into the front end of the SV system. The performance of this proposed SV system was studied under degraded conditions with 50 selected speakers (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  10.  9
    Discriminatively trained continuous Hindi speech recognition using integrated acoustic features and recurrent neural network language modeling.R. K. Aggarwal & A. Kumar - 2020 - Journal of Intelligent Systems 30 (1):165-179.
    This paper implements the continuous Hindi Automatic Speech Recognition (ASR) system using the proposed integrated features vector with Recurrent Neural Network (RNN) based Language Modeling (LM). The proposed system also implements the speaker adaptation using Maximum-Likelihood Linear Regression (MLLR) and Constrained Maximum likelihood Linear Regression (C-MLLR). This system is discriminatively trained by Maximum Mutual Information (MMI) and Minimum Phone Error (MPE) techniques with 256 Gaussian mixture per Hidden Markov Model(HMM) state. The training of the baseline system (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  11.  67
    Expectation-Maximization Algorithm of Gaussian Mixture Model for Vehicle-Commodity Matching in Logistics Supply Chain.Qi Sun, Liwen Jiang & Haitao Xu - 2021 - Complexity 2021:1-11.
    A vehicle-commodity matching problem is presented for service providers to reduce the cost of the logistics system. The vehicle classification model is built as a Gaussian mixture model, and the expectation-maximization algorithm is designed to solve the parameter estimation of GMM. A nonlinear mixed-integer programming model is constructed to minimize the total cost of VCMP. The matching process between vehicle and commodity is realized by GMM-EM, as a preprocessing of the solution. The design of the vehicle-commodity matching platform (...)
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  12.  20
    Robot Motion Planning Method Based on Incremental High-Dimensional Mixture Probabilistic Model.Fusheng Zha, Yizhou Liu, Xin Wang, Fei Chen, Jingxuan Li & Wei Guo - 2018 - Complexity 2018:1-14.
    The sampling-based motion planner is the mainstream method to solve the motion planning problem in high-dimensional space. In the process of exploring robot configuration space, this type of algorithm needs to perform collision query on a large number of samples, which greatly limits their planning efficiency. Therefore, this paper uses machine learning methods to establish a probabilistic model of the obstacle region in configuration space by learning a large number of labeled samples. Based on this, the high-dimensional samples’ (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  13.  14
    Classification of Infant Cries Using Dynamics of Epoch Features.Kapinaiah Viswanath, K. Sreenivasa Rao, Jayanta Mukhopadhyay & Avinash Kumar Singh - 2013 - Journal of Intelligent Systems 22 (3):351-364.
    In this article, epoch-based dynamic features such as sequence of epoch interval values and epoch strength values are explored to classify infant cries. Epoch is the instant of significant excitation of the vocal tract system during the production of speech. For voiced speech, the most significant excitation takes place around the instant of glottal closure. The different types of infant cries considered in this work are hunger, pain, and wet diaper. In this work, epoch strength and epoch (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  14.  9
    College Students’ Psychological Health Analysis Based on Multitask Gaussian Graphical Models.Qiang Tian, Rui Wang, Shijie Li, Wenjun Wang, Ou Wu, Faming Li & Pengfei Jiao - 2021 - Complexity 2021:1-17.
    Understanding and solving the psychological health problems of college students have become a focus of social attention. Complex networks have become important tools to study the factors affecting psychological health, and the Gaussian graphical model is often used to estimate psychological networks. However, previous studies leave some gaps to overcome, including the following aspects. When studying networks of subpopulations, the estimation neglects the intrinsic relationships among subpopulations, leading to a large difference between the estimated network and the real network. (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  15.  11
    Recognition of English speech – using a deep learning algorithm.Shuyan Wang - 2023 - Journal of Intelligent Systems 32 (1).
    The accurate recognition of speech is beneficial to the fields of machine translation and intelligent human–computer interaction. After briefly introducing speech recognition algorithms, this study proposed to recognize speech with a recurrent neural network (RNN) and adopted the connectionist temporal classification (CTC) algorithm to align input speech sequences and output text sequences forcibly. Simulation experiments compared the RNN-CTC algorithm with the Gaussian mixture model–hidden Markov model and convolutional neural network-CTC algorithms. The results demonstrated that (...)
    Direct download  
     
    Export citation  
     
    Bookmark  
  16.  27
    確率的 Web 画像収集.Yanai Keiji - 2007 - Transactions of the Japanese Society for Artificial Intelligence 22 (1):10-18.
    We propose a novel probabilistic image selection method for the Web image gathering system we proposed before. It employed two-step processing: (1) Gather HTML files of Web pages related to given keywords, analyze them and fetch only Web images expected to be highly related to the keywords. (2) Select only relevant images from the gathered images based on the image-feature-based clustering. In this paper, we propose building a generative model based on the Gaussian mixture model (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  17.  14
    Adjacent and Non‐Adjacent Word Contexts Both Predict Age of Acquisition of English Words: A Distributional Corpus Analysis of Child‐Directed Speech.Lucas M. Chang & Gedeon O. Deák - 2020 - Cognitive Science 44 (11):e12899.
    Children show a remarkable degree of consistency in learning some words earlier than others. What patterns of word usage predict variations among words in age of acquisition? We use distributional analysis of a naturalistic corpus of child‐directed speech to create quantitative features representing natural variability in word contexts. We evaluate two sets of features: One set is generated from the distribution of words into frames defined by the two adjacent words. These features primarily encode syntactic aspects of word (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  18.  17
    Generating Facial Expressions for Speech.Catherine Pelachaud, Norman I. Badler & Mark Steedman - 1996 - Cognitive Science 20 (1):1-46.
    This article reports results from a program that produces high‐quality animation of facial expressions and head movements as automatically as possible in conjunction with meaning‐based speech synthesis, including spoken intonation. The goal of the research is as much to test and define our theories of the formal semantics for such gestures, as to produce convincing animation. Towards this end, we have produced a high‐level programming language for three‐dimensional (3‐D) animation of facial expressions. We have been concerned primarily (...)
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark   3 citations  
  19.  6
    What corpus-based Cognitive Linguistics can and cannot expect from neurolinguistics.Alice Blumenthal-Dramé - 2016 - Cognitive Linguistics 27 (4):493-505.
    This paper argues that neurolinguistics has the potential to yield insights that can feed back into corpus-based Cognitive Linguistics. It starts by discussing how far the cognitive realism of probabilistic statements derived from corpus data currently goes. Against this background, it argues that the cognitive realism of usage-based models could be further enhanced through deeper engagement with neurolinguistics, but also highlights a number of common misconceptions about what neurolinguistics can and cannot do for linguistic theorizing.
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   5 citations  
  20.  5
    Approche sur corpus des compétences pragmatiques et multimodales des personnes 'gées présentant un trouble cognitif léger.Guillaume Duboisdindien, Cyril Grandin, Dominique Boutet & Anne Lacheret-Dujour - 2018 - Corpus 19.
    This article presents a multimodal video corpus with the principal aim to model and predict the effects of aging in Mild Cognitive Impairment situation on pragmatic and communicative skills. We take as observable variables the verbal pragmatic markers and non-verbal pragmatic markers. This approach, at the interface of the psycholinguistics, cognitive sciences and rehabilitation medicine (speech-language pathology and therapy) is part of a longitudinal research process in an ecological situation (interviews conducted by close intimate of the elderly).In the (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  21.  31
    A probabilistic corpus-based model of syntactic parallelism.Amit Dubey, Frank Keller & Patrick Sturt - 2008 - Cognition 109 (3):326-344.
    Direct download (4 more)  
     
    Export citation  
     
    Bookmark   10 citations  
  22.  5
    Using Statistical Model to Study the Daily Closing Price Index in the Kingdom of Saudi Arabia.Hassan M. Aljohani & Azhari A. Elhag - 2021 - Complexity 2021:1-5.
    Classification in statistics is usually used to solve the problems of identifying to which set of categories, such as subpopulations, new observation belongs, based on a training set of data containing information whose category membership is known. The article aims to use the Gaussian Mixture Model to model the daily closing price index over the period of 1/1/2013 to 16/8/2020 in the Kingdom of Saudi Arabia. The daily closing price index over the period declined, which might be (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  23. FBST for Mixture Model Selection.Julio Michael Stern & Marcelo de Souza Lauretto - 2005 - AIP Conference Proceedings 803:121-128.
    The Fully Bayesian Significance Test (FBST) is a coherent Bayesian significance test for sharp hypotheses. This paper proposes the FBST as a model selection tool for general mixture models, and compares its performance with Mclust, a model-based clustering software. The FBST robust performance strongly encourages further developments and investigations.
    Direct download  
     
    Export citation  
     
    Bookmark   1 citation  
  24.  15
    Are Words Easier to Learn From Infant‐ Than Adult‐Directed Speech? A Quantitative CorpusBased Investigation.Adriana Guevara-Rukoz, Alejandrina Cristia, Bogdan Ludusan, Roland Thiollière, Andrew Martin, Reiko Mazuka & Emmanuel Dupoux - 2018 - Cognitive Science 42 (5):1586-1617.
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark   2 citations  
  25.  3
    Zero quoting in the speech of British and Spanish teenagers: A contrastive corpus-based study.Ignacio M. Palacios Martínez - 2013 - Discourse Studies 15 (4):439-462.
    Quotatives have been studied extensively in the language of teenagers in recent years as they present distinctive features of their own that make them different in part from those used by adults in mainstream English and Spanish. However, zero quoting has not received all the attention it certainly deserves as it has not been fully probed in terms of its discourse and pragmatic functions. This corpus-based study is focused on the strategies used by British and Spanish teenagers to (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  26.  12
    Corpus-Based Metaphorical Framing Analysis: WAR Metaphors in Hong Kong Public Discourse.Winnie Huiheng Zeng & Kathleen Ahrens - 2023 - Metaphor and Symbol 38 (3):254-274.
    This study proposes an operational approach to a metaphorical framing analysis using large-scale data. We conducted a case analysis of how war metaphors are framed to address various societal issues in a corpus of public speeches by Hong Kong government officials. By investigating patterns of lexical choices under the source domain of WAR and the underlying reasons for the source-target domain mappings (i.e. Mapping Principles), we found that the target domain of social issues in Hong Kong is primarily conceptualized (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  27.  27
    Machine Meets Man: Evaluating the Psychological Reality of Corpus-based Probabilistic Models.Dagmar Divjak, Ewa Dąbrowska & Antti Arppe - 2016 - Cognitive Linguistics 27 (1):1-33.
    Name der Zeitschrift: Cognitive Linguistics Jahrgang: 27 Heft: 1 Seiten: 1-33.
    No categories
    Direct download (4 more)  
     
    Export citation  
     
    Bookmark   9 citations  
  28. The logic of indirect speech.Steven Pinker - manuscript
    When people speak, they often insinuate their intent indirectly rather than stating it as a bald proposition. Examples include sexual come-ons, veiled threats, polite requests, and concealed bribes. We propose a three-part theory of indirect speech, based on the idea that human communication involves a mixture of cooperation and conflict. First, indirect requests allow for plausible deniability, in which a cooperative listener can accept the request, but an uncooperative one cannot react adversarially to it. This intuition is (...)
     
    Export citation  
     
    Bookmark   28 citations  
  29.  7
    Residual-Based Algorithm for Growth Mixture Modeling: A Monte Carlo Simulation Study.Katerina M. Marcoulides & Laura Trinchera - 2021 - Frontiers in Psychology 12.
    Growth mixture models are regularly applied in the behavioral and social sciences to identify unknown heterogeneous subpopulations that follow distinct developmental trajectories. Marcoulides and Trinchera recently proposed a mixture modeling approach that examines the presence of multiple latent classes by algorithmically grouping or clustering individuals who follow the same estimated growth trajectory based on an evaluation of individual case residuals. The purpose of this article was to conduct a simulation study that examines the performance of this (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  30.  6
    A corpus-based insight into genre: The case of WIPO domain name arbitration decisions.Laura Martínez Escudero - 2011 - Discourse and Communication 5 (4):375-392.
    To prevent domains from cyber-piracy, the WIPO offers private and confidential procedures tasked to address the legitimate use of a domain name. WIPO domain name arbitration consists of an alternative dispute resolution process in which one or more panelists make a binding decision over the legitimacy of a domain. This article investigates the structure of the discourse of this professional genre. Following Maley, this study focuses, first, on spotting the generic moves of WIPO domain name arbitration decisions. Second, the analysis (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  31. A Corpus-based Cognitive Linguistic Analysis of Pre-existing Knowledge of Scientific Terminology: The Case of English Energy and Arabic طَاقَة (ṭāqa).Hicham Lahlou - 2020 - Arab World English Journal for Translation and Literary Studies 4 (1):3-13.
    The present paper aims to broaden the current understanding of students’ misconception of scientific terminology by identifying the gaps between Arabic and English scientific terminologies and between everyday language and scientific language. The paper compares the polysemy, prototypes, and motivating factors of English energy with those of Arabic طَاقَة (ṭāqa), with more focus on students’ prior knowledge. The study employs Lakoff’s (1987) idealized cognitive models and Rosch’s (1975) prototype theory to reveal the radial members of both categories, i.e., energy (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  32.  67
    On estimation of functional causal models : general results and application to the post-nonlinear causal model.Kun Zhang, Zhikun Wang, Jiji Zhang & Bernhard Scholkopf - unknown
    Compared to constraint-based causal discovery, causal discovery based on functional causal models is able to identify the whole causal model under appropriate assumptions [Shimizu et al. 2006; Hoyer et al. 2009; Zhang and Hyvärinen 2009b]. Functional causal models represent the effect as a function of the direct causes together with an independent noise term. Examples include the linear non-Gaussian acyclic model, nonlinear additive noise model, and post-nonlinear model. Currently, there are two ways to estimate the (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   3 citations  
  33.  9
    Modeling the Influence of Language Input Statistics on Children's Speech Production.Ingeborg Roete, Stefan L. Frank, Paula Fikkert & Marisa Casillas - 2020 - Cognitive Science 44 (12):e12924.
    We trained a computational model (the Chunk-Based Learner; CBL) on a longitudinal corpus of child–caregiver interactions in English to test whether one proposed statistical learning mechanism—backward transitional probability—is able to predict children's speech productions with stable accuracy throughout the first few years of development. We predicted that the model less accurately reconstructs children's speech productions as they grow older because children gradually begin to generate speech using abstracted forms rather than specific “chunks” from their (...) environment. To test this idea, we trained the model on both recently encountered and cumulative speech input from a longitudinal child language corpus. We then assessed whether the model could accurately reconstruct children's speech. Controlling for utterance length and the presence of duplicate chunks, we found no evidence that the CBL becomes less accurate in its ability to reconstruct children's speech with age. (shrink)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  34. The Full Bayesian Significance Test for Mixture Models: Results in Gene Expression Clustering.Julio Michael Stern, Marcelo de Souza Lauretto & Carlos Alberto de Braganca Pereira - 2008 - Genetics and Molecular Research 7 (3):883-897.
    Gene clustering is a useful exploratory technique to group together genes with similar expression levels under distinct cell cycle phases or distinct conditions. It helps the biologist to identify potentially meaningful relationships between genes. In this study, we propose a clustering method based on multivariate normal mixture models, where the number of clusters is predicted via sequential hypothesis tests: at each step, the method considers a mixture model of m components (m = 2 in the first (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  35.  12
    Thematic and paradigm models of the concept system of science.Konstantin I. Belousov, Dmitriy A. Baranov & Elena A. Erofeeva - 2018 - Epistemology and Philosophy of Science 55 (1):184-203.
    The article describes two approaches to modeling the concept system of science – the thematic and paradigm ones. The re­search represents a case study of the two corpuses of abstracts: abstracts of projects supported by the Department of Humanities and Social Sciences of the Russian Federal Property Fund in lin­guistics, as well abstracts of articles by authors (and their co-au­thors) who have received multiple support from this foundation. Thematic modeling was carried out within the frameworks of two approaches: сcorpus (...) approach (modeling the system of concepts as a holistic entity of term fields network), and single text approach (singling out compositions of term fields steadily present in the texts of projects which can be treated as separate branches of linguistics). The method of semantic graph modeling realized in the “Semograph” Information system (corpus based approach) and the K-means clustering method (single text based approach) were applied. Arrays of keywords (those that occurred in the abstracts of projects supported by the Foundation) grouped into term fields served as operational units. Paradigmatic model­ing was based on the analysis of network interaction of research­ers (created on the basis of the information on joint publications). After the hypergraph of researchers (2,108 state points) had been created, it was divided into subgraphs (network communities) by means of the modularity method (analogue of cluster analysis). Each cluster can be considered as a model of a scientific commu­nity that actualize a certain scientific paradigm; a set of clusters represents a model of the concept system of a certain subject domain from the point of view of representation and interrela­tion of several paradigms. The synthesis of thematic and paradig­matic models appears to be the direction for future research that involves modeling the concept system of science. The considered models can be applied as an effective means of monitoring, fore­casting and managing scientific research, i.e. as an instrument of state and/or department policy. (shrink)
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark  
  36.  65
    Manufacturing Consent: A corpusbased critical discourse analysis of New Labour's educational governance.Jane Mulderrig - 2011 - Educational Philosophy and Theory 43 (6):562-578.
    This paper presents selected findings from a historical analysis of change in the discursive construction of social identity in UK education policy discourse from 1972–2005. My chief argument is that through its linguistic forms of self-identification the government construes educational roles, relations and responsibilities not only for itself, but also for other educational actors and wider society. More specifically, I argue that New Labour's distinctive mode of self-representation is an important element in its hegemonic project, textually manufacturing consent over its (...)
    Direct download (4 more)  
     
    Export citation  
     
    Bookmark   5 citations  
  37.  6
    Searching for Statesmanship: a Corpus-Based Analysis of a Translated Political Discourse.Henry Jones - 2019 - Polis 36 (2):216-241.
    With its connotations of superior moral integrity, exceptional leadership qualities and expertise in the science of government, the modern ideal of statesmanship is most commonly traced back to the ancient Greek concept of πολιτικός and the work of Plato and Aristotle in particular. Through an analysis of a large corpus of modern English translations of political works, built as part of the AHRC Genealogies of Knowledge project, this case-study aims to explore patterns that are specific to this translated discourse, (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   2 citations  
  38. Abstract of "part-of-speech tagging of modern hebrew texts".Yoad Winter - unknown
    Words in Semitic texts often consist of a concatenation of word segments, each corresponding to a Part-of-Speech (POS) category. Semitic words may be ambiguous with regard to their segmentation as well as to the POS tags assigned to each segment. When designing POS taggers for Semitic languages, a major architectural decision concerns the choice of the atomic input tokens (terminal symbols). If the tokenization is at the word level the output tags must be complex, and represent both the segmentation (...)
    No categories
     
    Export citation  
     
    Bookmark  
  39.  4
    Gaussian process-based analysis of the nitrogen dioxide at Madrid Central Low Emission Zone.Juan Luis Gómez-González & Miguel Cárdenas-Montes - forthcoming - Logic Journal of the IGPL.
    Concern about air-quality in urban areas has led to the implementation of Low Emission Zones as one of many other initiatives to control it. Recently in Spain, the enactment of a law made this mandatory for cities with a population larger than 50k inhabitants. The delimitation of these areas is not without controversy because of possible negative economic and social impacts. Therefore, clear assessments of how these initiatives decrease pollutant concentrations are to be provided. Madrid Central is a major initiative (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  40.  4
    A Comparative Corpus-Based Study on the Political Discourse of the U.S. Presidents: Obama and Trump.Arta Toçi & Enes Ismeti - 2022 - Seeu Review 17 (2):71-86.
    The aim of this research is to analyze the political discourse and the language they used in public addresses provided by two former presidents of the United States, specifically, President Barak Obama and President Donald Trump. This research reflects on the material that has been collected for several months which aimed to contribute on the analysis of the corpus, distinctions and similarities, as well as their attitudes towards the public opinion. The objectives of this study are mainly empirical and (...)
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  41.  3
    A Study of Word Complexity Under Conditions of Non-experimental, Natural Overt Speech Production Using ECoG.Olga Glanz, Marina Hader, Andreas Schulze-Bonhage, Peter Auer & Tonio Ball - 2022 - Frontiers in Human Neuroscience 15:711886.
    The linguistic complexity of words has largely been studied on the behavioral level and in experimental settings. Only little is known about the neural processes underlying it in uninstructed, spontaneous conversations. We built up a multimodal neurolinguistic corpus composed of synchronized audio, video, and electrocorticographic (ECoG) recordings from the fronto-temporo-parietal cortex to address this phenomenon based on uninstructed, spontaneous speech production. We performed extensive linguistic annotations of the language material and calculated word complexity using several numeric parameters. (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  42.  24
    Markers of Topical Discourse in Child‐Directed Speech.Hannah Rohde & Michael C. Frank - 2014 - Cognitive Science 38 (8):1634-1661.
    Although the language we encounter is typically embedded in rich discourse contexts, many existing models of processing focus largely on phenomena that occur sentence-internally. Similarly, most work on children's language learning does not consider how information can accumulate as a discourse progresses. Research in pragmatics, however, points to ways in which each subsequent utterance provides new opportunities for listeners to infer speaker meaning. Such inferences allow the listener to build up a representation of the speakers' intended topic and more (...)
    No categories
    Direct download (4 more)  
     
    Export citation  
     
    Bookmark   3 citations  
  43.  29
    カーネル密度推定器としての実数値交叉: Undx に基づく交叉カーネルの提案.Kobayashi Shigenobu Sakuma Jun - 2007 - Transactions of the Japanese Society for Artificial Intelligence 22 (5):520-530.
    This paper presents a kernel density estimation method by means of real-coded crossovers. Functions of real-coded crossover operators are composed of probabilistic density estimation from parental populations and sampling from estimated models. Real-coded Genetic Algorithm (RCGA) does not explicitly estimate probabilistic distributions, however, probabilistic model estimation is implicitly included in algorithms of real-coded crossovers. Based on this understanding, we exploit the implicit estimation of probabilistic distribution of crossovers as a kernel density estimator. We also propose an application of (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  44.  35
    Children’s Production of Unfamiliar Word Sequences Is Predicted by Positional Variability and Latent Classes in a Large Sample of Child-Directed Speech.Danielle Matthews & Colin Bannard - 2010 - Cognitive Science 34 (3):465-488.
    Direct download  
     
    Export citation  
     
    Bookmark   5 citations  
  45.  34
    Defamation case law in Hong Kong: A corpus-based study.Winnie le ChengCheng & Jian Li - 2016 - Semiotica 2016 (208):203-222.
    Defamation law is a long-standing research focus. Previous studies on defamation law have pointed out the importance of balancing two fundamental issues in law, namely, protection of reputation and freedom of speech. The present corpus-based legal study, using ConcGram 1.0 as the analytical tool, examined the phraseological profile of reported cases on defamation in Hong Kong in order to find out the types of defense and the approach to meaning in the defamation case law in Hong Kong. (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   2 citations  
  46.  29
    Forgetting of Foreign‐Language Skills: A CorpusBased Analysis of Online Tutoring Software.Ridgeway Karl, C. Mozer Michael & R. Bowles Anita - 2017 - Cognitive Science 41 (4):924-949.
    We explore the nature of forgetting in a corpus of 125,000 students learning Spanish using the Rosetta Stone® foreign-language instruction software across 48 lessons. Students are tested on a lesson after its initial study and are then retested after a variable time lag. We observe forgetting consistent with power function decay at a rate that varies across lessons but not across students. We find that lessons which are better learned initially are forgotten more slowly, a correlation which likely reflects (...)
    Direct download  
     
    Export citation  
     
    Bookmark   4 citations  
  47.  14
    Reporting Verbs in Court Judgments of the Common Law System: A Corpus-Based Study.Wei Yu - 2020 - International Journal for the Semiotics of Law - Revue Internationale de Sémiotique Juridique 34 (2):525-560.
    Professionals in various disciplines adopt significantly different lexicons to report their discoveries and arguments. Scientists discover, philosophers argue, whereas legal practitioners apply and consider. Reporting, as a ubiquitous linguistic phenomenon, has its disciplinary characteristics. In court judgments, it reflects the way judges identify the evidence of different documents or other courts. In the self-built court judgment corpus, the paper focuses on the way that judicial arguments are constructed through reporting verbs. On the basis of the analysis of the representation (...)
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  48. What Is the Basic Unit of Scientific Progress? A Quantitative, Corpus-Based Study.Moti Mizrahi - 2022 - Journal for General Philosophy of Science / Zeitschrift für Allgemeine Wissenschaftstheorie 53 (4):441-458.
    This paper presents the results of an empirical study following up on Mizrahi (2021). Using the same methods of text mining and corpus analysis used by Mizrahi (2021), we test empirically a philosophical account of scientific progress that Mizrahi (2021) left out of his empirical study, namely, the so-called functional-internalist account of scientific progress according to which the aim or goal or scientific research is to solve problems. In general, our results do not lend much empirical evidence in support (...)
    Direct download (5 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  49.  30
    Predicting syntactic choice in Mandarin Chinese: a corpus-based analysis of ba sentences and SVO sentences.Haitao Liu & Yu Fang - 2021 - Cognitive Linguistics 32 (2):219-250.
    This paper investigates the effects of 10 factors on the choice between alternative ba sentences and SVO sentences in Mandarin Chinese. These factors are givenness, definiteness, animacy and pronominality of NP2s, NP2 length, VP length, verb sense, syntactic parallelism, dependency distance, and surprisal. Using corpus data and mixed-effects logistic regression modeling, we find that on the one hand, givenness, syntactic parallelism, and the log-transformed ratio of NP2 length and VP length are significant predictors of the choice between ba sentences (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  50.  25
    Finding variants for construction-based dialectometry: A corpus-based approach to regional CxGs.Jonathan Dunn - 2018 - Cognitive Linguistics 29 (2):275-311.
    This paper develops a construction-based dialectometry capable of identifying previously unknown constructions and measuring the degree to which a given construction is subject to regional variation. The central idea is to learn a grammar of constructions using construction grammar induction and then to use these constructions as features for dialectometry. This offers a method for measuring the aggregate similarity between regional CxGs without limiting in advance the set of constructions subject to variation. The learned CxG is evaluated on how (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   1 citation  
1 — 50 / 999