Kenny Smith is one of my computational linguistics professors at Edinburgh University. He is great at guiding you through complex mathematical concepts related to language, with a northern accent and a laid-back attitude. In his series of papers on computational models of language acquisition, he covers a very wide range of issues in the evolution and transfer of language through cultural interaction, helped along by some learning biases. He has built models of language generation, acquisition, maintenance and creation. These proved ecologically relevant, since they predicted (or followed) the findings of empirical studies in developmental psychology. In his 2005 paper “Inferential Transmission of Language”, he explores the very interesting issues of lexical acquisition in infants and provides computational models to simulate them. This article will provide a short, read-me-before-the-exam synopsis and evaluation of the main issues Kenny brings forward.

In the paper “On Learning the Past Tenses of English Verbs”, Rumelhart and McClelland explore the ability of networks to extract rules from the input given to them in a way that differs from traditional rule-based systems. This ties into the debate about the LAD (Language Acquisition Device), the idea that humans are born with an innate ability to extract and process the rules of language from very poor input. They built a network that simulates past tense acquisition in children, based on the following stages from developmental psychology:

  1. Children use only a small number of verbs in the past tense, all high-frequency words. Most of them are irregulars. There’s no evidence of the use of any rules.
  2. Evidence of implicit knowledge of a linguistic rule emerges. Children use a much larger set of verbs in the past tense.
    • The child can now generate a past tense for an invented word (e.g. rick – ricked)
    • Children supply incorrect regular past-tense endings for words they used correctly in stage 1
  3. Regular and irregular forms co-exist. Children regain knowledge of the correct irregular forms, and they apply the regular form to new verbs. This persists into adulthood.

These stages (like everything in psychology) aren’t really distinct and sequential; in fact, the transitions between them are gradual.

While the connectionist approach shows great potential and has been at the forefront of many an advance in cognitive science and AI in general, a number of important challenges remain. Let’s have a look at Elman’s favorite issues.

In the book Rethinking Innateness, by Elman et al., we’re given an introduction to the advantages of connectionism as an explanatory model of human development and language acquisition. Connectionist models are usually built out of a number of nodes interconnected by communication channels. These nodes receive weighted activation and, depending on their internal threshold function, send out an activation of their own. Nodes can be connected and arranged in countless structures, from a simple layered network to incredibly complex multi-modular structures. The connections between the nodes have weights, and it is there that the knowledge of the network is stored.

[Figure: 3-layer feed-forward network]

These weights are always real-valued, and they alter the incoming value by multiplying it by their own value. So if a connection with a weight of -2.0 carries an input of 0.5, the resulting signal is -1.0, an inhibitory signal (if the result were positive, the signal would be excitatory). The net input of any given node is the sum, over all of its input nodes, of the products of each input’s activation and the weight of its connection. The node then has an internal response function that outputs a value according to the net input. This is usually a threshold function, which outputs a value in the range of 0 to 1 depending on the input. The most commonly used is the sigmoid, or logistic, function. For very large inputs it outputs values close to 1; for very small (large negative) inputs it outputs values close to 0. When the input is close to 0, the output falls somewhere in between (0.5 for the logistic function at a net input of exactly 0). Changing the threshold function will give us either more sensitive thresholds or more abrupt ones. The “magic” of these systems is that their nonlinear response allows for fine-grained distinctions between categories that are continuous in nature. In practice, however, we usually regard the outputs as zero or one (more on this later). Bias nodes are always on, and they allow us to set the default behavior of other nodes in the absence of input.
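The single-node computation described above can be sketched in a few lines of Python. This is only an illustration under my own assumptions: the function names (`net_input`, `logistic`, `node_output`) are labels I made up, not anything from Rethinking Innateness, and I assume the logistic function as the response function and a bias node with a fixed activation of 1.0.

```python
import math


def logistic(x):
    """Logistic (sigmoid) response: squashes any real number into (0, 1).

    Written in two branches so that large negative inputs do not overflow.
    """
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def net_input(activations, weights, bias_weight=0.0):
    """Sum of activation * weight over all incoming connections.

    The bias node is assumed to be always on (activation 1.0),
    so it contributes exactly its own weight.
    """
    weighted_sum = sum(a * w for a, w in zip(activations, weights))
    return weighted_sum + 1.0 * bias_weight


def node_output(activations, weights, bias_weight=0.0):
    """Activation the node sends on: response function applied to the net input."""
    return logistic(net_input(activations, weights, bias_weight))


# The worked example from the text: one input of 0.5 over a weight of -2.0.
print(net_input([0.5], [-2.0]))   # -1.0, an inhibitory signal
print(logistic(0.0))              # 0.5, the midpoint of the curve

# With no input at all, the bias weight alone sets the node's default behavior.
print(node_output([0.0], [1.0], bias_weight=-3.0) < 0.5)   # True: off by default
print(node_output([0.0], [1.0], bias_weight=3.0) > 0.5)    # True: on by default
```

The last two lines show why bias nodes are handy: with zero input, a negative bias weight leaves the node mostly off and a positive one leaves it mostly on.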

In this paper, Frank Jackson introduces himself as a ‘qualia freak’. He argues that certain sensations and experiences cannot be accounted for by a physicalist approach. A physicalist approach considers only physical information to be the “correct” kind of information, yet he argues that there are things that cannot be described in terms of physical information alone, such as the smell of a rose, seeing the sky, feeling pain, etc.; therefore physicalism is false. He acknowledges that this argument has little polemical force, and he tries to fix that by elaborating on what he calls “the knowledge argument”.

In this article, Nagel draws our attention to the role of consciousness in the mind-body problem, attacking reductionist theories. He tries to show that no available concept of reduction can deal with consciousness, and goes on to explore the implications, beginning with an account of the nature of consciousness.

In their paper “Folk Psychology is Here to Stay”, Terrence Horgan and James Woodward defend Folk Psychology (FP) from the attacks of Churchland and Stich, arguing that neither of them has provided convincing reasons for doubting Folk Psychology as a method of explaining individuals. Their critique is that both attacks rest on overly stringent conceptions of how FP must relate to other, lower-level theories. They define FP as consisting of two components: theoretical principles (universal closures of conditional formulas) and an existential thesis (that our FP descriptions of people are true and that people do undergo the FP events attributed to them). Horgan and Woodward claim that Stich and Churchland argue against the existential thesis by denying the existence of beliefs, desires and intentions. Horgan and Woodward thus claim that FP provides a useful framework for prediction and genuine causal explanations.

