Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Slashdot Log In

Log In

Create Account  |  Retrieve Password

Reading Guide To AI Design & Neural Networks?

Posted by kdawson on Tue Dec 02, 2008 05:49 AM
from the open-the-library-doors-hal dept.
Raistlin84 writes "I'm a PhD student in theoretical physics who's recently gotten quite interested in AI design. During my high school days, I spent most of my spare time coding various stuff, so I have a good working knowledge of some application programming languages (C/C++, Pascal/Delphi, Assembler) and how a computer works internally. Recently, I was given the book On Intelligence, where Jeff Hawkins describes numerous interesting ideas on how one would actually design a brain. As I have no formal background in computer science, I would like to broaden my knowledge in the direction of neural networks, pattern recognition, etc., but don't really know where to start reading. Due to my background, I figure that the 'abstract' theory would be mostly suited for me, so I would like to ask for a few book suggestions or other directions."
+ -
story

Related Stories

This discussion has been archived. No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More
Loading... please wait.
  • PDP (Score:5, Informative)

    by kahizonaki (1226692) on Tuesday December 02 2008, @05:56AM (#25957457) Homepage
    Parallel Distributed Processing (both books) by Rumelhart, McClelland, and the PDP research group, 1986. "THE" classic neural network resource--and still somewhat relevant.
    • Re: (Score:3, Informative)

      For a somewhat more up-to-date and maybe complementary book, I advise you Computational Explorations in Cognitive Neuroscience by Randall C. O'Reilly and Yuko Munakata (The MIT Press). The simulator intends to extend and replace PDP++ and is quite pleasant to use. It is on http://grey.colorado.edu/emergent/index.php/Main_Page [colorado.edu]
    • Re:PDP (Score:4, Interesting)

      by babbs (1403837) on Tuesday December 02 2008, @06:52AM (#25957757)
      I prefer James Anderson's "An Introduction to Neural Networks". I think it is better suited for someone coming from the physical, mathematical, or neuro- sciences.
      • Re: (Score:2, Interesting)

        The great thing about the PDP books is that they make almost NO assumption as to what the reader's background is. There's no code, a bunch of pictures, and something in there for everyone. Each chapter is written with a specific goal in mind, and by leaders in the field--there are chapters on the mathematics of the networks, the dynamical properties of them (i.e. how they can be thought of as boltzmann's machines), as well as lots of ideas for applications and specific studies of how real experiments worked
    • Re: (Score:3, Informative)

      Cosma Shalizi is also a Physicist. I don't think he is actually doing research in machine learning or AI but he likes to read a lot and he tends to have fairly extensive reading lists.

      Machine Learning [umich.edu]

      AI [umich.edu]

      You may also want to get familiar with Geoffrey Hinton's current work in neural networks [youtube.com].
  • by Anonymous Coward on Tuesday December 02 2008, @06:00AM (#25957483)

    Due to the possibility of a robot army rising up, I refuse to help.

  • AIMA (Score:5, Informative)

    by omuls are tasty (1321759) on Tuesday December 02 2008, @06:01AM (#25957487)
    Artificial Intelligence: A Modern Approach by Rusell and Norvig is more or less the standard AI textbook and the book I'd suggest to get an overview of AI and its different methodologies. Mind you, it's over 1000 pages, but a very interesting read.
    • Re: (Score:3, Interesting)

      I must second that, Russel and Norvig book is one of the most important books.

      I would also recommend:

      Artificial Intelligence: A new Synthesis [google.com] from Nills J. Nilson [wikipedia.org], who is considered one of the founders of A.I.

    • If it's the book I think it is, it gives a good overview of 'traditional' AI (rules, logic systems, planning) but not really anything about 'soft' approaches like neural nets. I found it rather disappointing. Read any of the classic Rob Brooks papers. If nothing else, they are certainly inspiring - they always make me want to build robots.
      • Re: (Score:3, Informative)

        Nope. Its got neural networks. (section 20.5) Try walking into any cog sci / AI faculty office without seeing this book. Don't let anyone tell you it's dry (its got math! gasp!). It's accessible and thorough.

        Also:
        Statistics!

        ...learn it, love it. Thats mostly what AI is under all the gloss. That sound is a thousand Cog Sci students screaming in terror, ignore them.
    • Re: (Score:2, Informative)

      by Anonymous Coward

      I'd like to add to this. AIMA gives you a very broad and moderately deep overview of the state of AI ten years ago. As such, it is a truly excellent introduction introduction to the subject.

      If you want a more recent, much more thorough and narrow introduction to neural networks in particular and machine learning in general, I'd recommend Chris Bishop's book: Pattern Recognition and Machine Learning (http://research.microsoft.com/~cmbishop/prml/), which focuses on learning rather than searching and planning.

    • Re: (Score:3, Informative)

      Agreed. All the basic knowledge about the field is in this book. Part of these are available freely online. You can be judge : http://aima.cs.berkeley.edu/ [berkeley.edu]
    • Re:AIMA (Score:4, Informative)

      by six11 (579) <johnsoggNO@SPAMcmu.edu> on Tuesday December 02 2008, @08:42AM (#25958453) Homepage

      Also seconded. Russel & Norvig. Artificial Intelligence: A Modern Approach [berkeley.edu] is a good book, well illustrated, and generally lacks the undecipherable academia-speak that pervades lots of AI literature.

      Here's an article that was particularly influential on me and some of my friends: Brooks, R. 1991. Intelligence Without Reason. MIT AI Memo num 1292 [mit.edu]. Even though it is 'just' a tech report, it is frequently cited. He had another one, Intelligence without Representation, which is also good.

      Somebody else mentioned the McClelland and Rumelhart PDP (neural networks) book, and it is also still quite good in spite of its age.

      The interesting thing about AI (to me) is the funny mix of domain expertise. You have philosophers, sociologists, cognitive scientists, psychologists, computer scientists, and mathematicians. That's not a complete list---I'm in human-computer interaction and design research.

      But because of the motley crew of domains you have a hundred people speaking a hundred different dialects. Some people put everything in really mathy terms, and their journal articles look (to me) like they are written in Klingon. Then you have others who write in beautiful prose but don't give any specifics on how to implement things. Still others express everything in code or predicate logic.

      The oldest school of AI holds that you can reduce intelligence to a series of rules that can operate on any input and make some deterministic and intelligent sense of it. That works to a degree, but it falls apart at some point partly because of the computational complexity (e.g. the algorithm works if you have a million years to wait for the answer). Another reason it falls apart is because there are some kinds of intelligence that can't be reduced to rational computation (e.g. I love my wife because of that thing she does...).

      There's a newer kind of AI that is based on having relatively simple computational structures that eat lots of data, "learn" rules based on that data, and are capable of giving fairly convincing illusions of smartness when given additional data from the wild. Neural nets fall into this category.

      A third kind of AI brings these two schools together in the belief that there are fundamental computational structures like Bayesian Networks that can model intelligence* but those structures by themselves are insufficient and must be able to adapt based on exposure to real data. So instead of having a static BN whose topology is defined at the start and remains the same throughout the life of the robot, we can have a dynamic BN whose structure changes based on the environment.

      I remember reading a recent article by John McCarthy arguing that all this statistical business is hogwash, and that the old school positivist, reductionist approach will eventually win. He's a smart guy, inventor of LISP and a Turing Award recipient. It seems his view is in the minority, but I'm not one to say he's wrong. However, my inclination is that the third hybrid group is probably going to be the one to make the most progress in the years to come.

      The reason for my preference to the hybrid school could probably be best explained by Lucy Suchman's Plans and Situated Actions [wikipedia.org]. I can't really do her thesis justice in a few sentences, but the short version of her argument is that there are plans (the sequence of steps that we think we are about to carry out before performing some task) and actions, which is the set of things we actually do. In my mind, a plan corresponds roughly with the underlying computational mechanism, but the actions correspond with how that mechanism executes and what happens when the underlying structure is insufficient, wrong, misleading, or fails.

      Hope that helps.

      Gabe

      * None of this is to say that computational structures that we implement with software/hardware ar

  • AI != design brain (Score:5, Insightful)

    by Kupfernigk (1190345) on Tuesday December 02 2008, @06:03AM (#25957499)
    There is a very big difference between AI - which is based on guesses about how "intelligence" works, and studies of brain function. I'm going to make a totally unjustified sweeping generalisation and suggest that one reason that AI has generally been a failure is because we have had quite wrong ideas about how the brain actually works. That's to say, the focus has been on how the brain seems to be like a distributed computer (neurons and the axons that relay their output) because up till now nobody has really understood how the brain stores and organises memory in parallel- which seems to be the key to it all, and is all about the software.

    So my feeling is that the first people really to get anywhere with AI will either work for Google or be the neurobiologists who finally crack what is actually going on in there. If I wasn't close to retirement, and wanted to build a career in AI, I'd be looking at how mapreduce works, and the work being done building on that, rather than robotics. I'd also be looking as seriously parallel processing.

    So my initial suggestion is nothing to do with conventional AI at all - look at Programming Erlang, and anything you can find about how Google does its stuff.

    • Re: (Score:2, Funny)

      by Anonymous Coward

      The human brain does not use anything that even remotely resembles software. The brain is hardwired.

      Software in brains... that a paddlin'

      • The universe is software, the brain workings are just a tiny side-effect, but can still be considered software.

        From universe.c:

        int main()
        {
              [...]
              return 42;
        }

    • Re: (Score:3, Informative)

      http://www.databasecolumn.com/2008/01/mapreduce-a-major-step-back.html [databasecolumn.com]

      As both educators and researchers, we are amazed at the hype that the MapReduce proponents have spread about how it represents a paradigm shift in the development of scalable, data-intensive applications. MapReduce may be a good idea for writing certain types of general-purpose computations, but to the database community, it is:

      1. A giant step backward in the programming paradigm for large-scale data intensive applic

    • by Viol8 (599362) on Tuesday December 02 2008, @07:10AM (#25957853)

      .. as applied to normal computers. In this case its simply speeded up serial computation - ie the algorithm could be run serially so Programming Erlang is irrelevant. With the brain , parallel computation is *vital* to how it works - it couldn't work serially - some things MUST happen at the same time - eg different inputs to the same neuron, so studying parallel computation in ordinary computers is a complete waste of time if you want to learn how biological brains work. Its comparing apples and oranges.

      • You've just reinforced my point by not understanding how the brain works. Neuron inputs and outputs are known to be pulse coded, and as you would imagine with chemical based transmitters, the pulse frequency is low (it evolved, it didn't get designed from first principles!) So it is perfectly possible to represent a neuron by a time-slicing parallel system, because it is extremely low bandwidth, and its output varies very slowly, but is NEVER DC. As a result, the output of the neuron does not need to be con
        • Re: (Score:3, Interesting)

          And you've missed my point. Parallel computing on a von neumann computer raises issues of race conditions, deadlocking etc. These are the sort of things you have to worry about with parallel silicon systems. None of these issues apply to brains (as far as we know) so what is the use in learning about them? You're talking about simulating a neural system which is not the same thing - a simulation of anything can be done serially given enough time, never mind in parallel. But it will never be an exact represe

    • You may be right, but it's never been a major goal of AI researchers to duplicate how the brain works. AI has been steadfastly interested in building machines that do what the brain does, but not how the brain does it. So while I'm sure that many AI researchers keep an eye on these things, I don't think that "wrong ideas about how the brain actually works" is the problem, since ideas about how the brain works have relatively little influence on AI.

      As an aside, MapReduce is not that complicated, nor is it

    • There is a very big difference between AI - which is based on guesses about how "intelligence" works, and studies of brain function. I'm going to make a totally unjustified sweeping generalisation and suggest that one reason that AI has generally been a failure is because we have had quite wrong ideas about how the brain actually works. That's to say, the focus has been on how the brain seems to be like a distributed computer (neurons and the axons that relay their output) because up till now nobody has really understood how the brain stores and organises memory in parallel- which seems to be the key to it all, and is all about the software.

      A lot of the brain's function is architectural, rather than merely a matter of 'software'.

      I don't know if you can say "AI has generally been a failure", but traditional AI has actually been guided by the non-biological notion of a "physical symbol system" rather than by conceptions about how the brain actually works. And even in the biologically inspired side of the field, only the most ignorant would think that artificial neural networks have much in common with the brain.

      The field of AI, with few execpti

    • If I wasn't close to retirement, and wanted to build a career in AI, I'd be looking at how mapreduce works...

      Why not do this stuff during your retirement? What else are you going to do with the time between now and your death?

    • "There is a very big difference between AI - which is based on guesses about how "intelligence" works, and studies of brain function."

      Yes, there most certainly is. AI is a far broader topic than study of the brain for starters, it extends to the study of swarm intelligence and emergent properties in evolution for example. The field of AI generally uses nature as inspiration and builds useful techniques from there. The human brain is but one of these items that has been studied for inspiration and has led to

    • by TapeCutter (624760) on Tuesday December 02 2008, @08:20AM (#25958263) Journal

      "I'd also be looking as seriously parallel processing."

      If you haven't seen this [bluebrain.epfl.ch] it might interest you. Note that it's a simulation for use in studying the physiology of the mammalian brain, not an AI experiment. Any ghost in the machine would have to emerge by itself in pretty much the same way mind emerges from brain function.

  • Heard of AGI? (Score:3, Informative)

    by QuantumG (50515) * <qg@biodome.org> on Tuesday December 02 2008, @06:04AM (#25957503) Homepage Journal

    http://www.opencog.org/wiki/OpenCogPrime:WikiBook [opencog.org]

    Some interesting stuff.

    • Only philosophical bullshit. AI is making way too many simplifications in how the brain works, but this book contains even less material. It makes sweeping conclusions based on almost no data.

      It is very, very probably flat out wrong.

      • "this book" .. by that do you mean "On Intelligence".. in which case I agree, but umm.. maybe you weren't trying to reply to me.

        Slashdot's comment system is fucked, I recommend you switch to "classic" view as soon as possible.

        It's a lot like Vista......

  • Russell & Norvig (Score:5, Interesting)

    by Gazzonyx (982402) on Tuesday December 02 2008, @06:05AM (#25957517)
    In my AI class, last semester, we used Stuart Russell and Peter Norvig's Artificial Intelligence A Modern Approach, 2nd Ed.. It's fairly dry, but good for theory nonetheless. If you're a physics geek, it should be right up your alley; they approach everything from a mathematical angle and then have a bit of commentary on the theory, but never seem to get to the practical uses for the theory.

    If you're in the US, send me an email and I'll send you my copy. They charge an arm and a leg for these books and then buy them back for 1/10 the price. I usually don't even bother selling them back.
  • by Anonymous Coward on Tuesday December 02 2008, @06:05AM (#25957521)

    Following Books are must have for machine learning enthusiasts:

    Christopher Bishop
    http://research.microsoft.com/~cmbishop/prml/

    Richard Duda
    http://rii.ricoh.com/~stork/DHS.html

    There you will get an insight how machine learning methods (like neural networks, SVM, boosting, bayes classificator) work

    for general AI (not so much in direction of statistical learning as the books above, but more into higher level learning like inference rules) I can recommend published work done by

    Drew McDermott
    http://cs-www.cs.yale.edu/homes/dvm/

    • Re: (Score:2, Informative)

      +1 for the book by Bishop (don't know about the others). In addition, have a look at Information Theory by David Mackay which I found stunningly good. There is a free on-line version available, but you should buy it: http://www.inference.phy.cam.ac.uk/itprnn/book.html [cam.ac.uk]
    • Re: (Score:3, Informative)

      I'll second Duda and Hart, though I guess it's Duda, Hart, and Stork now.

      It's probably the most widely used pattern classification book that I've seen, and covers most of the techniques that you'll find. The coverage of neural networks is limited to Backprop though, so you'll need to look elsewhere for more in-depth on those.

  • by MosesJones (55544) on Tuesday December 02 2008, @06:07AM (#25957537) Homepage

    Question: Where can I find a Reading Guide to AI Design & Neural Networks

    Answer: Why do you want to AI design & Neural Networks?

    Question: Because I want to learn.

    Answer: Will learn AI design & neural networks make you happy

    Question: Yes

    There you go. Now the question is whether Slashdot beats the Turing test on this one.

  • Adding another point to your feature space, I'll put in a plug for a technique called Stochastic Discrimination. It's not well known but is quite good at pattern recognition and avoids a lot of the weaknesses of neural networks such as over-training. Since it's not so well known, you have to go to the few academic papers to read up on it. Or visit the website http://kappa.math.buffalo.edu/ [buffalo.edu]. But it's got a very solid mathematical foundation (developed by a former math professor if mine) and isn't as "hacky"

  • On Neural Nets at least.. The only text book that I can think of offhand which is decent is Duda, Hart and Stork [ricoh.com]

    Hawkins, like many others, has ripped off many of his ideas from Steve Grossberg [bu.edu] (in this case, the ART model). Although he's not very easy to read, especially if you start much earlier than say, Ellias and Grossberg, 1975. You should also check out the work of people like Jack Cowan [uchicago.edu], Rajesh Rao [washington.edu], Christof Koch [caltech.edu], Tom Poggio [mit.edu], David McLaughlin [nyu.edu], Bard Ermentrout [pitt.edu], among many, many others. I think

  • by Gearoid_Murphy (976819) on Tuesday December 02 2008, @06:41AM (#25957697)
    be careful before committing to a large scale neural network project. Aside from the intuition that the brain is a massively interconnected network, no one is really sure what aspect of neural network functionality is necessary for intelligence. My advice to you is to spend time coming to terms with the abstract nature of intelligence rather than coding up elaborate projects. This link [uh.edu] is a philosophical discussion on directed behaviour which I found quite interesting (if a bit vague, which is the mark of philosophy). Also, as you become familiar with the literature, you will see many examples of algorithms which claim to model certain aspects of intelligence. These algorithms work because they have a reliable and unambiguous artificial environment from which they draw their sensory information. The problem with practical artificial intelligence is that the real world is extremely ambiguous and noisy (in the signal sense). Therefore the problem is not creating an algorithm which can emulate intelligent behaviour but solving the problem of taking the empirical information of the sensory input and producing from that data a reliable abstract representation which is easily processed by the AI algorithms (whatever they may be, neural networks, genetic programming, decision trees etc) Good luck.
    • My advice to you is to spend time coming to terms with the abstract nature of intelligence rather than coding up elaborate projects. This link is a philosophical discussion on directed behaviour which I found quite interesting (if a bit vague, which is the mark of philosophy).

      I wouldn't recommend for anyone to waste their time reading philosophers' opinions about AI research. Might as well read a used car salesman's treatise on automotive design.

      At least used car salesmen actually have cars to sell...

  • Christoph Adami's Introduction to Artificial Life. He's a closet physicist and it shows. Do at least read the TOC before you dismiss it.
  • Cognitive Psychology (Score:3, Interesting)

    by tgv (254536) on Tuesday December 02 2008, @06:58AM (#25957783) Journal

    I would strongly recommend starting with a text book on Cognitive Psychology, or reading it in parallel. AI tends to overlook the fact that intelligence is a human trait, not the most efficient algorithm for solving a logic puzzle. Anderson's book can be recommended: http://bcs.worthpublishers.com/anderson6e/default.asp?s=&n=&i=&v=&o=&ns=0&uid=0&rau=0 [worthpublishers.com].

  • These might seem a little old, but are still a couple of my favorites:
    Reinforcement Learning by Sutton & Barto [amazon.com]
    Machine Learning by Tom Mitchell [amazon.com]
  • you said you don't have any formal knowledge on CS. then don't think about neural networks yet, you have to build from the ground up. you need to take algorithms (doesn't matter if you're a programmer) and language theory (languages, regex, ... turing machines) at the very least. after that you can start experimenting with AI.

  • Haven't we had a number of stories recently questioning the validity of CS degrees with lots of (usually sys admins) waffling on about how degrees are a waste of time and how anyone can pick up computer skills? Ok all you "I don't need no degree , I can do it all on my own" , show us how you've all conquered the world of AI where so many others doing BScs, MSCs and PHds degrees have failed?

    What? Is that the sound of silence I can hear?

  • I think 'neural gas' is the area of neural networks research inspired by statistical physics. Don't know if there are any books about it, but you may find a chapter in an ANN textbook, and can certainly find papers vial Google.

    Contrary to what others are suggesting, you probably aren't looking for the Russell & Norvig book, which is in fact good and almost qualifies as "the standard AI textbook". I counterrecommend it only because it's about Good Old Fashioned AI, which is interesting stuff, but compl

  • We seem to be reading a lot of Skynet related posts these days.

    I better get the drapes for the bunker finished!
  • Without knowing the details about where you stand with things, my advice would be to concentrate on finishing your PhD first. There's no limit to the number of distractions during that final push, but big new areas of study are usually a bad idea.

    Assuming that's not an issue (nor or eventually), as a beginner in the field, you don't need to start with articles, there are books that will help for a while. But you may find quickly that you need to place yourself in one of two camps: people who want to devel

  • that is, its complete bullshit, but as a dream forever out of reach, it drives a lot of important and accidental discoveries, like databases or optical character recognition

    so we need lots of bright minds working in AI. none of them will ever actually achieve the goal. but along the way, they will spin off fantastic new technology

    so i applaud your focus, but you should be aware that anything you do of any import will be orthogonal to your goals

  • Hawkins is misguided (Score:3, Interesting)

    by joeyblades (785896) on Tuesday December 02 2008, @09:30AM (#25959003)

    I read "On Intelligence", too. While Hawkins has some interesting thoughts, I was less than inspired. Probably because I read John Searle's "Rediscovery of the Mind" first. Actually, most of Searle's work, as well as the work of Roger Penrose has led me to the conclusion that the Strong AI tract is missing the boat. The Strong AI proponents, like Hawkins, believe that if we build a sufficiently complex artificial neural network we will necessarily get intelligence. Searle and Penrose have very convincing arguments to suggest that this is not the right path to artificial intelligence.

    Realistically, how could one build an artificial brain without first understanding how the real one works? And I don't mean how neural networks function; I mean how the configuration of neural networks in the brain (and whatever other relevant structures and processes that might be necessary) accomplish the feat of intelligence. We still do not have a scientific theory for what causes intelligence. Without that, anything we build will just be a bigger artificial neural network.

    Also, the thing that Strong AI'ers always seem to forget... An artificial neural net only exhibits intelligence by virtue of some human brain that interprets the inputs and outputs of the system to decide whether the results match expectation (i.e. it takes "real" intelligence to determine when artificial intelligence has occured). Contrast this with the way your brain works and how you recognize intelligence from within, then you'll realize just how far from producing artificial brains we really are...

    I'm not saying that artificial intelligence is impossible, and neither is Searle (Penrose is still on the fence). I'm just saying, don't think you can slap a bunch of artificial neurons together and expect intelligence to happen.

    • They are. Ever heard of having genetic algorithms design neural-network controlled players ?

      That's one non-interactive AI designing another interactive AI in order to improve a certain function.

      And if your criterium is actual reproduction, let's keep in mind that no single humans are capable of even making a C64-level computer from scratch. Even a simple calculator would be pushing it too far for all but a few engineers.

      The only way humans are capable of "improving their own design" according to darwin is t