Follow Slashdot stories on Twitter

 



Forgot your password?
typodupeerror
×

Advanced Excel for Scientific Data Analysis 303

cgjherr writes "If the recent financial meltdown has left you wondering, 'When does exponential decay function stop?' then I have the book for you. Advanced Excel for Scientific Data Analysis is the kind of book that only comes along every twenty years. A tome so densely packed with scientific and mathematical formulas that it almost dares you to try and understand it all. A "For Dummies" book starts with a gentle introduction to the technology. This is more like a "for Mentats" book. It assumes that you know Excel very well. The first chapter alone will have you in awe as you see the author turn the lowly Excel into something that rivals Mathematica using VBA, brains, and a heaping helping of fortitude." Read on for the rest of Jack's review.
Advanced Excel for Scientific Data Analysis
author Robert de Levie
pages 700
publisher Oxford Press
rating 9
reviewer Jack Herrington
ISBN 9780195370225
summary Use Excel for high end scientific data analysis akin to Mathemetica
When I first opened this book my mouth just dropped. It had been years since I had seen a book typeset using LaTeX. But in an instant it made sense as the book is crammed packed with the kind of equations that would have been a nightmare to build with any other tools. Chapter after chapter has everything a really smart person needs to do curve fitting, statistical measures, differential equations, time-frequency analysis. But don't expect a play by play here. You will get the equations, set within a few dense paragraphs, with maybe a spreadsheet and a chart or two to show the results.

The first chapter concentrates on the getting the most out of Excel as a tool. All the chapters that follow dig into specific data analysis techniques. Chapters two, three and four are on least squares. Chapter five and six cover the analysis in the time domain including fourier transforms. Chapter seven covers differential equations. Chapter eight returns to Excel by digging in deeper into macros. Which leads into chapter nine, where we dig deeper into basic mathematical operations. Chapter ten covers matrix operations. And chapter eleven wraps it all up by giving you some spreadsheet best practices.

In University style there are also some exercises that you can do along the way if you want to tweak your brain pan a little more. To amuse myself I tried a few and I believe the book would have assessed my attempts 'wanting' if it had a voice to tell me.

Where most books like this would have several authors this book has just one; Roberte de Levie. This means that the tone, style and quality of the book is consistent throughout. A fact that you will come to appreciate as the book wades in ever increasingly deep data analysis concepts as the chapters roll on.

Though I would have preferred the book to have code samples in C#, I understand that the language of Excel is VBA and I guess I have to live with that. Thankfully VBA has come a long way and if you so inclined it would likely be easy to translate the code into C#, Java, or whatever else you like.

The fact that one person wrote the book left me wondering, "Who is this guy?" In my minds eye I kinda of figured he would look like one of those pulsing brain guys from Star Trek. Turns out he is a professor at Bowdoin College. And his fields of study include ionic equilibria, electrochemical kinetics, electrochemical oscillators, stochastic processes, and a whole lot more stuff that almost seems made up to sound impressive.

When this book isn't serving as an amazing reference for both Excel, scientific problem solving, or just insane equations it serves other purposes as well. It's a handy portable IQ test, as the count of pages you can grind through in one sitting, plus 90, is roughly your intelligence quotient. And if you fail at that you can always put a copy of the book, along with the Orange Bible, under your pillow and try to osmose your way to becoming the Kwisatz Haderach.

In all seriousness, this is a great book. It represents the kind of in-depth work and research we used to see in books that came out twenty years ago. Robert is to be applauded for his work. This is an excellent resource for anyone looking to do scientific data analysis but who was unaware of the powerful capabilities that Excel provides that is likely waiting just one Startup menu click away.

The book is not without fault. I would have preferred that it had been in color, or at least have one color section to show some of the more impressive visualizations that I'm sure would look great in color. In addition the index is silly short for a book that clocks in at 700 pages. But those are only minor quibbles for what is all-in-all an amazing piece of work.

You can purchase Advanced Excel for Scientific Data Analysis from amazon.com. Slashdot welcomes readers' book reviews -- to see your own review here, read the book review guidelines, then visit the submission page.
This discussion has been archived. No new comments can be posted.

Advanced Excel for Scientific Data Analysis

Comments Filter:
  • Wrong Tool (Score:5, Interesting)

    by Hatta ( 162192 ) on Wednesday October 01, 2008 @01:14PM (#25221253) Journal

    Talk about the wrong tool for the job. If you need to do any sort of serious data analysis, use R, not Excel.

  • Re:incongruous (Score:3, Interesting)

    by fiannaFailMan ( 702447 ) on Wednesday October 01, 2008 @01:15PM (#25221271) Journal

    There's something hard to reconcile about the reviewer's obvious awe and the fact that the book was written by someone who thinks doing meaningful scientific data analysis in Excel is a good idea.

    Care to expand on why you think you can't do 'meaningful scientific data analysis in Excel?' Are you one of these people who 'reviews' books without actually reading them?

  • by melted ( 227442 ) on Wednesday October 01, 2008 @01:17PM (#25221291) Homepage

    But it seems you have not seen Word 2007 equation editor. It's close, in both capabilities, and output quality, to LaTeX. Too bad the rest of Word sucks ass.

    >> that would have been a nightmare to build with any other tools

  • Re:incongruous (Score:3, Interesting)

    by johnny cashed ( 590023 ) on Wednesday October 01, 2008 @01:19PM (#25221315) Homepage
    Last time I checked (and it has been a while), Excel has computational bugs in it which can result in valid data in -> garbage out. In my mind, 'meaningful scientific data analysis' involves accurate computation. But maybe I'm just a dreamer.
  • Re:incongruous (Score:5, Interesting)

    by Mr. Underbridge ( 666784 ) on Wednesday October 01, 2008 @01:34PM (#25221547)

    Why isn't it a good idea, and does this apply equally to OpenOffice?

    It's OK for simple stuff, but trying to do something like implementing a loop in a spreadsheet. And yes, the criticism applies to OO as well.

    There are very good packages out there - some open source - for doing scientific analysis. I'd recommend R or Octave (a matlab clone), personally. Also, Python + NumPy + SciPy + Pylab is great for doing Matlab-like things, and it's all free as well.

  • Excel does not Excel (Score:4, Interesting)

    by systemeng ( 998953 ) on Wednesday October 01, 2008 @01:38PM (#25221623)
    When I worked in the semiconductor industry in the late 90's, Excel nearly cost us several hundred grand. It had "helpfully" autocorrected a code in the documentation for a mask used in one of our clock buffer chip products. Had the engineers not caught this mistake in the printout, the fab of the chip would have been botched. The engineers were mad as I recall because they would change the code and Excel would change it back. If you can't prove what your tool is doing, you don't get to use it is what they taught me in engineering school.
  • Re:incongruous (Score:2, Interesting)

    by Murple the Purple ( 130813 ) on Wednesday October 01, 2008 @01:39PM (#25221637)

    Beating on Excel is easy. Do you check your math utility? Or do you just assume your compiler/vendor does everything correctly?

  • Re:incongruous (Score:3, Interesting)

    by cyphercell ( 843398 ) on Wednesday October 01, 2008 @01:44PM (#25221751) Homepage Journal
    I use spreadsheets to prototype and document ideas. Once I had thought a full blown reference implementation in a spreadsheet would be a good idea (basically, more time was spent on the reference than the final project). Fact is spreadsheets are good for one-off problems, or simple problems that gather lots of data (ex. accounting, statistics). When you have a heavy data model, heavy logic model, and complex results, spreadsheets are ultimate FAIL. They are good for developing algorithms quickly, good for testing a piece of data and figuring out what you want it to look like in a database, but they do not scale well for many types of projects. My rule of thumb is that any given portion of a successful spreadsheet should be limited to about five. Five inputs, five outputs, or five calculations. So you can have five inputs, 20 calculations. 20 inputs and five calculations etc. Otherwise the debugging process will consume your project.
  • by solafide ( 845228 ) on Wednesday October 01, 2008 @01:57PM (#25221971) Homepage
    Output quality: does it have automatic equation numbering? An equivalent for BibTeX? Intelligent modifiable Table of Contents? Ability to replace a math symbol wherever used with another? Change aforementioned numerations at will?
  • by mangu ( 126918 ) on Wednesday October 01, 2008 @02:09PM (#25222169)

    Python for scientific analysis [scipy.org],

    Python is the solution I recommend for everyone who looks for tips on advanced Excel uses. Excel is OK if you just want some quick and dirty solution for a small problem, but if you have to go to the trouble of reading a book, Excel is clearly not the best solution.

    For scientists and engineers who need something more than what Excel (and possibly Matlab) offers, I recommend starting with either A Byte of Python [swaroopch.com] or Dive Into Python [diveintopython.org].

  • by MightyYar ( 622222 ) on Wednesday October 01, 2008 @02:10PM (#25222175)

    This is one reason the VB scripting turns out to be highly useful.

    I recently had to do a project in VBA/Excel after years away from it, and it made me want to dig my eyes out with a spoon.

    Don't ever write custom functions... ever. You'll thank me when you don't have to worry about whether or not they silently fail.

    And once I had my whole spreadsheet corrupt for some reason... had to open on a Mac and re-save it. Then it worked fine on the PC again! Aye.

  • by Colonel Korn ( 1258968 ) on Wednesday October 01, 2008 @02:17PM (#25222281)

    As a graduate student in physics, I have never seen a serious researcher use excel for data analysis.

    Nor for that matter, is it common to see a scientist using windows for the OS--all linux and mac OS.

    This is akin to writing a book about publishing scientific papers with office. Instead, learn LaTex...

    The only group of people who use excel for large data analysis are financial types and MBAs. Need I remind you how that turned out?

    Oh, so that's why at APS meetings I've seen maybe 5 presentations, ever (out of at the very least 500) given on something other than Windows (Ubuntu once, MacOS the other few times), despite the fact that nearly every speaker uses his or her own laptop for the presentation. Wait...my data seems to indicate that physicists hardly ever use Mac or Linux at all!

    If we're just talking about computers controlling instruments, then I see about 90% Windows, 10% Linux if the instrument costs less than a million dollars, and 90% Linux, 10% Windows if it costs more than a million (and there's a transition zone in there somewhere, but my experience is mostly with $100k and $100MM equipment). CERN, for instance, is mostly controlled by non-Windows, but a lot of the laptops lying around there, presumably because people are going to do analysis on them, are Windows. Argonne is a healthy mix of Linux for controls and Windows for analysis. At MIT and Princeton, a couple professors use a Mac, a couple use Linux, most use Windows for personal use. Nearly everyone running stat mech simulations uses Linux for those, but a lot of those people have a personal Linux box.

    I'm not going to say that it's good, but I will say that it's true.

  • I must disagree (Score:3, Interesting)

    by dj245 ( 732906 ) on Wednesday October 01, 2008 @02:31PM (#25222463) Homepage
    In my experience, excel is also rarely used for anything serious outside of US. Maybe its an indictment of how lazy, slow witted and easily misled our pool of talent is becoming.

    I recently spent some time in Japan in a design group for a large Japanese company. I was showed the massive spreadsheet used to calculate power plant capacity and consumption. I almost cried. The whole sheet was based upon one large circular reference. Nobody understood it and it referenced steam tables through a plugin but didn't show the output of these calculations.

    The US is not alone in using Excel for things it wasn't meant to do. Now please excuse me while I run a couple monte carlo simulations in Excel.
  • by ChienAndalu ( 1293930 ) on Wednesday October 01, 2008 @02:36PM (#25222545)

    In my experience, excel is also rarely used for anything serious outside of US. Maybe its an indictment of how lazy, slow witted and easily misled our pool of talent is becoming.

    I have experienced whole companies running on Excel spreadsheets - they use it for accounting, instead of a database, and, you guessed it, scientific data analysis. The company I'm talking about is in the power supply industry.

  • by AlpineR ( 32307 ) <wagnerr@umich.edu> on Wednesday October 01, 2008 @04:40PM (#25224433) Homepage

    Maybe some of the people yelling about how Excel is the wrong tool can give some advice for my scientific data analysis and visualization needs.

    I have simulations (written in C++ and Python) that spit out tab delimited data files. I then need to analyze that data, doing things like linear regression on subsets of the data and calculations to transform the raw data into something else for plotting.

    I have a Mac (with Windows XP in Parallels), I am not a student, and I don't have much budget ($500) for software. Currently I use a Mac program called Plot which is a little buggy and incomplete but has some nice plotting abilities. When I need a spreadsheet I use Apple's Numbers, but that seems sorely limited in abilities. What's a better tool for this job?

You knew the job was dangerous when you took it, Fred. -- Superchicken

Working...