kodiaktau writes: Surprise. The geniuses at Microsoft and New South Wales discover that large data sets improve learning algorithms and require a responsible business practice to manage them. Really the whole paper is really quite nice and can be found here. Of particular importance is to remember that the data can and will be impacted by the filter or analyst who is interpreting it. The example given regarding Twitter data is an excellent example of reading into the data.