Firefox Plugin Annodex For Searching Audio, Video 129
loser in front of a computer writes "ZDNet Australia reports that 'Australia's CSIRO research organisation has developed a Firefox plugin named Annodex that allows browsing through time-continuous media such as audio and video in the same way that HTML allows browsing through text.' I've just checked Annodex out and it's very cool. The sample video from the Perl conference is way funny too." The catch is, the media to be searched has to be prepped first.
astonishing (Score:5, Funny)
Re:astonishing (Score:4, Funny)
Re:astonishing (Score:1)
Of course (Score:4, Insightful)
Isn't that obvious? It's too much to expect it to be able to search video without knowing what it is.
Re:Of course (Score:3, Insightful)
Re:Of course (Score:5, Insightful)
Re:Of course (Score:2)
Re:Of course (Score:3, Interesting)
Re:Of course (Score:1)
I'll do this, who will take drama? (Score:2)
Re:Of course (Score:1)
Re:Of course (Score:1)
upside (Score:2)
Not likely at currently then (Score:5, Insightful)
Re:Not likely at currently then (Score:2, Insightful)
Re:Not likely at currently then (Score:5, Insightful)
Currently trying to get a computer to understand something in pictures, even less in motion pictures is very inaccurate and extremly prosessor intensive, unless one uses a really small subset(like fingerprint recognition)
Re:Not likely at currently then (Score:1)
not as far off as you might think.. (Score:2)
Already being done to classify music... (Score:1)
Re:Not likely at currently then (Score:1)
Re:Not likely at currently then (Score:4, Interesting)
That sounds like a doctorate in the making... I'd anticipate an 80% hit rate in genre classification (at least) within 6 months of research, just given those sorts of categories. It's just image recogition and classification, really, but with a fscking huge dataset (which is a good thing).
Re:Not likely at currently then (Score:2)
Re:Not likely at currently then (Score:2)
Even voice recognition is still pretty bad. Say something with an accent, and you can forget about proper recognition, unless you spend huge effort tweaking the recognizer for each differe
AI approach (Score:2)
Then there's other things that can be examined: the number of colours could distinguish anime from film, the amount of audio compression / background noise etc.
I imagine the best way to do it would be using a neural net approach. Get someone to sit with the computer and play human-categorised content to it al
Re:AI approach (Score:2)
Re:AI approach (Score:2)
So yah, I agree with you in general about AI research, but I think that the research centred around classification is the most promising because it's the least ambitious and the most mathematically rig
Re:Not likely at currently then (Score:2)
Re:Not likely at currently then (Score:1)
Can I bundle Vorbis and another media type (like text lyrics or pictures) in the same file?
Yes. The Ogg container format was designed to allow different media types to be multiplexed together; Theora will be mixed with Vorbis audio in an Ogg container to encode movies.
--http://vorbis.com/faq.psp#container/ [vorbis.com]
Does that mean Ogg too can do what you're suggesting? Probably needs some work still, though.
Re:Not likely at currently then (Score:2)
Re:Not likely at currently then (Score:2)
MirrorDot (Score:3, Insightful)
Read more... (Score:5, Insightful)
A cool application, nonetheless.
Re:Read more... (Score:4, Interesting)
I really wish the Anime community saw it as a viable format rather than using XVid and DivX for everything. OGG is beautiful.
Re:Read more... (Score:5, Insightful)
Now, I might be wrong, but chances are that what you got instead of Ogg Theora compressed files were Ogg Media Files [faireal.net] (.ogm).
OGM is a container format for audio/video that supports multiple subtitles (just like you mentioned) and multiple audio tracks. From what I personally know, the video is usually compressed with XviD and the audio with Ogg Vorbis.
(see also Matroska [matroska.org] which does the above, and more)
Re:Read more... (Score:2, Insightful)
Re:Read more... (Score:2, Informative)
Re:Read more... (Score:1)
Sure, I too would love to see an open format like ogg hit the mainstream, but since CDRs can only hold 700 MB of data, I also want to use that space as efficient as possible. That means using the codec with the highest compression/quality ratio, which unfortunately is not free (as in speech).
Re:Read more... (Score:2)
Re:Read more... (Score:3, Interesting)
Although I guess that might present a chicken/egg situation.
Re:Read more... (Score:2)
Re:Read more... (Score:1)
Re:Read more... (Score:1)
And that is supports only ogg theora is a misnamona. The output video is an ogg container with xml packets and theora video interleaved in a format suitable for streaming. The source input video can be anything your system can play back and feed into the encoding/interleaving tools
Re:Read more... (Score:1)
I dunno (Score:3, Insightful)
Re:I dunno (Score:1)
I think one problem is that with audio, it's easy to just fade one song into another, but with video, there's not as clear an equivalent.
What we need is a machine that automatically adds "star wipe" to every conceivable frame of video. Now that would be sweet.
Re:I dunno (Score:2)
This could lead to new conventions for image language, as video-clips did to TV.
After all, fading audio was not at all natural the first time someone did it to merge different audio tracks. When music was all hand-made, there was no such thing as fading.
interactive film (Score:2, Interesting)
moreover (Score:2, Interesting)
composition tools are not yet available for the general public.
there was no bandwidth a that time ;) (Score:2, Funny)
Re:there was no bandwidth a that time ;) (Score:1)
it is US that charges all inbound traffic (Score:1)
Re:it is US that charges all inbound traffic (Score:2)
Surely... (Score:4, Informative)
The only implication here is that you could skip past part of a stream that exists as a preprepared complete file at the other end (as opposed to radio, which is incomplete and not browsable); but I bet the prepped file is significantly bigger, and the time saved skipping over a boring section would be replaced by the time required to download the extra data.
Quicktime
Re:Surely... (Score:1, Informative)
We've designed Annodex and CMML with the Interne
YAML (Yet Another Markup Language) (Score:3, Insightful)
Re:YAML (Yet Another Markup Language) (Score:1)
Re:YAML (Yet Another Markup Language) (Score:2)
Slashdotted...damn! (Score:2, Interesting)
Re:Slashdotted...damn! (Score:1, Informative)
- Andre (one of the developers)
What is the innovation here? (Score:4, Interesting)
I watched the video, but all it seems to be is a system of sectioning audio-visual files into smaller chunks, and a browser that gives access to a "table of contents" that lets the user jump directly to a section.
Is the sectioning/table-of-content-generation process automated? It seems to be manual.
I think software is already available that can partially automate the sectioning of a video. It does this by detecting scene-transitions, and then offering up the "chunks" to the user for approval and labelling. I think such software is used in DVD authoring for generating the "Jump to a Scene" DVD menu.
Re:What is the innovation here? (Score:1, Interesting)
Or you can search and get links directly into a specific position in a video. eg. With this search engine
http://labs.panopticsearch.com/search/sea r ch.cgi?c ollection=labs.cmweb
The section/TOC generation is manual. However in theory it could be automated using scene-detection and speech to text. But you can consider that as part of the original authoring process.
Re:What is the innovation here? (Score:2)
Today, I can listen to streaming audio from an online radio station with Windows media player.
These stations already section their streams into songs. Media player lets me add individual songs from
Re:What is the innovation here? (Score:2)
On page 7:
Real innovation (Score:2, Interesting)
Re:Real innovation (Score:2)
I wonder what sort of arrangement Blinkx have with content providers in order for users to view content. I wonder if they also search the closed-captions/teletext as Google Video does. (About a year ago, I also intended doing [slashdot.org] something similar as a hobby project.)
Re:Real innovation (Score:2)
Weather report excertp:
I know this is the way this OS Linux agency lot of clout through northern parts and through eastern parts and you can see how this is this just pushing its weight and keeping eastern parts of Britain but there's no such plan through central and eastern parts he is going to be bringing a lot of snow
Anether except:
the coming into force this week in August tested in the courts they sit unworkable and unenforceable and that is
Re:What is the innovation here? (Score:1, Interesting)
Additionally, the "freeform" annotation of the video serves very well for searching: it enables marking up a video in a way that's very meaningful for people. Try out the YAPC video on the example web there to see what I mean. The annotati
thats great! (Score:1)
we do similar things, but with proprietary technology...
actually our implementation of interactive films (or structured video) goes beyond that and allows to link video with any additional information type.
also we've explored back in 2001-2002 almost all the applications of this technology and made a bunch of samples of various kinds.
Re:What is the innovation here? (Score:1)
Also, it is not actually creating smaller chunks from audio-visual files - the files stay intact! There is a separate language to create the markup (just like HTML for text pages) and once that is created, everything else is automated - also the table-of-content-generation process.
time-continuous media (Score:1)
Ok I'm being a pedantic asshole I admit it
Re:time-continuous media (Score:1)
Time-continuous media: television shows, movies, radio shows.
Non-time-continuous media: Paintings, photographs, books(?)
Re:time-continuous media (Score:1)
Time is a function of our inner experience. To illustrate time in Space, the function of our outer experience, it must be represented on a line. I assume that this would change the direction of the line.
Imagine watching the a video normally is live following a vertical line, but you can only turn your head horizontally. You can't see where you've been or where you are going.
Turn the line on its side, so you can see all the s
Re:time-continuous media (Score:1)
An openoffice document does not record an instance of time, yet it is media. There is no specificity in time of an OO document and neither does it change over time.
Yes there is the instant the document was saved/created but this applies to all media, even video clips but has nothing to do with the media itself ("How longs the DVD? 24th April 2004 10:56:
Firefox Plugins Links Thread (Score:2, Informative)
Allow me to kick it off. The following are links for Firefox browsers only as they will install themselves automagically upon click. You've been warned. A couple of these, I forgot which, install links are for the MS Windows platforms sin
Re:Firefox Plugins Links Thread (Score:2)
Re:Firefox Plugins Links Thread (Score:2)
W
Re:Firefox Plugins Links Thread (Score:1, Informative)
Your link is to an older version (1.2)
From the changelog:
So if you want to try it, better get 1.3: http://patsis.brownhost.com/hpxpi/linkpreview13.xp i [brownhost.com]
Re:Firefox Plugins Links Thread (Score:1)
Arse. Does that mean I have to leave now because Windows works on my PC?
Re:Firefox Plugins Links Thread (Score:1)
Find every single Firefox extension in the world here [mozilla.org]
Re:Firefox Plugins Links Thread (Score:2)
You know, for the irony.
Legal implications? (Score:2, Interesting)
How it really works (Score:5, Informative)
it is already possible with quicktime... (Score:1)
Could be great for TV news (free and otherwise) (Score:4, Interesting)
I think anybody doing closed captioning [robson.org] already has the descriptive content they need. (Others could use a similar process to create it.)
That info, combined with relatively easily-detectable scene transitions, would make it possible to automate the searchable video file creation to a large extent.
So the CC or equivalent would still have to be done manually but you'd have this extremely useful, huge searchable archive of video.
Not so easy for things that depend on the visual content as opposed to the spoken content, but for news it could be amazing.
Then watch as politicians and captains of industry squirm [ntk.net] at the thought that their every word and twitch is available for searching...
Re:Could be great for TV news (free and otherwise) (Score:1)
All Hail the Abe Vigoda Status Plugin (Score:2)
SMIL already does this, and is widespread (Score:3, Interesting)
What's more SMIL is already [w3.org] supported by Quicktime, Real, MS Media Player, & MS Internet Explorer (& Firefox with some effort).
For platforms SMIL is available on Linux, Linux/PDA, Windows, Windows CE, MacOS, & MacOS X.
For content creation numerous SMIL tools are out there, inlcuding most industry standard ones.
For those curious here's a SMIL tutorial [empirenet.com], in SMIL.
Re:SMIL already does this, and is widespread (Score:1)
When HTML was started, SGML was around and did all that HTML did, too - and lots more. Do you remember SGML?
Re:SMIL already does this, and is widespread (Score:2)
As to SMIL typically not including useful (search engine friendly) metadata, that's more an issue with authors not taking advantage of this then a shortcoming in the format.
Presumably with multimedia search becoming a standard service we'll see sites getting smart about exposing their resources and attracting users.
But, yes I recall SGML - I used to
I must be missing something here... (Score:2)
Re:I must be missing something here... (Score:1)
Also, there are development tools available as open source code on all platforms for Ogg, while QuickTime is not available on Linux, which is where we started developing.
BTW: With Annodex, links are als
Re:I must be missing something here... (Score:2)
I'm curious though - why couldn't you annotate while streaming with MPEG-4? (There are a number of open source projects to manipulate MPEG-4, which offers the same sort of multi-track format QT does. *We* just happened to be using QT because back in '98 when OvalTine was started, that was the only thing out there that came close to filling the bill. Now, MPEG-4 does, no QT necessary.) We looked into it,
Re:I must be missing something here... (Score:2)
Again, I'm not sure what the hierarchical vs. flat nature of the formats has to do with streaming. Obviously, QT containers can be streamed just dandily, the preprocessing is to provide a series of hints to the server to optimize for network load, client player, etc. The container still stays the same, a
Oh no, not preparation! (Score:2)
Holy fuck, that's just like saying text files have to be "prepped [w3.org]" before they can be part of a global hypertext system [w3.org].
Dear god, whatever shall we do.
We need a neologism (Score:2)
I suggest to call them WiVi.
Search me? (Score:2)
404 (Score:2)
Re:/.ed?? (Score:2)
Re:/.ed?? (Score:2)
Re:/.ed?? (Score:1)
http://mirrordot.com/stories/c097c40d3f9a53ff5c7dd fc2f7f1c05c/index.html [mirrordot.com]