Thursday, July 17, 2008

Apple filing takes Podcasts to the next level

By Slash Lane

Published: 12:00 PM EST

A recently published filing discovered by AppleInsider reveals work by Apple's chief software architect to advance the Podcast beyond its static form and into a live interactive presentation medium suitable for use by educational institutes and businesses for their daily presentations.

"Podcasts of classroom lectures and other presentations typically require manual editing to switch the focus between the video feed of [an] instructor and the slides (or other contents) being presented," Bertrand Serlet, Senior Vice President of Software Engineering at Apple, wrote in the 15-page filing. "In a school or enterprise where many presentations take place daily, editing podcasts require a dedicated person, which can be prohibitive."

To solve this problem, Serlet has proposed an automated content capture and processing system in which a live camera feed of a presenter is automatically merged with a Keynote or PowerPoint presentation to form an entertaining and dynamic podcast that lets the viewer watch the presenter's slides as well as the presenter.

In one example outlined in the filing, the content capture system provides a video stream (Stream A) and a Keynote presentation stream (Stream B) to a recording agent such as a Mac running specialized Podcast creation software. The recording agent then blends the two feeds together based on certain cues and sends the combined feed to a syndication server that would then distribute the video wirelessly as a Podcast to any number of authorized Macs, iPods or iPhones.

Serlet also explained that the syndication server could include an automated content creation application that applies one or more operations to Streams A and/or B to create new content, such as transitions, effects, titles, graphics, audio, narration, avatars, animations, and so forth.

"For example, a content stream (e.g., Stream B) output by the application can be shown as background (e.g., full screen mode) with a small picture in picture (PIP) window overlying the background for showing the video camera output (e.g., Stream A)," he wrote. "If a slide in Stream B does not change (e.g., the "trigger event") for a predetermined interval of time (e.g., 15 seconds), then Stream A can be operated on (e.g., scaled to full screen on the display). A virtual zoom (e.g., Ken Burns effect) or other effect can be applied to Stream A for a close-up of the instructor or other object (e.g., an audience member) in the environment (e.g., a classroom, lecture hall, studio)."

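The 15-second rule in Serlet's example can be sketched in a few lines of Python. This is only an illustration of the idea, not code from the filing: the function name `choose_layout`, the layout labels, and the assumption that the presentation starts at time zero are all hypothetical.

```python
# Illustrative sketch only; names and layout labels are assumptions, not from the filing.
SLIDES_WITH_PIP = "stream_b_full_screen_with_stream_a_pip"
CAMERA_FULL_SCREEN = "stream_a_full_screen"


def choose_layout(slide_change_times, now, idle_threshold=15.0):
    """Pick the layout at time `now` (seconds) from slide-change timestamps."""
    # Most recent slide change at or before `now`; assume the talk starts at 0.0.
    last_change = max((t for t in slide_change_times if t <= now), default=0.0)
    idle = now - last_change
    # Trigger event: the slide has not changed for the predetermined interval.
    if idle >= idle_threshold:
        return CAMERA_FULL_SCREEN
    return SLIDES_WITH_PIP


if __name__ == "__main__":
    changes = [0.0, 10.0, 40.0]
    for t in (12.0, 26.0, 41.0):
        print(t, choose_layout(changes, t))
```

With slide changes at 0, 10 and 40 seconds, the sketch keeps the slides in the background until the slide has sat unchanged for 15 seconds, then hands the full screen to the camera until the next slide change.
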
Podcast Patent Example


The Apple executive also explained that trigger events can be captured from the actual presentation environment using, for example, the capture system, including patterns of activity of the instructor giving a presentation and/or of the reaction of an audience watching the presentation.

"The instructor could make certain gestures, or movements (e.g., captured by the video camera), speak certain words, commands or phrases (e.g., captured by a microphone as an audio snippet) or take long pauses before speaking, all of which can generate events in Stream A that can be used to trigger operations," he wrote.

"In one exemplary scenario, the video of the instructor could be shown in full screen as a default. But if the capture system detects that the instructor has turned his back to the audience to read a slide of the presentation, such action can be detected in the video stream and used to apply one or more operations on Stream A or Stream B, including zooming Stream B so that the slide being read by the instructor is presented to the viewer in full screen."

Podcast Patent Example


Throughout the filing, Serlet outlined examples of several other potential trigger events, such as the movement of a presentation pointer (e.g., a laser pointer) which could then be captured and detected as an event by an "event detector." For instance, the direction of the laser pointer to a slide can indicate that the instructor is talking about a particular area of the slide. Therefore, in one implementation, an operation can be to show the slide to the viewer.

"The movement of a laser pointer can be detected in the video stream using AVSR software or other known pattern matching algorithms that can isolate the laser's red dot on a pixel device and track its motion (e.g., centroiding)," he added. "If a red dot is detected, then slides can be switched or other operations performed on the video or application streams. Alternatively, a laser pointer can emit a signal (e.g., radio frequency, infrared) when activated that can be received by a suitable receiver (e.g., a wireless transceiver) in the capture system and used to initiate one or more operations."

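The "centroiding" step mentioned in the quote amounts to averaging the coordinates of the pixels that look like the laser dot. The sketch below is a minimal illustration under stated assumptions: the frame is a plain list of rows of (r, g, b) tuples, and the thresholds `min_red` and `max_other` are invented values, not anything specified in the filing or taken from a real vision library.

```python
# Illustrative sketch only; the frame format and thresholds are assumptions.
def red_dot_centroid(frame, min_red=200, max_other=80):
    """Return the (x, y) centroid of strongly red pixels, or None if none exist.

    `frame` is a list of rows, each row a list of (r, g, b) tuples (0-255).
    """
    sum_x = 0
    sum_y = 0
    count = 0
    for y, row in enumerate(frame):
        for x, (r, g, b) in enumerate(row):
            # A pixel counts as part of the dot if red dominates clearly.
            if r >= min_red and g <= max_other and b <= max_other:
                sum_x += x
                sum_y += y
                count += 1
    if count == 0:
        return None  # no red dot detected in this frame
    return (sum_x / count, sum_y / count)


if __name__ == "__main__":
    black = (0, 0, 0)
    red = (255, 10, 10)
    frame = [[black for _ in range(5)] for _ in range(5)]
    frame[2][1] = red  # pixel at x=1, y=2
    frame[2][2] = red  # pixel at x=2, y=2
    print(red_dot_centroid(frame))
```

Comparing the centroid from one frame to the next would give the pointer's motion, and a return value other than `None` could serve as the "red dot detected" event that triggers an operation.
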
In some other implementations, detection of a change of state in a stream is used to determine what is captured from the stream and presented in the final media file or podcast. For instance, the instructor's transition to a new slide can cause a switch back from a camera feed of the instructor to a slide. When the instructor presents a new slide, the application stream containing the slide would be shown first as the default configuration, and then, after a first predetermined period of time has expired, the view would switch to the video stream showing the instructor. In other implementations, after a second predetermined interval of time has expired, the streams can be switched back to the default configuration.
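
The timed switching described here behaves like a small state machine keyed on the time since the last slide change. The following Python sketch is one possible reading, not the filing's implementation: the function name, the stream labels, the default interval, and the choice to measure the second interval from the moment the view switches to the instructor are all assumptions.

```python
# Illustrative sketch only; names, labels and interval values are assumptions.
SLIDES = "application_stream"   # default configuration: show the slide
INSTRUCTOR = "video_stream"     # camera feed of the instructor


def active_stream(elapsed, first_interval=10.0, second_interval=None):
    """Return which stream to show `elapsed` seconds after a new slide appears.

    The slide is shown first. After `first_interval` the view switches to the
    instructor. If `second_interval` is given, the view returns to the slide
    once that much further time has passed.
    """
    if elapsed < first_interval:
        return SLIDES
    if second_interval is not None and elapsed >= first_interval + second_interval:
        return SLIDES
    return INSTRUCTOR


if __name__ == "__main__":
    for t in (5.0, 12.0, 35.0):
        print(t, active_stream(t, first_interval=10.0, second_interval=20.0))
```

Each new slide would reset `elapsed` to zero, which reproduces the "switch back from a camera feed of the instructor to a slide" behavior in the paragraph above.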

Taking his next-generation podcast concept a step further, Serlet went on to say that the capture system could conceivably include a video camera that can follow the instructor as he moves about the environment. The camera could be moved by a human operator or automatically using known location detection technology. The camera location information could then be used to trigger an operation on a stream and/or determine what is captured and presented in the final media file or podcast.

It should be noted that Serlet's concept is one of at least three Podcast enhancements proposed by Apple employees in recent patent filings, none of which has yet come to fruition. Others include personalized on-demand podcasts and Podmaps.

