On mySimon: The Biggest Box Sets
BNET Business Network:
BNET
TechRepublic
ZDNet

March 19th, 2008

Fraunhofer IIS shows audio technology at VON.x 2008

Posted by George Ou @ 9:47 am

Categories: Audio Conferencing, VON, Video Conferencing, VoIP, ~Events~

Tags: Microphone, Speaker, Microsoft IIS Server, George Ou

In the world where chip technology improves exponentially, acoustic engineering isn’t so simple and it presents a huge hurdle to overcome to the world of telephony and video conferencing.  Fraunhofer IIS (inventors of MPEG-1 Layer 3 AKA MP3) seeks to tackle this challenge and showed off some of its research and upcoming products at VON.x 2008.

The first demonstration given to me was echo cancellation technology that prevents sound coming out of a speaker from reentering the microphone.  This is one of the most annoying things about using PC telephony like Skype since Skype lacks good echo cancellation technology[update 3/20/2008 - Recent versions of Skype now have very good echo cancellation on Windows, Mac, and Linux.  I still experienced some problems because the clients on the other end were using older versions of Skype.]  When you connect to someone using a speaker and microphone, you can often hear yourself talking a split second after you speak and it’s incredibly annoying.  With Fraunhofer’s echo cancellation technology, that problem virtually disappeared.

Now I’m fully aware that Skype is a free application but that hasn’t stopped open source solutions like Asterisk from offering licensed technology where a user for example pays $10 for the G.729 codec.  I’d gladly pay a little money for some good echo cancellation software.

The other cool demonstration was the discrete multichannel sound separation technology.  Normally when you’re in a room with multiple microphones being mixed in to a single sound channel and transmitted over a single audio channel, the sound is blurred.  But when the sound from each microphone and each person is sent in its own channel and played back from its own speakers, you can clearly hear each person speaking at the same time.  The downside of course is that each audio channel uses a separate 64 Kbps steam but that may not be a problem since it’s dwarfed by the video stream.

If you can’t spare the bandwidth and you only want to use a single 64 Kbps audio stream, Fraunhofer has another technology that can separate each person on to its own channel by marking the streams with few identifiers.  Once that’s done, each person can be moved from one sector to the other in a graphical interface shown below such that their sounds come out from the corresponding speakers.  While it wasn’t as pure as the discrete channel solution, it sounded almost as good because each person’s voice had its own dedicated speaker.  Just the act of using a physically different speaker cone per voice seems to have a huge impact on quality.

While this technology demo used 5 speakers, there’s no reason it can’t be made to work with the more typical stereo speaker set up.  I’d love to see audio conferencing bridges incorporate this technology such that multiple sound sources are marked for separate speakers so that they can be played back from separate speakers.

George Ou is Technical Director of ZDNet. See his full profile and disclosure of his industry affiliations.

  • Talkback
  • Most Recent of 7 Talkback(s)
MP3 is very old so it isn't a fair comparison
MP3 is VERY old so it isn't a fair comparison against a modern codec like OGG.

They showed Fraunhoffer's AAC low delay codec to me and it exhibited very good quality and very good robustness against packet loss.... (Read the rest)
Posted by: georgeou Posted on: 03/19/08 You are currently: a Guest | | Terms of Use
Um... Skype does have AEC  spark555 | 03/19/08
Like the rest of Skype, it doesn't work well  j.a.watson | 03/19/08
MSN messenger has much better echo cancellation, but horrible codec  georgeou | 03/19/08
yeah, too bad it doesn't work at all  georgeou | 03/19/08
Fraunhofer uses their patents to impede innovation...  sdunn2000@... | 03/19/08
true... and ogg is a far better technology.  james.faction | 03/19/08
MP3 is very old so it isn't a fair comparison  georgeou | 03/19/08

What do you think?

SponsoredWhite Papers, Webcasts, and Downloads

Click Here
advertisement

Recent Entries

Top Rated

    advertisement

    Archives

    ZDNet Blogs

    White Papers, Webcasts, and Downloads

    SmartPlanet

    • Thought-provoking progressive ideas on diverse topics that intersect with technology, business, and life, and matter to the world at large. Visit SmartPlanet
    • More from IBM
    • Innovate your business' process model, play against the market, compete against others on our scoreboards and WIN! Try INNOV8 2.0: A BPM Simulator
    • Enabling Real-World Business Transformation through IBM Service Management Read the EMA Analyst Report
    Click Here