Last modified: 2014-04-08 15:45:24 UTC

Wikimedia Bugzilla is closed!

Wikimedia migrated from Bugzilla to Phabricator. Bug reports are handled in Wikimedia Phabricator.
This static website is read-only and for historical purposes. It is not possible to log in and except for displaying bug reports and their history, links might be broken. See T33221, the corresponding Phabricator task for complete and up-to-date bug report information.
Bug 31221 - Audio pronunciation: Automatic text-to-speech to convert IPA to sound
Status: REOPENED
Product: MediaWiki extensions
Classification: Unclassified
Component: Extensions requests
Version: unspecified
Hardware: All All
Importance: Lowest enhancement
Target Milestone: ---
Assigned To: Nobody - You can work on this!
Reported: 2011-09-28 18:58 UTC by Ryan Jackson
Modified: 2014-04-08 15:45 UTC (History)


Attachments
speaker icon - when clicked, uses computer audio output to pronounce subject word (1.22 KB, image/png)
2011-09-28 18:58 UTC, Ryan Jackson

Description Ryan Jackson 2011-09-28 18:58:48 UTC
Created attachment 9114
speaker icon - when clicked, uses computer audio output to pronounce subject word

I would like to request an audio pronunciation feature for entries, similar to that found on <dictionary.reference.com>.
Comment 1 Bawolff (Brian Wolff) 2011-09-28 19:01:32 UTC
Don't we already do this? (And this is more of a project-level issue; I doubt we want computer-generated pronunciations.) For example, [[wikt:help#Pronunciation]] on Wiktionary has a speaker icon, and so does [[Poland]] on Wikipedia.

Closing invalid. I don't think there is anything to do.
Comment 2 Ryan Jackson 2011-09-28 19:15:08 UTC
I think it would be beneficial to have such a feature for every entry. I see this as a usability issue.
Comment 4 MZMcBride 2012-05-27 20:28:52 UTC
Re-opening.

As far as I can tell, this is a perfectly valid feature request. We're talking about the ability to turn IPA into speech on-the-fly. The current system doesn't work because uploading individual files of individual words is a pain in the ass, I think. And it's just overwhelming. I think for a lot of purposes, the voice technology that came with computers fifteen years ago is "good enough" or can be wrestled to be.

But ideally you'd have the ability to turn proper IPA into sound. A smarter solution is needed. Consider this a brainstorming bug. :-)
Comment 5 Amgine 2012-05-27 21:02:31 UTC
This could be a reasonably classic tag extension, <text2speech type="IPA" icon="speaker">/a.zyʁ/</text2speech>

It would also be a cool feature on the Wiktionary Mobile App, which uses the speaker icon as a button to play pronunciation files for articles which have them.
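
A rough sketch of how such a tag might be picked out of page text, written in Python purely for illustration (an actual MediaWiki tag extension would be PHP). The tag and attribute names come from the example above; `SpeechTag` and `parse_text2speech` are hypothetical names, and stripping the surrounding slashes or brackets is only an assumption about what a synthesizer would want as input.

```python
import re
from dataclasses import dataclass

# Matches the hypothetical <text2speech ...>...</text2speech> tag.
TAG_RE = re.compile(
    r"<text2speech(?P<attrs>[^>]*)>(?P<body>.*?)</text2speech>",
    re.DOTALL,
)
# Matches simple name="value" attribute pairs.
ATTR_RE = re.compile(r'(\w+)="([^"]*)"')


@dataclass
class SpeechTag:
    type: str  # transcription system, e.g. "IPA"
    icon: str  # name of the icon to render, e.g. "speaker"
    ipa: str   # transcription with delimiters removed


def parse_text2speech(wikitext: str) -> list[SpeechTag]:
    """Return every text2speech tag found in the given text."""
    tags = []
    for match in TAG_RE.finditer(wikitext):
        attrs = dict(ATTR_RE.findall(match.group("attrs")))
        # Assumption: drop the /.../ or [...] delimiters around the IPA.
        ipa = match.group("body").strip().strip("/[]")
        tags.append(
            SpeechTag(
                type=attrs.get("type", "IPA"),
                icon=attrs.get("icon", "speaker"),
                ipa=ipa,
            )
        )
    return tags
```

Turning the extracted transcription into actual sound is the hard part and is not covered here.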
Comment 6 Quim Gil 2013-03-25 18:48:51 UTC
(In reply to comment #4)
> The current system
> doesn't work because uploading individual files of individual words is a pain
> in the ass, I think.

I haven't tried (have you?) but it shouldn't be. Nowadays plenty of people have a decent quality mic connected to the Internet.

> the voice technology that came with computers fifteen years ago is "good
> enough" or can be wrestled to be.

In English and a few other languages sure, but we have close to 200 Wiktionaries and most of those languages probably won't benefit from that tech research anytime soon.

For a reference, see 

http://www.loquendo.com/en/products/text-to-speech/languages-voices/
http://www2.research.att.com/~ttsweb/tts/demo.php

Another step will be to wait for open source alternatives to these proprietary and nowadays lucrative systems...
 
> But ideally you'd have the ability to turn proper IPA into sound. A smarter
> solution is needed. Consider this a brainstorming bug. :-)

Just in case it's useful:

http://www.w3.org/TR/speech-synthesis/#edef_phoneme
http://en.wikipedia.org/wiki/Speech_synthesis#Text-to-phoneme_challenges
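
For what it's worth, the `phoneme` element in the W3C spec linked above takes an IPA string in its `ph` attribute, so an IPA transcription could in principle be handed to any SSML-capable synthesizer. A minimal Python sketch of building such a document follows; `ipa_to_ssml` is a hypothetical helper, the word "azur" is just assumed display text for the /a.zyʁ/ example, and whether a given engine honours `alphabet="ipa"` for a given language is not something this sketch can promise.

```python
from xml.sax.saxutils import escape, quoteattr

# Namespace defined by the SSML 1.0 recommendation.
SSML_NS = "http://www.w3.org/2001/10/synthesis"


def ipa_to_ssml(word: str, ipa: str, lang: str = "en") -> str:
    """Wrap a word and its IPA transcription in a minimal SSML document."""
    return (
        f'<speak version="1.0" xmlns="{SSML_NS}" xml:lang={quoteattr(lang)}>'
        # The element text is the fallback if the engine ignores "ph".
        f'<phoneme alphabet="ipa" ph={quoteattr(ipa)}>{escape(word)}</phoneme>'
        "</speak>"
    )


if __name__ == "__main__":
    print(ipa_to_ssml("azur", "a.zyʁ", lang="fr"))
```

The resulting string would still have to be sent to a speech engine, which is exactly the piece that is missing for most languages.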
Comment 7 Yuvi Panda 2013-03-25 19:27:02 UTC
So if I'm reading this right, this requires an 'IPA -> Sound' engine of some sort?
Comment 8 Amgine 2013-03-25 19:56:08 UTC
(In reply to comment #7)
> So if I'm reading this right, this requires an 'IPA -> Sound' engine of some
> sort?

There is also a bug requesting the reverse, Sound -> IPA, but for this specific enhancement, yes.
Comment 9 Quim Gil 2013-03-25 20:04:54 UTC
(In reply to comment #8)
> There is also a bug requesting the reverse, Sound -> IPA

Where? I couldn't find it.
Comment 10 Rahul Maliakkal 2013-03-25 20:24:56 UTC
Many words in Wiktionary have a pronunciation attached, but there are many words that don't have this feature.
Example: http://en.wiktionary.org/wiki/compendium

I also did a quick random survey and found that words pertaining to a specific field, like mathematics or chemistry, don't have a pronunciation attached.

The solution I propose is to provide a button to record the sound. Clicking that button starts a 5-second recording, during which the speaker should be loud and clear and adhere to the phonetics. A rating feature would go well with this. To describe it briefly: people can record the pronunciation and volunteers can rate the recordings out of 5 (similar to the IMDb ones :)).

We could save the recordings in the Ogg or WAV format.
Comment 11 Amgine 2013-03-25 20:45:15 UTC
(In reply to comment #9)
> (In reply to comment #8)
> > There is also a bug requesting the reverse, Sound -> IPA
> 
> Where? I couldn't find it.

Neither can I. It was in a discussion about a mobile tool for recording spoken word and uploading to commons, both for the Wiktionary project and wikisource (for oral history recordings needing transcriptions.)
Comment 12 Nemo 2013-03-25 20:58:06 UTC
(In reply to comment #6)
> (In reply to comment #4)
> > The current system
> > doesn't work because uploading individual files of individual words is a pain
> > in the ass, I think.
> 
> I haven't tried (have you?) but it shouldn't be. 

That's what's requested here, in fact. :)

> Nowadays plenty of people have a decent quality mic connected to the
> Internet.

Which is why exploiting this resource is a good project.

(In reply to comment #7)
> So if I'm reading this right, this requires an 'IPA -> Sound' engine of some
> sort?

Not what comment 0 asked, but some proposed it; clarifying summary.

(In reply to comment #11)
> Neither can I. It was in a discussion about a mobile tool for recording
> spoken word and uploading to commons, both for the Wiktionary project and
> wikisource

Indeed, see URL where there's clear interest from the communities.
It's still not clear, from a Wikimedia projects point of view, if the aim is best served by an extension or other system, but the request is legit.
Comment 13 Quim Gil 2013-03-25 21:08:04 UTC
Adding Lars, who proposed the voice recording tool idea at:

http://thread.gmane.org/gmane.org.wikimedia.wiktionary/1265

And some context: Rahul - see comment 10 - is interested in this project for Google Summer of Code. Having a community need declared increases points for him. If someone would volunteer as mentor then his chances would increase even more (hint, hint).
Comment 14 Rahul Maliakkal 2013-03-25 21:12:34 UTC
(In reply to comment #13)

I would like someone to reply to comment #10

Quim Gil: thanks. I had to think a lot, as I told you! I'd be glad to work on this project.
Comment 15 Bawolff (Brian Wolff) 2013-03-25 21:46:18 UTC
(In reply to comment #13)
> Adding Lars, who proposed the voice recording tool idea at:
> 
> http://thread.gmane.org/gmane.org.wikimedia.wiktionary/1265
> 
> And some context: Rahul - see comment 10 - is interested in this project for
> Google Summer of Code. Having a community need declared increases points for
> him. If someone would volunteer as mentor then his chances would increase
> even more (hint, hint).

As an aside I think that should be discussed in a separate bug. Getting humans to record sound and getting auto tts of ipa (what comment 0 is asking for) is rather different.
Comment 16 Matthew Flaschen 2013-03-27 17:44:05 UTC
(In reply to comment #15)
> As an aside I think that should be discussed in a separate bug. Getting
> humans to record sound and getting auto tts of ipa (what comment 0 is asking
> for) is rather different.

Agreed, the original report requested text-to-speech.  Easier pronunciation recording is a whole different thing, so I've filed bug 46610 and retargeted this to the original request.

As a side note, I agree computer pronunciation is inferior to a human recording.  The question is whether it's enough better than nothing to be worth implementing.
Comment 17 Rahul Maliakkal 2013-04-05 19:35:35 UTC
I have prepared a rough project proposal. Please give me your feedback and suggestions so that I can improve it: https://www.mediawiki.org/wiki/User:Rahul21/Gsoc
Comment 18 Quim Gil 2013-04-05 20:17:57 UTC
Rahul, your proposal is related to 

Bug 46610 - Pronunciation recording tool

Please announce it there. Thank you!
Comment 19 Rahul Maliakkal 2013-04-05 20:22:53 UTC
I am sorry, I had 2 tabs open at the same time, which caused some confusion!
Comment 20 Quim Gil 2014-04-08 15:45:24 UTC
One year of silence. Setting to Lowest only to reflect the current reality, which is that nobody we are aware of is working or planning to work on this.
