
(See the bottom of the page for a brief set of terms that might be useful.)
The speech it produces is of quite high quality for the time (and for a single-chip IC). It rates reasonably well in intelligibility, but is not very natural sounding. 64 different phonemes are available, and 4 levels of intonation.
The SC-01A (nor SC-01) is not available on a regular basis from any known suppliers. The chip
was last made in the late 1980's, and SSI told us that due to the
types
of processes used in this analog chip, it was unlikely to be made
again--too
complicated to transfer the technology. (However, we never had
really
good communication with SSI, so take this with a grain of salt.)
(2/24/04) Update! A posting to rec.games.vectrex claims that a stock of SC-01's can be found at http://www.best-electronics-ca.com/custom-i.htm. They do show a listing for "CO61720 (1400 / 1450 Speech SC01) $30.00". Their main business is to supply Atari parts. According to someone who called, they have maybe 20 left.
Brief comments on the SC-02 / SSI-263 / "Artic 263"
The SSI-263 (or whatever you prefer calling it) is quite different from the SC-01; it has many more control registers, a different package and pinout, and so on. As far as I know, it has the same analog formant synthesis core as the SC-01, and is also capable of intelligible (but not necessarily natural sounding) speech.(July 2000) In July 2000 I heard of two companies that claimed to still have access to stocks of the 263--very interesting. The first is Artic Technologies ((248) 588-7370, Troy Michigan, http://www.artictech.com/). Their website implies the chip is still in production, but a sales person indicated that the last buy on the chip was a couple of years ago, around 1998. (This conflicts with information I had back in the early 1990's, but as I said, information did not flow very well back then.) SSI was not the manufacturer at that point (Semtech might have been at one point, and it may have passed to yet another house by the end). Artic has a stockpile of "Artic 263" chips for their own products, and is willing to sell chips. A ball park on the pricing is around $100 qty 1, $90 qty 10+, and they can quote on larger quantities. Interesting note--Artic founders were involved in the development of the Votrax chips (http://www.artictech.com/history.htm). (As of mid-2002, Artic is still quoting about $100/chip.) The second company I heard about through a personal e-mail. This person seemed to indicate their company had a steady supply of 263's, but this person wasn''t able to discuss any details, and in any case it didn't sound like any chips were available to sell to anyone else. Quite possible that their company was also running on a stockpile from a last buy, but who knows.
(10/20/05) Another SSI-263 Source! Henry Courbis of GSE-Reactive tells me that he has some SSI-263's for sale, $50 each. http://www.gse-reactive.com They also have various Apple IIe and IIgs parts (including a clone of the Mockingboard).
(4/11/06) See below for SSI-263A datasheet and user guide.
The bottom line? Single-chip analog formant synthesis is likely down for the count. There are some recent offerings, though, that could fill the niche of single-chip phoneme synthesizer. First is the SpeakJet, a pre-programmed PIC chip that takes serial data in (phonemes, though probably not the same ones as the SC-01 or 263) and outputs PWM speech. Second is the Winbond text-to-speech chip. For details on both, see below under Modern Alternatives.
See also Gagnon, R. T. (1978). "Votrax Real Time Hardware for Phoneme Synthesis of Speech," Proc. Int. Conf. Acoust. Speech Signal Process. ICASSP-78, 175-178. (Many thanks to Eric Smith for finding this citation!)
From "Talking Terminals," David M. Stoffel, Byte, September, 1982: "The Votrax VSA and VSB synthesizers seem quite similar with respect to their phoneme production, but the FSST-3, which uses the VSA, definitely sounds inferior; whether this is an artifact of the VSA synthesizer or poor audio amplification, I don't know. You may wonder why none of these products uses the new Votrax SC-1A (sic) integrated circuit, which is less expensive. The single quantity cost of the VSB is about $800, while the SC-01A is $70. But there are two major reasons why the SC-01A is not used. The speech-rate and pitch controls are both dependent on the same clock signal or timing circuit, affecting the ease with which intelligible speech may be produced. Also some people are concerned about the acceptability of the SC-1A's (sic) sound quality. Only scientific performance measures can determine which Votrax synthesizer is ultimately more intelligible. (For a description of an application using the Votrax SC-01A speech-synthesizer chip see Steve Ciarcia's article on page 64 in this issue.)" See http://www.lindenreport.com/stoffel/talk.html
For the best overview of speech synthesis up to the late 80's, see
Klatt, Dennis, "Review of Text-to-Speech Conversion for English," Journal
of the Acoustical Society of America, 82:3, September 1987, p
737-793.
This is an excellent overview, and includes (pg 756) a brief
description
of the SC-01 that starts as follows::
"Apparently oblivious to all of the prior research detailed earlier,
a man experimenting in his basement workshop, Richard Gagnon, designed
a synthesis-by-rule program that eventually resulted in the Votrax
SC-01
chip. ... It is a remarkable device for the price."
No mention of Mozer (of National Semiconductor's speech synthesizer,
if I'm remembering correctly), but this is the best article to start
with
if you want an introduction to the synthesis of speech.
Important note: Klatt's paper is now online! See this and other papers at http://www.mindspring.com/~ssshp/ssshp_cd/ss_home.htm
Prochnow, Dave, Chip Talk: Projects in Speech Synthesis, Tab
Books, Blue Ridge Summit, PA: 1987. ISBN is 0-8306-1912-7 (hard
cover)
and 0-8306-2812-6 (paperback). Hardcover is Tab # 2812.
A hobbyist-oriented book on speech synthesizers circa 1987, including
For a deep account of one text-to-speech system (the basis for some of the best speech synthesizers until perhaps recently), see the book From Text to Speech: The MITalk System, by Allen, Hunnicutt, and Klatt, Cambridge University Press, 1987. (The pseudo-code in the back is not, however, without a number of errors and omissions.) The parameter-to-speech part of MITalk is detailed for the most part in Klatt's "Software for a Cascade/Parallel Formant Synthesizer," J. Acoust. Soc. Am., 67:3, March 1980, pg 971-995.
See http://www.mindspring.com/~ssshp/ssshp_cd/ss_home.htm, the Smithsonian Speech Synthesis History Project, which includes audio of a variety of synthesizers, the Klatt paper, and personal recollections. See especially the chronology of Votrax's speech products.
Obviously, newsgroups such as comp.dsp, comp.speech.*, comp.arch.embedded, and comp.robotics.*--and their associated FAQ's--are valuable resources.
See also http://www.robotprojects.com/voice/voice.htm, by Scott Savage, which has some interesting links on speech synthesizers.
(7/2/02) Tom McClintock notes the following: "One item of interest regarding the SC-01a. The 'PinMAME' developers have incorporated SC-01a emulation into their pinball simulations. The source code includes digital representations of all the phonemes. Pretty cool stuff, but complete and accurate emulation is not quite there. Check out the source: http://pinmame.retrogames.com/release/pinmame_112_1_src.zip"
Bob Paddock wrote up a nice list of links at http://www.chipcenter.com/circuitcellar/june00/c0600rp42.htm
(2/22/04) A new pre-programmed PIC that does single chip speech synthesis and sound effects--the SpeakJet. Apparently released within the past two weeks, it accepts serial date in (phonemes) and output a PWM signal that with minimal (2-pole) filtering can be fed to an amplifier and then speaker. Internal oscillator. Seems to run about $25. Developed apparently by Magnevation (www.magnevation.com) and Scott Savage (oopic.com) over the past 5 years. I have only heard a few demos. Widely available through robotic supply sources. The interface and command set look very well thought out. This might turn out to be a very nice chip for applications that would normally want to use the SC-01A. According to an email I got from Scott, the SpeakJet does do transitioning between phonemes. If anyone has additional details how this works, what PIC it is (someone guessed an PIC18F1320), etc., let me know. (This conjecture makes sense--the 18F1320 has an 8x8 mulitplier, 8k bytes of program space, PWM, and runs about 10 MIPs. This is more than enough to do a stripped-down digital formant synthesizer. A full bore, unoptimized KLATTalk-ish formant synthesizer core will run on a 10 MIPs 16-bit wide chip with MAC.)
Update (12/4/07): Robert Doerr (http://www.robotworkshop.com/) just wrote an article in the December issue of Servo Magazine (http://www.servomagazine.com/) about using a small microcontroller to translate SC-01 phonemes into SpeakJet allophones, plus handle the interface signals so you can plug the circuit into a regular SC-01 22-pin socket. If you need to replace an SC-01, but keep the rest of your circuitry intact (e.g., Hero robot), this could be an interesting solution.
Also, Chip Gracey, Parallax founder and the designer of the
Propeller chip, has apparently been working on speech synthesis that
would run on the Propeller. (See Make magazine volume 10:
http://www.make-digital.com/make/vol10/?pg=78&search=parallax+propeller+speech&u1=texterity&cookies=1.)
Anyone with additional information? If it ran on just one
of the eight 32-bit processors (which should be quite realistic), this
would be interesting for new embedded applications.
(7/2/02) Robert Doerr points out a newish chip from Winbond, the WTS701, which includes text-to-speech algorithms. http://www.winbond.com/E-WINBONDHTM/partner/b_2_a_5.htm.
(5/14/03) Tom Arnold points out that the datasheet is finally available for the WTS701, along with a live demo (you type in text, get back audio output) From the description, it sounds like it stores speech (using the ISD technology) on chip, concatenating to form the output. (See their FAQ on the page above.) Surface mount package. SPI interface. (See sources below for a module.)
You could also port open-source speech synthesizers to a microcontroller/DSP platform. Not trivial--you'll need on the order of 5-10 MIPS of 16-bit wide processing (with MAC), and the digital-to-analog output along with at least 32-64 kB of program space plus some RAM. Probably not very economical given the options above, but very educational if you are into that sort of thing.
As for speech synthesis by concatenation ("why not just record all 64 sounds from the SC-01 and string them together as you want?"), see the comment at the bottom of this page.
Please note that I do not have any
SC-01A's or 263's for sale.
(2/24/04)
Update!
A posting to rec.games.vectrex claims that a stock of SC-01's can be
found at http://www.best-electronics-ca.com/custom-i.htm.
They do
show
a listing for "CO61720 (1400 /
1450 Speech SC01)
$30.00". They supply Atari parts. Apparently about 20
left.
(10/20/05) See the comments under SSI-263 at the top for a new source for the SSI-263.
(2/22/04) A source for some speech chips (not including the
SC-01A): http://www.speechchips.com/shop/
The sell the
SpeakJet, SP0256-AL2, CTS256-AL2, and even the SP1000!
(2/22/04) There's a UK company selling a module with the
Winbond WTS701 (at least that's what it looks like): http://www.totalrobots.com/access_files/speech.htm
40 pounds UK (around $75).
Phoneme: An abstract sound unit, for example a sound like "eh". The problem is that the actual sound associated with a phoneme depends on its context--the "eh" sound is influenced by what sounds preceed and follow it, for example. The SSI-263 has some 60 phonemes it uses to synthesize English speech.
Allophone: The actual realization of a phoneme (i.e.,
in
a particular context). More than one allophone can be associated
with a given phoneme.
Feel free to e-mail me if you have any interesting resources or note errors on this page: dgrover at redcedar.com