betabug... Sascha Welter

home english | home deutsch | Site Map | Sascha | Kontakt | Pro | Weblog | Wiki

28 December 2004

Greenglish, Greeklish, Γρήκλης

It's a small world for writing in Greek without a Greek keyboard

Chatted with yorix from HelMUG today on AIM (using BitlBee on my side) and he mentioned that he had always wanted to write a program to translate Greeklish to real Greek characters and back. As it happened we were both using Greeklish and as it also happened I had been looking up information on Greeklish on the web the day before.

Greeklish (or Greenglish) is writing Greek with latin characters, for example you write ellhnika instead of ελληνικά. Transliteration relies on the human ability to make sense out of nonexact rules. Machine transliteration would face difficulties, because everybody makes their own Greeklish. I had some ideas about coding for transliteration already...

Some links: Scientific approach (Greek), a table for transliteration (English), sample from irc instructions.


Another example is part of our discussion in the chat:

11:41 [+yorixofhelm] betabug: Apo kairo hthela na kanw mia efarmogi pou na
metafrazei apo greeklish se ellinika...
11:41 [+yorixofhelm] betabug: Mporei loipon na fanei xrisimh :-)
11:41 [+yorixofhelm] betabug: Alla mporeis na kaneis kai ayto pou les...
11:42 [@betabug] yorixofhelm: nai, kai egw to skeftika ayto


Yorix writes the greek θ (theta) as "th", while I tend to use the 8. But since nobody tells me so, I may forget and use th sometimes. Same game with other characters. Yorix uses "u" for the Greek "ypsilon", while I go with the latin "y". His choice is better as it optically matches in better and leaves latin "y" open to be possibly used for Greek psi (ψ). I changed my habit.

Most disturbing for me is the setting of "r" for ρ (greek ro). I always want to use "p" for this, which matches better optically. But there is no obvious optical choice for π (greep pi) so it is stuck at "p".

Another problem with machine supported "Greeklish" would of course be that this is very much "up" in the human interface, close to the top of the application level. It could almost be implemented as a keyboard layout and font (so you could write and see proper Greek characters on your own side, while the other side gets Greeklies). This solution would presume an encoding where characters have a 1-to-1 relationship. Theta (θ) could not be translated to "th" because the character we see a different character on our side, but it's still just one character.

If OTOH we code a routine that transliterates input and display, then my first question is: "How do we hook that up to our applications?" Where does iChat get it? How do I tell Terminal.app? Writing web pages in TextEdit, have to call an AppleScript every time I save?.

Terminal.app might be the most simple, as in the shell I can put an Expect script between keyboard input (STDIN), display (STDOUT) and translate all kind of stuff. There is an example script for Expect that does Dvorak keyboard layout in that way.

Using a font/keyboard layout, we'd get the exact same transliteration always, like it or not. If you like to use some other transliteration, you have to change the font. And on the receiving side, if your partner uses some other transliteration, you might see partial garbage coming in. People will probably never change their habits for you, so you will for example have to live with the wrong Sigma at the end of words this way.

The program routine could employ extra logic to guess at the transliteration used on the other side. A lexicon of greek words could be used to make best guesses at certain ambivalent characters. This will have the result that we don't get full round-trip fidelity, more of a very loose mapping... but that's what Greeklish is all about.

Update: I'm now using my own python Greek to Greeklish converter, which does Unicode UTF-8 too - at least for mail reading (I don't convert Greeklish to Greek back).

Posted by betabug at 22:30 | Comments (7) | Trackbacks (0)
ch athens
Life in Athens (Greece) for a foreigner from the other side of the mountains. And with an interest in digital life and the feeling of change in a big city. Multilingual English - German - Greek.
Main blog page
Recent Entries
Splish splash I was taking a vacation (08/18 11:40)
Ferienzeit (07/29 17:09)
Backyard Adventure (07/08 08:18)
Danakos Climbing Gym (06/26 18:40)
Best of
Some of the most sought after posts, judging from access logs and search engine queries.

Apple & Macintosh:
Security & Privacy:
Misc technical:
Athens for tourists and visitors:
Life in general:
| Main | Bericht von der Generalversammlung des griechischen Mac-Clubs >>
Comments
Re: Greenglish, Greeklish, Γρήκλης

In the meantime I discovered a Greek converter (for various encodings) by Diomidis Spinellis (GRconv). Doesn't solve the "while you type in a chat" problem though.

Posted by: betabug at January 17,2005 15:04
Re: Greenglish, Greeklish, Γρήκλης

Hi

One answer to the problem you describe is the Automatic Greek Internet editor. You just type as you would IF you had a Greek keyboard, and the letters com out Greek! You then copy-paste to any apllication you want. Check it out:

http://greeklish.net/edit

It is at a very early stage, but all the basics work fine. Many enhancements are promised for the future, including forums integration, etc...

Use is free. Works on both IE & Mozilla/Firefox.

HTH

Posted by: "k" at June 19,2005 10:52
Re: Greenglish, Greeklish, Γρήκλης

I have made an extension for Mozilla Firefox which automatically converts the text typed in greek (like http://greeklish.net/edit) . The output is in UTF-8 so it's only a cut n' paste way from the textboxes in Firefox. The url is : https://addons.mozilla.org/extensions/moreinfo.php?id=830

Posted by: Φώτης at July 20,2005 16:22
Re: Greenglish, Greeklish, Γρήκλης

I'm researching this problem in my diploma thesis. Check it out at http://django.easysoftware.gr/greeklish/.

I'm afraid that currently the site is down, but I'll have it back up ASAP.

Posted by: Orestis Markou at October 16,2006 17:30
Re: Greenglish, Greeklish, Γρήκλης

Orestis, the site works somehow (I had a link failing, but the start page came up). But I didn't see anything about a diploma thesis :-) Oh, and I fixed up the link.

Posted by: betabug at October 16,2006 18:59
Re: Greenglish, Greeklish, Γρήκλης

You can find out more about it:

http://django.easysoftware.gr/main/about/

The url has changed...

Posted by: Orestis Markou at December 18,2006 17:18
Re: Greenglish, Greeklish, Γρήκλης

hi, if you can write a dvorak keyboard for greek for osx
i tried to make one with ukele but it doesnt work smoothly
thanks

windows good too

Posted by: christo at February 10,2010 14:24
Re: Greenglish, Greeklish, Γρήκλης

christo, I have no need for something like a greek dvorak keyboard... so I see no reason why I should make one. Also I disagree on the statement that windows is good too.

Posted by: betabug at February 11,2010 09:12
Re: Greenglish, Greeklish, Γρήκλης

Hi I am currently trying to learn Greek but all my friends communicate in Grenglish or Greeklish. Is there a way I can convert this text to English to understand??

Please let me know?

L

Posted by: Lauren at December 08,2010 01:31
Re: Greenglish, Greeklish, Γρήκλης

Lauren, I don't think there is something that will convert Greeklish to Greek and then to English. Sorry, you will have to tell your friends to use proper Greek when writing!

Posted by: betabug at December 08,2010 09:45
Trackbacks
You can trackback to: http://betabug.ch/blogs/ch-athens/7/tbping
There are no trackbacks.
Leave a comment