From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Greylist: delayed 450 seconds by postgrey-1.34 at speech; Sun, 11 Dec 2011 08:53:58 EST Received: from mail.insightbb.com (smtp.insight.synacor.com [208.47.185.22]) by speech.braille.uwo.ca (Postfix) with ESMTP id E18BBC1A0C8 for ; Sun, 11 Dec 2011 08:53:58 -0500 (EST) X_CMAE_Category: 0,0 Undefined,Undefined X-CNFS-Analysis: v=1.1 cv=xTcl4fkGVQQtDOcA3lWsiwIMXnwBVhEmF98vkNvaKdo= c=1 sm=0 a=0K7_Eq2e2LYA:10 a=jLN7EqiLvroA:10 a=8nJEP1OIZ-IA:10 a=J3Oe81MOfEy-iyEJqKgA:9 a=wPNLvfGTeEIA:10 a=YvC6Y5V1PpclOuQhTIW3vg==:117 X-CM-Score: 0 X-Scanned-by: Cloudmark Authority Engine Authentication-Results: smtp02.insight.synacor.com smtp.mail=n8kl@insightbb.com; spf=softfail; sender-id=softfail Authentication-Results: smtp02.insight.synacor.com header.from=n8kl@insightbb.com; sender-id=softfail Received-SPF: softfail (smtp02.insight.synacor.com: transitional domain insightbb.com does not designate 74.130.214.42 as permitted sender) Received: from [74.130.214.42] ([74.130.214.42:1049] helo=randy3) by mail.insightbb.com (envelope-from ) (ecelerity 2.2.2.40 r(29895/29896)) with ESMTP id 46/78-11821-434B4EE4; Sun, 11 Dec 2011 08:46:29 -0500 Message-ID: <3AA846357E9F4197BC21A54FFF615192@randy3> From: "Kitty Litter" To: "Speakup is a screen review system for Linux." References: Subject: Re: How to map /dev/vcsa charactel values to unicode code points Date: Sun, 11 Dec 2011 08:46:30 -0500 MIME-Version: 1.0 Content-Type: text/plain; format=flowed; charset="iso-8859-1"; reply-type=response Content-Transfer-Encoding: 7bit X-Priority: 3 X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook Express 6.00.2900.5931 X-MIMEOLE: Produced By Microsoft MimeOLE V6.00.2900.6157 X-BeenThere: speakup@braille.uwo.ca X-Mailman-Version: 2.1.14 Precedence: list Reply-To: "Speakup is a screen review system for Linux." List-Id: "Speakup is a screen review system for Linux." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 11 Dec 2011 13:53:59 -0000 There are charts you should be able to find giving codepoints and the character they represent. If you know C I can give you code to demonstrate how to get the codepoint from a 2 3 or 4 byte utf-8 sequence. Basically when you see an extended ascii character you determine how many leading 1 bits there are. If there are 3 for example then sequence should be a 3-byte utf8. You then check the second and third byte to see that B15 is 1 and B14 is zero. Then you concatenate the bits and come up with the codepoint. Quite complicated!