Lasso Soft Inc. > Home

  • Articles

Unicode-aware

Lasso actively converts the character sets of various input and output.

Lasso is Unicode-aware; that is, at the core of Lasso, all text is stored in Unicode. In order to facilitate this, Lasso must convert all input (files on disk, browser requests, etc) into Unicode. Lasso then converts all output from Unicode into the desired character set.

While this provides very powerful text-serving capabilities, it also poses a challenge to the developer to always ensure the character sets of input are correct. Failure to do so means the wrong characters are sent as output.

While there are a variety of ways to get text data into Lasso, the two ways that appear to have the most character set mismatches are listed below. Follow the respective links to read more:

1. File encoding article
2. include_url from Tagswap

Also, an overview of character encoding work flow is here:

Character Encoding and Unicode Workflow

Author: Bil Corry
Created: 30 Mar 2009
Last Modified: 16 Mar 2011

Comments

No comments found
You must be logged in to comment.

Please note that periodically LassoSoft will go through the notes and may incorporate information from them into the documentation. Any submission here gives LassoSoft a non-exclusive license and will be made available in various formats to the Lasso community.

LassoSoft Inc. > Home

 

 

©LassoSoft Inc 2015 | Web Development by Treefrog Inc | PrivacyLegal terms and Shipping | Contact LassoSoft