This is an archive of the original scripts.sil.org site, preserved as a historical reference. Some of the content is outdated. Please consult our other sites for more current information: software.sil.org, ScriptSource, FDBP, and silfontdev




Computers & Writing Systems


Short URL: https://scripts.sil.org/Encoding

Character set encoding basics (Peter Constable, 2001-06-13)
In understanding technologies for working with multilingual and multi-script text data, we need to start with an understanding of character encoding. Systems for working with text involve a collection of processes that work together: processes for creating and editing text, for presenting it, for sorting, for laying out paragraphs and wrapping at line breaks, etc. Character encoding is what ties all of these processes together.
Computer systems employ a wide variety of character encodings. The most important of these for us is Unicode. It is also important for us to understand other encodings, however, and how they relate to Unicode. In this section, I want to look at some basic concepts that relate to all encodings, and also give an overview of legacy encodings and their importance for us.

XSEM: XML Scripture Encoding Model (Dennis Drescher, 2001-09-06)
The XML Scripture Encoding Model (XSEM), an SIL project, is a markup language that conforms to the Extensible Markup Language (XML) version 1.0 standard. On this page you will find information about the model and the project.

Character Encoding Choices in Paratext 6 (Peter Constable, 2003-04-29)
This article discusses options available to users for how their data can be encoded in Paratext 6, and looks at pros and cons of those options.

Windows and Codepages (Martin Hosken, 1997-12-29)
This document examines how Windows 95 handles multilingual computing. It looks at Languages, Codepages, Locales, Unicode and Fonts with particular reference to their support in Windows 95.
An alternative title for this document might be: “How to add a new script to Windows 95 and fail”.


© 2003-2024 SIL International, all rights reserved, unless otherwise noted elsewhere on this page.
Provided by SIL's Writing Systems Technology team (formerly known as NRSI).