Home

Contact Us

General

Initiative B@bel

WSI Guidelines

Encoding

Principles

Unicode

Training

Tutorials

PUA

Conversion

Resources

Utilities

TECkit

Maps

Resources

Input

Principles

Utilities

Tutorials

Resources

Type Design

Principles

Design Tools

Formats

Resources

Font Downloads

Gentium

Doulos

IPA

Rendering

Principles

Technologies

OpenType

Graphite

Resources

Font FAQ

Links

Glossary


NRSI: Computers & Writing Systems

SIL HOME | SIL SOFTWARE | SUPPORT | DONATE

You are here: Encoding > Conversion > Maps
Short URL: http://scripts.sil.org/MappingFiles

Mapping Files

Lorna Evans, 2014-09-29

Note:

We have moved all the TECkit mapping files to a GitHub repository. That repo is found here:  https://github.com/silnrsi/wsresources. You are welcome to browse and find mapping files or to submit your own mapping files.

The mapping files on this page continue to be available. However, they now come directly from the GitHub repo rather than a static download.

These mapping files were created for converting data to Unicode. Some, but not all, will also convert data from Unicode back to the legacy encoding. These mapping files are for use with TECkit applications. You can do this using the program TECkit or SILConverters 4.0.

Contents

Africa

Burkina Faso

Legacy font encoding: SIL Burkina Faso standard fonts

Compiled (and uncompiled) conversion tables for each set of fonts is included here, which maps SIL Burkina Faso encoded data to Unicode and vice versa.

 SIL Burkina Faso mapping

Cameroon

Legacy font encoding: SIL Cameroon standard fonts

Compiled (and uncompiled) conversion tables for each set of fonts is included here, which maps SIL Cameroun encoded data to Unicode and vice versa.

 SIL Cam2 mapping

Chad

Legacy font encoding: Tchad2000 and Chad95 fonts

Compiled (and uncompiled) conversion tables for each set of fonts is included here, which map Tchad2000 and Chad95 encoded data to Unicode (not vice versa). Although each mapping file is bi-directional, conversion from Unicode back into the legacy encoding isn't recommended, as there would be some poor diacritic choices back in the legacy encoding. Each mapping file is currently only intended to support legacy to Unicode conversion.

 Tchad2000 mapping

 Chad95 mapping

Côte d’Ivoire

Legacy font encoding: SIL Côte d’Ivoire (RCI) standard fonts

Compiled (and uncompiled) conversion tables for each set of fonts is included here, which maps SIL Côte d’Ivoire encoded data to Unicode and vice versa.

 SIL Côte d’Ivoire (RCI) mapping

 Samogho mapping

Democratic Republic of Congo

Legacy font encoding: SIL Eastern Congo Group (ECG) standard fonts

A compiled (and uncompiled) conversion table is included here, which maps SIL ECG encoded data to Unicode and vice versa. This mapping file is Unicode 5.0 compliant.

 SIL ECG mapping

Other Eastern Congo resources are a Unicode Keyman keyboard found here.

Ethiopia

Latin to Fidel Unicode mapping

Three TECkit mapping files (compiled and uncompiled) are included in this package. They are intended for use where text has been input “phonetically” as a syllable (be, ppii, etc.) and conversion to fidel is desired (, , etc.).

 Latin to Ethiopic mapping

GeezTypeNet

This mapping file enables conversion of custom-encoded text using the GeezTypeNet font to Unicode codepoints.

 Geez Type Net mapping

Ge'ez-1/Ge'ez-2/Ge'ez-3

This mapping file enables conversion of custom-encoded text using the PowerGe'ez Geez-1/Ge'ez-2/Ge'ez-3 font(s) to Unicode codepoints.

 PowerGe'ez mapping

PUA to Unicode 6.0 mapping

If you happened to obtain an early unreleased version of the SIL Abyssinica font, and used some of the Private Use Area (PUA) codepoints in your data, you may wish to use the following mapping file to convert your data from PUA codepoints to Unicode 4.1 (many of the PUA characters were added to Unicode 4.1).

 SIL ETG PUA mapping

Other Ethiopic resources include the  Abyssinica SIL font and an  Ethiopic Keyman keyboard to type characters needed in Ethiopian and Eritrean languages.

Liberia

Three TECkit mapping files (compiled and uncompiled) are included in this package. SILVai is for use in converting text encoded with the SIL Vai font to Unicode. SILVaiE is for use in converting text encoded with the SIL Vai Extras font to Unicode. SILVaiNTsfm is intended for use where text has been input “phonetically” as a syllable (ba, dle, etc.) and conversion to Unicode Vai characters is desired (U+A552 VAI SYLLABLE BA, U+A514 VAI SYLLABLE DEE, etc.).

 SIL Vai mapping

Other Vai resources: Two Vai Unicode fonts are available  here. Dukor is based on SIL Vai. Windows 7 provides a Unicode font with Vai support called Ebrima. A Windows Keyman keyboard is available here: Known Unicode Keyman Keyboards. A MacOSX keyboard is available  here.

Mali

Legacy font encoding: SIL Mali standard fonts

Compiled (and uncompiled) conversion tables for each set of fonts is included here, which maps SIL Mali encoded data to Unicode and vice versa. The legacy and fonts will be posted at some future time.

Mali Standard

 SIL Mali Standard mapping

Malicode Dogon

 Malicode Dogon mapping

SIL FHP Fulfulde

 SIL PHP Fulfulde mapping

Senufo

 SIL Senufo mapping

Other Mali resources include Keyman keyboards developed for use in Malian languages and Charis SIL fonts which have been modified for use in Malian languages.

Nubian

Compiled and uncompiled TECkit conversion tables are included, which map data using the Nobiin,  Ibrim, Nubian(Dr. Mokhtar) and Nubian(Mokhtar.M&H & Kashef.H) fonts to Unicode.

 Ibrim mapping

 Nobiin mapping

 Nubian (Dr. Mokhtar) mapping

 Nubian (Mokhtar.M&H & Kashef.H mapping)

Togo/Benin

Legacy font encoding: SIL Togo/Benin standard fonts

The compiled (and uncompiled) conversion table is included here, which maps SIL Togo/Benin encoded data to Unicode and vice versa.

 SIL Togo/Benin mapping

Americas

Brazil

Legacy font encoding: SIL Brazil standard fonts (BRB, Daw Ortografia, Daw Phon (Daw phonology), Ik (Ikpeng), Kaxarari, Kd (Kadiweu), Mamainde Nt, Mund (Munduruku), Nambi (Nambikuara), Nbfono Nbfono, Special Junk, Special Junk 2, Special Junk 3, Special Junk 4, Tuc Tucano, Richards Wwpo, Xavante)

Compiled (and uncompiled) conversion tables for each set of fonts is included here, which maps legacy encoded data to Unicode and vice versa.

 SIL Brazil mapping

Chile

Legacy font encoding: SIL Rapa Nui standard fonts

A compiled (and uncompiled) conversion table is included here, which maps "SIL Rap" encoded data to Unicode and vice versa.

 SIL Rapa Nui mapping

Colombia

Legacy font encoding: SIL Colombia standard fonts

Compiled (and uncompiled) conversion tables for each set of fonts is included here, which maps COB encoded data or ILV Times encoded data to Unicode and vice versa.

 SIL Colombia-i mapping

 SIL Colombia-u mapping

 ILV Colombia mapping

Costa Rica

Legacy font encoding: Bribri SIL font (Bribri:BZD)

Compiled (and uncompiled) conversion tables for each set of fonts is included here, which maps Bribri SIL encoded data to Unicode and vice versa.

 Bribri SIL mapping

Ecuador

Legacy font encoding: Shuar SIL font (Shuar:JIV)

A compiled (and uncompiled) conversion table is included here, which maps Shuar SIL encoded data to Unicode and vice versa.

 Shuar SIL mapping

Guatemala

Legacy font encoding: SIL Guatemala standard font

A compiled (and uncompiled) conversion table is included here, which maps CAB encoded data to Unicode and vice versa.

 SIL Guatemala mapping

Honduras

Legacy font encoding: Garifuna CABGarif SIL font (Garifuna:CAB)

A compiled (and uncompiled) conversion table is included here, which maps Garifuna CABGarif SIL encoded data to Unicode and vice versa.

 Garifuna CABGarif SIL mapping

This language is also spoken in Guatemala, Belize and Nicaragua.

Panama

Legacy font encoding: Naso font (Naso:TFR)

A compiled (and uncompiled) conversion table is included here, which maps Naso encoded data to Unicode and vice versa.

 Naso mapping

Paraguay

Legacy font encoding: AcheSIL font (Aché:GUQ)

A compiled (and uncompiled) conversion table is included here, which maps AcheSIL encoded data to Unicode and vice versa.

 AcheSIL mapping

Asia

SIL-Philippines Standard Font to Unicode mapping

Legacy font encoding: SIL Philippines standard fonts

Compiled (and uncompiled) conversion tables for each set of fonts is included here, which maps SIL Philippines encoded data to Unicode and vice versa.

 SIL Philippines mapping

Sahu (SE Asia) SIL Charis 2002 Font to Unicode mapping

Legacy font encoding: Sahu SIL Charis 2002 fonts

Compiled (and uncompiled) conversion table maps data encoded according to the 2002 version of the Sahu SIL Charis font to Unicode and vice versa.

 Sahu SIL Charis 2002 mapping

Europe/Middle East

PitchContours

Legacy font encoding: PitchContour fonts (PitchContours, PitchContoursSymbol, Contour, Cont12, Contour4, Contour6)

Compiled (and uncompiled) conversion tables for each set of fonts is included here, which maps legacy encoded data to "Unicode" (SIL's Corporate PUA) and vice versa.

These are experimental mapping files for the PitchContours fonts. Nine codepoints for the pitches were placed in SIL Corporate PUA at U+F1F1..U+F1F9. Each of these mapping files require the use of Charis SIL or Doulos SIL version 4.1 or greater. They will turn into contours if used in Graphite applications. It is unlikely that any other fonts will contain these codepoints.

It is possible these mapping files will not work for you since the fonts sometimes were used inconsistently. At times data from one melody was placed on two lines in order to make it stretched out. With that anomaly, the brackets in the fonts were also much larger to span several lines.

In order for these mapping files to work, data for a melody must be on one line, it cannot be on two separate lines or the data will not be converted properly. If you put your data on several lines we have no way to automatically convert that data for you. It is also impossible to maintain the large size of the brackets in an encoding conversion. If you wish to have large brackets you will have to manually go through your document and resize the brackets by changing the font size.

 Pitch Contours mapping

We would like feedback on these mapping tables. Contact information is in the readme.txt found in the download package.

Amer Phon SILDoulos font

Legacy font encoding: Amer Phon SILDoulos font

A compiled (and uncompiled) conversion table is included here, which maps Amer Phon SILDoulos data to Unicode (but not vice versa).

This mapping file has been minimally tested. Feedback is requested (see the readme file which is part of the download).

 Amer Phon mapping

ASAP (IPA) fonts

Legacy font encoding: ASAP SILCharis font

A compiled (and uncompiled) conversion table is included here, which maps ASAP SILCharis data to Unicode.

This mapping file has been minimally tested. Feedback is requested (see the source mapping file for contact information).

 ASAP SILCharis mapping

Linguist's Software fonts: GraecaII, SemiticaDict, TransRomanAH and TransRomanDict

Legacy font encoding: GraecaII, SemiticaDict, TransRomanAH and/or TransRomanDict fonts1

Compiled conversion tables are included here, which map GraecaII, SemiticaDict, TransRomanAH and/or TransRomanDict data to Unicode and vice versa.

Note that several of these fonts contain characters that probably do not exist in any one font. These mapping files have only been minimally tested. Feedback is requested (see the readme file which is part of the download).

 Linguist's Software mappings

IPA-SAM phonetic fonts

Legacy font encoding:  IPA-SAM phonetic fonts

Compiled and uncompiled conversion tables are included here, which map IPA-SAM data to Unicode and vice versa.

This mapping file has only been minimally tested. Feedback is requested.

 IPA-SAM mapping

SIL Apparatus

Legacy font encoding: SIL Apparatus font

A compiled conversion table called SILApparatus.tec is included here, which maps SIL Apparatus data to Unicode and vice versa.

There are 20 characters from SIL Apparatus that are not
in Unicode. We have output U+FFFD  REPLACEMENT CHARACTER in front of the base character. If you search for you can use text markup to make the following character superscript.

 SIL Apparatus mapping

SIL Ezra

Legacy font encoding: SIL Ezra font

These mappings (and the included documentation) will help you convert some of your old Hebrew data to the Unicode codepoints, so that you can use the Ezra SIL fonts without re-typing your data. It should be particularly useful to those who have made a significant investment in their data using the SIL Ezra fonts.

 SIL Ezra mapping

SIL Galatia

Legacy font encoding: SIL Galatia

Two draft conversion programs are included with this package. One is for  Consistent Changes and the other is for TECkit. Neither of the Greek mappings have been extensively tested. These are intended for converting texts which have been encoded for the SIL Galatia legacy font to Unicode (Galatia SIL is a Greek Unicode font).

 Galatia SIL mapping

SIL Greek Trans

Legacy font encoding: SIL Greek Trans font2

A compiled conversion table called SILGreekTrans.tec is included here, which maps SIL Greek Trans data to Unicode and vice versa.

This mapping file has only been minimally tested. Feedback is requested (see the readme file which is part of the download).

 SIL Greek Trans mapping

SIL Heb Trans and SIL Heb Trans Caps

Legacy font encoding: SIL Heb Trans and/or SIL Heb Trans Caps3 fonts

Two compiled conversion tables (SILHebTrans.tec and SILHebTransCaps.tec) are included here, which map SIL Heb Trans and/or SIL Heb Trans Caps data to Unicode and vice versa.

These mapping files have only been minimally tested. Feedback is requested (see the readme file which is part of the download).

 SIL Heb Trans mappings

SIL IPA93

Legacy font encoding: ipa93

A compiled conversion table called SILIPA93.tec is included, which is set to map ipa93 data to Unicode IPA and vice versa. This mapping has been updated to reflect up to Unicode 5.1. The only changes were for the downstep and upstep. Unicode 5.1 now supports all characters that were in IPA93 and Doulos SIL and Charis SIL support this.

 SIL IPA93 mapping

SIL IPA 1990

Legacy font encoding: SIL IPA 1990

A compiled conversion table called SIL-IPA-1990.tec is included, which is set to map SIL IPA 1990 data to Unicode IPA and vice versa.

The conversion back to legacy will not be a clean round-trip, because there are duplicates of many diacritics in the SIL IPA fonts.

This mapping has been updated to reflect up to Unicode 5.1. The only changes were for the downstep and upstep. Unicode 5.1 now supports all characters that were in IPA93 and Doulos SIL and Charis SIL support this.

 SIL IPA 1990 mapping

SIL PUA to Unicode 9.0 Mapping

The Unicode 9.0 standard includes 221 (not including Hebrew) characters that were previously allocated to codepoints in the Private Use Area by SIL’s PUA committee.

All processes (input methods, mappings) that create Unicode data should be revised to generate the proper Unicode values instead of PUA codes.

If you have data that contains these PUA codes, it should be updated by replacing each PUA character with its official Unicode counterpart. This will facilitate data interchange and the use of standard fonts and software.

 SIL PUA mapping

Note

Charis SIL and Doulos SIL fonts currently only support Unicode 8.0. Therefore, unless you have fonts which support the Unicode 9.0 codepoints you may wish to wait to update your PUA mapping until Charis SIL and Doulos SIL are updated.

Translator’s Workplace miscellaneous fonts

Legacy font encoding: Dt Di SILDoulosL, I1 Ii SILDoulosL, Hausa Font SILCharis Regular and/or LRSystem fonts

Compiled conversion tables are included here, which map data using these fonts to Unicode and vice versa.

Note that several of these fonts contain characters that probably do not exist in any one font. These mapping files have only been minimally tested. Feedback is requested (see the readme file which is part of the download).

 Translator's Workplace miscellaneous mappings

Other mapping files for fonts distributed with Translator’s Workplace can be found here:

Pacific

SIL Papua New Guinea Standard Font to Unicode mapping

Legacy font encoding: SIL Papua New Guinea standard fonts

A compiled (and uncompiled) conversion table is included here, which maps SIL Papua New Guinea encoded data to Unicode and vice versa. All PNG characters are now Standard Unicode 5.1. No more PUA characters are needed!

 SIL PNG mapping

Unicode in PNG — Why and How
Roland Fumey, 2007-05-04
Download "UnicodeInPNG-WhyAndHow2007-05.pdf", Acrobat PDF document, 1MB [9190 downloads]

The above document deals with most Unicode issues for the SIL PNG branch.

It covers the following topics:
  • Why Unicode?
  • In what cases do you need to convert your data to Unicode?
  • What are the tools available, and how do you install and use them?
  • Troubleshooting

System

Symbol Font to Unicode mapping

The Symbol font built into PostScript printers and now common to Apple and Microsoft platforms contains Greek, mathematical, and other technical symbols, most of which can be mapped to Unicode.

 Symbol Font to Unicode mapping

Symbol-encoded font to codepage 1252 transliterator

As discussed in Display Issues – FAQ, Microsoft Word data that is formatted using a symbol-encoded font such as SIL Galatia or SIL IPA93 is stored using a range of PUA characters, typically U+F020 .. U+F0FF. When using File / Save As... to create an 8-bit plain-text version of the data, such PUA characters are converted to question marks ('?'). One solution is to use this Unicode-to-Unicode mapping file, called a transliterator, to convert the PUA character codes to codepage 1252 before saving to plain text.

 Symbol-encoded font to codepage 1252 transliterator

Symbol to PUA mapping for RTF files

Use this mapping to convert legacy data in an RTF file from PUA back to legacy. For more information, see What's in Your File?.

 Symbol to PUA mapping for RTF files

Draft TECkit mapping file

Use this mapping as a starting point when writing a new legacy mapping file. For more information, see How to Write a Conversion Mapping for your Legacy Font.

Draft TECkit Mapping file
Joan Wardell, 2008-03-17
Download "UTTDraftMapping.txt", Text document, 14KB [5892 downloads]

Revision History

2014-09-29 LE: updated PUAtoUnicode mapping file for Unicode 7.0
2011-11-01 LP: added Ethiopic converters
2011-07-24 LP: added ASAP map
2011-05-17 LP: updated TBB map
2010-12-03 LP: updated Ethiopic mapping files for Unicode 6.0
2009-10-26 LP: updated PUAtoUnicode mapping file for Unicode 5.2
2008-05-21 LP: updated PNG mapping for Unicode 5.1
2008-04-08 LP: updated PUAtoUnicode mapping file for Unicode 5.1
2008-04-07 LP: updated IPA93 and IPA-1990 mapping files for Unicode 5.1
2008-02-27 JW: added draft mapping file
2007-06-26 LP: added PitchContours mapping files
2007-06-22-26 LP: added many "Americas" mapping files
2007-06-15 LP: updated Ezra SIL mapping files
2007-05-04 LP: added SIL Papua New Guinea Unicode document: How and Why
2007-03-21 LP: updated SIL Papua New Guinea mapping file
2007-03-02 LP: added mapping file for IPA-SAM encoding
2007-02-02 LP: added ECG mapping
2006-10-31 LP: added Chad mappings
2006-05-19 LP: added TW mappings
2006-04-28 LP: added Ethiopic mapping
2005-10-03 LP: added SILPUAtoUnicode41 mapping
2005-06-11 JW: added SymboltoPUA mapping

2005-03-17 BH: updated ipa93 mapping to v12

2004-12-01 LP: updated SIL Papua New Guinea mapping file

2004-07-22 LP: updated ipa93 mapping and added PUA2Uni4.0 mapping

2004-06-21 BH: added SymbolFontMapping

2004-05-03 LP: updated links to fonts

2004-04-27 LP: updated Greek mapping file

2004-04-26 LP: changed cc link

2004-04-12 LP: added PNG mapping file

2004-02-10 JW: removed "available soon" comment, added CC table


1 GraecaII, SemiticaDict, TransRomanAH andTransRomanDict fonts were distributed with Logos and Translator’s Workplace
2 SIL Greek Trans was distributed with BART and Translator’s Workplace
3 SIL Heb Trans and SIL Heb Trans Caps were distributed with BART and Translator’s Workplace

© 2003-2017 SIL International, all rights reserved, unless otherwise noted elsewhere on this page.
Provided by SIL's Non-Roman Script Initiative. Contact us here.