|
|
NRSI: Computers & Writing Systems You are here: General Resources for Writing Systems Implementation using Copyleft and FLOSS (Free/Libre and Open Source Software)
Note Although many of the solutions mentioned in this document can be very helpful for those interested in implementing writing systems, SIL International has not tested them, does not make any specific recommendations or specific claims about their legality or suitability for use. It is the user's responsibility to test the maturity and to understand the licensing of the following items before using, modifying or redistributing them. Comments and feedback welcome. If you have experience with any of the projects mentioned below, feel
free to add a comment. Introduction ----- IntroductionAre you looking for ideas, information and tools to enhance or start from scratch a WSI (Writing System Implementation) or one of its components for a specific language community? This collection of links will help you use copyleft and FLOSS (Free/Libre and Open Source Software) technologies and methods to develop, adapt or enhance a WSI. The As a developer - or any kind of contributor - there are many advantages to a well-managed copyleft strategy, you can:
Note It is a good idea to check software catalogs (such as Use Existing Writing System ComponentsFonts
courrier, helvetica, new century schoolbook, symbol, times, utopia, lucida, lucidabright, lucidatypewriter, luxi, charter, bookman, open look, avant-garde, chancery, nimbus, palladio, courier, veranda, gothic, tymes, aerial, palatino... Various models of redistribution restrict the availability of fonts in a default installation, but you can download and add a lot of other fonts including proprietary ones. Copylefted fontsA selection of fonts packaged by Mandrake: A selection of fonts packaged by Redhat: A selection of fonts packaged by Debian: A selection of fonts packaged by Gentoo: The Vera Bitstream fonts from GNOME: The Toga fonts derived from Vera: The Arkpandora fonts derived from Vera: The Culmus Hebrew fonts: The Paktype Paskistani fonts: The Free UCS Outline fonts: The ETL Unicode font: The Bengali/Bangla fonts: The Mayayalam font: The Stix scientific fonts: The Dustimo fonts: The GNU Unifont: The Linux Font Project: The Linux Libertine Open fonts: A set of Cyrillic fonts: A set of Tamil fonts: The JHCFonts fonts: The URW++ standard PostScript fonts: The HindiFreehand fonts: The Wadalab Kanji fonts: The Legendum and Garioger OpenType fonts: The Malayalam Keli True type and Opentype font: The Andagii TrueType font and the Penuturesu Linear B script font: The Junicode (short for Junius-Unicode) medievalist font: The Khotot Arabic fonts: The StayPuft MS Comic-like font: The Isabella calligraphic hand-style font: The ClearlyU BDF Unicode font: The Gimp fonts: A font catalog and with the corresponding source and data files: Freeware fontsThese fonts are redistributable for free but with restrictions — no source and data files available and no modification rights. The Code2001 Plane 1 Unicode font:(with experimental provisional scripts added to Plane 15) The pxl2000 font: A selection of original font designs: The free font archive: George Williams's Unicode fonts: Various other freeware fonts collections:
The font section from the Google/DMOZ directory: SIL's has now been moved to here:
Public domain fontsThese fonts are available for redistribution with less restrictions but source code and data files are most of the time unavailable. The Ghostscript fonts: The Kanji scripts fonts: The Electronic Font Open Laboratory: The Raghu OpenType fonts: Libraries and rendering toolkitsFreetype: a free and portable TrueType font rendering engine, developed to provide TrueType support for a variety of platforms and environments. FreeType is a library which can open and manage font files as well as efficiently load, hint and render individual glyphs. FreeType is not a font server or a complete text-rendering library. The Freetype alternate packages: it is worth noting that you can get alternate Freetype packages from special repositories in software patent-unencumbered countries. A good example of this are the Freetype plf (Penguin Liberation Front) packages with hinting enabled Pango: a library for layout and rendering of internationalized text as well as handling Unicode strings and complex bidirectional or context dependent shaped strings QT: The KDE toolkit
WxWindows: another cross-platform UI library XFree86: The free implementation of the X Window system Xouvert: The open development community for the new improved X Window system X11 for Mac OS X: the extensions and contributions to X11 (X Windows) by Apple Cairo: a vector graphics library with cross-device output support Freedesktop: a standards initiative for all the desktop-related technologies. The cooperation initiative between the leading desktops environments: GNOME and KDE xlibs: the X Libraries and Protocol Headers Project directfb: a library that provides hardware graphics acceleration, input device handling and abstraction, integrated windowing system with support for transparency GLTT: a library for reading and drawing TrueType fonts in OpenGL FontTools: a library for manipulating fonts STFS (Standard Type Services Framework): a portable and extensible software framework for rendering typographically sophisticated text libunicode: a set of charset converters and string manipulation functions for the Unicode standard ICU (International Components for Unicode): a portable, open-source C/C++ Unicode library with: charset-independent locales (and multiple locales simultaneously supported in the same thread), character conversions, formatting/parsing for numbers, currencies, date/time and messages, message catalogs (resources), transliteration, collation, normalization, text boundaries (grapheme, word, line-)), etc. ICU4j: a Java Unicode library with extensions to the core Java internationalization libraries: enhanced number formatting and parsing, customizable BreakIterators, additional Calendars (Japanese, Buddhist, Hebrew, Islamic), Business Holidays, Normalizer, optimized international searching, UnicodeSet and Transliteration, SCSU, etc. fribidi: an implementation of the Unicode BiDi (bidirectional) algorithm Protobidi: a prototype bidirectional widget libots: a text summary library gtkspell: a set of tools to allow highlighting and replacement of misspelled words in a GtkText widget Enchant: the new cross-platform abstract layer to spellchecking libSDL_ttf: a True-type rendering library for SDL (Simple Direct Layer) ucdata: a library to handle character properties, case, decomposing and combined classes librecode: a transliteration, non-ASCII to ASCII conversion library Computational Resources for Linguistic Research SILGraphite: the new generation smart rendering engine and CTL (Complex Text Layout) library General Input Methods and keyboarding toolsgswitchit: the XKB toolkit (configurator, indicator) for GNOME K_TTY: a Keyman style keyboard interpreter for the console and attached (or pseudo) terminals (including X terminals, remote sessions, etc.) for IPA and non-roman scripts input edk (Enhanced Dispositionned Keyboard): an enhanced input method xvkbd: a virtual (graphical) keyboard program for X tkxinput: a set of tools to bring Xinput management capabilities to the Tk toolkit SCIM (Smart Common Input Method): a development platform that significantly reduces the difficulty of input method development. SCIM splits input method into three parts: FrontEnd, which handles user interface and communication with client applications, Server, which handles the key event to string conversion work, and BackEnd, which manages all of the Servers IIIMF (Internet/Intranet Input Method Framework): a set of tools designed to be the next generation of input method framework which provides the following capabilities:
libklavier: a utility library for X keyboard-related projects. gtkeyboard: a graphical keyboard that can be useful for the physically disabled and for keyboardless workstations. It provides the ability to type into foreign windows, to redirect output, and it also provides many shortcuts for common editing tasks XKeyCaps: a graphical front-end to xmodmap. It displays a keyboard and lets you remap keys using dialog boxes, rather than having to learn the arcane syntax used by the xmodmap command. xkb: the standard X keyboard tool kxkb: the keyboard-switcher in KDE (part of the kdebase package) gkb: the map selection applet for setxkbmap or xmodmap in GNOME settxkbmap: the command-line keyboard-switcher gtk-im-extra: a collection of third party GTK+ input method modules Language specific IM (Input Methods) and keyboarding toolsChinput (Chinese input): an input method for Chinese languages kinput2: an input method for Japanese skkinput: a simple Kana-Kanji-like Japanese input application (website in Japanese) im-ja: a generic Japanese input module for GTK+ 2. Currently supported input modes are Hiragana, Katakana, Half-Width Katakana, Zenkaku, Canna, Wnn, and Kanji character recognition Canna: a CJK (Chinese/Japanese/Korean) input method (website in Japanese) Interxim: a X11 XIM server that uses the Yudit kmaps files to translate ASCII character sequences to Unicode imhangul: a Korean input method for GTK+ xcin (X window Chinese INput): a collection of Simplified Chinese input methods based on XIM (X Input Method), including WuBi, PinYin, and QuWei. It is the descendent of gWuBi fcitfx (Fun Chinese Input Toy for X): a user-friendly input method for Chinese languages FreeWnn: a free implementation of the most widely-used Japanese input systems xsim: a simple and fast GB and BIG5 Chinese XIM server ivrix: a Hebrew input method acon (Arabic CONsole): an input method for left-to-right Arabic text on the console and ligature processing zhcon: a fast CJK console system im-classicalgreek: an input method for classical Greek unicon: a CJK (Chinese/Japanese/Korean) Linux console input and display system from the GNU Project. Supports console virtual terminals, GPM mouse, keyboard, and almost all video devices using the kernel frame buffer. It supports multiple language fonts with a font manager so each virtual console can display a different font and input method. UNICON also can display IBM table characters mixed with double byte language encodings Luit: Locale and ISO 2022 support for Unicode terminals KanjiPad: a Japanese handwriting recognition system. It uses the GTK toolkit for a GUI and Todd David Rudicks's algorithms from JavaDict for Kanji recognition. Brahmi: Java Input Methods and OpenType fonts for Indian languages Keyboards (copyleft and freeware)Inscript Keymaps for Indian languages: Maltese keyboards: Keyboards included with SIL fonts: (under SIL academic licensing) Keyman keyboards:
Character mappers and glyph selectorsWord processors offering a "special character" insertion dialog: gucharmap: the full Unicode character map for GNOME kcharselect: the glyph palette for the KDE environment (part of the kdebase package) Locales and related toolsA collection of locales By Pablo Saratxaga (The Mandrake i18n specialist): A collection of locales by IBM: localedrake: the locale switcher from Mandrake (part of the drakxtools administration suite) redhat-config-language: the locale switcher from Redhat (part of the redhat-config package) Printing systems - Device independent files and PDF generationcups (Common UNIX Printing System): the cross-platform printing solution for all UNIX environments. Based on the "Internet Printing Protocol", it provides complete printing services to most PostScript and raster printers foomatic: a database-driven system for using free software printer drivers with all known free software spoolers on Unix and Mac OS X ghostscript: a set of software that provides a PostScript interpreter, a set of C procedures (the Ghostscript library, which implements the graphics capabilities in the PostScript language) and an interpreter for Portable Document Format (PDF) files. Ghostscript translates PostScript code into many common, bitmapped formats, like those understood by your printer or screen. Ghostscript is normally used to display PostScript files and to print PostScript files to non-PostScript printers a2ps: a utility providing pretty-printing capabilities and including support for a wide number of programming languages, encodings (ISO Latins, Cyrillic, etc.), and medias Tipograf: a frontend to a2ps PangoPDF: the PDF and PostScript backends for Pango (the internationalization library from GNOME) LinuxPrinting: a documentation website around printing technologies and hardware for GNU/Linux, xBSD as well as Solaris and OS X Read documentation about WSI components and frameworksi18n primersThe GNOME i18n project: The KDE i18n project: The Intro-i18n documentation from Debian: Howtos, FAQs, man pages and tutorialsThe Font Howto: The True type font howto: The UTF8 and Unicode FAQ:
The Sub-Pixel Font Positioning howto: The Font de-uglification mini-howto: The comp.fonts FAQ (unmaintained but still interesting):
The keyboard and console howto: The XKB Configuration Guide: The Linux Accessibility howto: A Unicode bibliography: A portal on Chinese computing: Examples of language howtosThe Chinese-Howto: The Spanish Howto: The Thai Howto: The Linux Documentation Project: The man (manual) and info pages: (you can use the man pinfo yelp or khelpcenter tools to browse the documentation directly on your system, and you can find further documentation in /usr/share/doc/<nameofthepackage>) Papers from the i18n specialists within the FLOSS communityOwen Taylor: Noah Levitt: Pablo Saratxaga: Keith Packard: Frank Tang: Edward Cherlin: Yannis Haralambous:
Publications about font technologies"Things we wish we knew when we started on Truetype fonts": "Fighting Font Frustration" by Steve Coile: "Using TrueType Fonts with Red Hat Linux" by Dan Kegel : The "Advanced Font System" White Paper by Redhat: Configure your Desktop Environment (GNOME or KDE on the X Window system) for specific language needsSetting up the various WSI components to work smoothly together involves:
You need to set up these different font types for desktop use:
Under GNOMEfontilus: a set of extensions for the Nautilus graphical shell to help manage fonts on your system
(the "gnome-font-viewer" and the "gnome-thumbnail-font" utilities are part of fontilus) after installing your font you can click on them for a preview to select a particular font as your desktop font, use "Set as Application Font" entry in the context menu gfontview: an outline font viewer gnome-font-properties: a font configuration dialog (part of the GNOME Control Center)
Under KDEKfontinst: font installation tool for KDE
Other font toolsxfontsel: a font selection tool from XFree86 to filter fonts by foundry, family, weight, slant, s width, ad style, pixel size, point size, resx, resy, spacing, average height, registry, encoding The OpenOffice.org administration tool: a utility to configure OpenOffice.org and to add new fonts (they will only be available to OOo)
Manual font installation at command-line levelYou can also install fonts manually:
This will only make the fonts available to that specific user (since there are copied to his private home directory). Command-line toolsfc-cache: a font cache creation utility (part of the fontconfig package) fc-list: a font-lister in command-line mode (part of the fontconfig package) mkfontdir, mkfontscale, fslsfonts, xlsfonts, showfont: a set of utilities to handle font configuration (part of the XFree86-xfs package) Font Matching and Anti-aliasing settingsIn OpenOffice.org:
In Mozilla:
Other font management toolsFontconfig: a set of tools designed to locate fonts within the system and select them according to requirements specified by applications drakfont: the Mandrake font manager - a flexible font configuration framework defoma: the Debian Font Manager - automatic font configuration framework dfontmgr: the GUI frontend for defoma oto (Open Type Organizer): a tool to show precise font info and allow table mapping fontlingue: a font management tool with a web interface Dotfiles and important configuration files and directories in your filesystemAlthough all these settings are handled by front-end administration tools, it helps to know where to look
for settings in your filesystem. Dotfiles
Directories
Some useful environment variables$QT_XFT $GDK_USE_XFT $KEYBOARD $KEYTABLE $KBCHARSET $GTK_IM_MODULE Create your own WSI components based upon the existing copyleft models and toolsFont creation, conversion and modification toolsgfe (GNU Font editor): a graphical font editor based on the GIMP Toolkit gote (GNOME Opentype editor): an editor for vector fonts Gribouy: a PostScript font editor fonter: a interactive font editor for the console PfaEdit (Postscript font editor): an outline font editor that lets you create your own PostScript, True type, OpenType, cid-keyed, SVG and bitmap (bdf) fonts, or edit existing ones. Also lets you convert one format to another. PfaEdit has support for many MacIntosh font formats Cfe (Console Font Editor): a font editor allowing various glyph transforming, multi-level undo, and comparing glyphs of two fonts. cfe automatically supports loading of .psf and raw binary fonts fontutils: a set of tools to access font internals
otcomp (OpenType COMPiler): a compiler for OpenType fonts (with makefile examples) PostScript-utilities: a set of tools to handle PostScript and Truetype fonts tftmod: a utility to edit font tables (a companion to PfaEdit) truetypeviewer: a useful tool for debugging TrueType instructions and for viewing OpenType Layout features. It supports OpenType single, pair and mark-to-base positioning as well as single, multiple, alternate and ligature substitution tticomp (True type Instruction COMPiler): a compiler for TrueType fonts otlegacy: a tool to decompose Unicode characters into their components freetype-tools and demos: a set of test tools from the Freetype rendering library A collection of related Perl modules from CPAN (the Comprehensive Perl Archive Network):
ttfquery: a Python module using FontTools to query TTF font files for metadata and font outlines in a cross-platform manner T1utils: six programs that manipulate PostScript Type 1 fonts. Allowing you to change PFB (binary) fonts into PFA (ASCII) format (and back), translate fonts into a disassembled human-readable format (and back), and translate Macintosh Type 1 fonts into PFB or PFA (and back). font-tools: a set of font conversion utilities
lcdf-type-tools: various programs for manipulating PostScript Type 1 and PostScript-flavored OpenType fonts. MyToys:
fontutils: a tool to convert between font formats and to create fonts fontconv: a simple raster font converter ttf2pt1: a tool to convert scalable and bitmapped fonts into PostScript Type 1 Fondu: a tool to convert between mac fonts (and FONDs) and Unix font formats TTX: a tool to convert OpenType and TrueType fonts to and from XML ttf2pt1: a tool to convert nearly all True Type fonts to an Adobe Type 1 .pfa file Localisation teams and translation statisticsThe Translation project: The man page and howto translation projects: The GNOME Translation project: Review the GNOME translation status: Bring your language to GNOME by setting up a language team: The KDE Internationalization project: Review the KDE translation statistics: Bring your language to KDE by setting up a language team: Mandrake Linux i10n: Fedora i10n: Localization toolsGNU gettext: a set of libraries and utilities from the GNU project for producing multi-lingual messages gtranslator: an enhanced GNU gettext catalogs (.po files) editor for the GNOME desktop environment kbabel: A set of tools for editing and managing PO files created in GNU gettext for KDE intltool (internationalization tool collection): a tool to extract translatable strings from various source files, collect the extracted strings together with messages from traditional source files (.c, .h), and merge translations from .po files into .xml, .desktop, and .oaf files poedit: a cross-platform GNU gettext catalogs (.po files) editor getxml: an XML internationalization tool, comprised of getxml-extract (makes the header for GNU gettext) and getxml-merge (merges XML and po files) phpbabel: a web application making PHP internationalization easier by allowing a net of translators to do translations on a project and to generate include message files automagically (both constants and functions...) Plan a Free/Libre and Open strategy for your WSI componentsBe aware of open standards and build interoperable and sustainable WSI componentsThe ISO 639 Language Names: The standard names for use in SGML and XML, including a complete list of language name codes IANA Character Sets: the official names for character sets that may be used in the Internet and referred to in Internet documentation - held at the Internet Assigned Number Authority. The SIL Language IDs: The W3C's (World Wide Web Consortium) specification for internationalization (XML, CSS...): The Charsets:
The Script Encoding Initiative: organising new languages additions to the the Unicode standard The Unicode Test Pages: a set of pages to test the behaviour of Unicode-capable applications online
World Wide Web Consortium: resources from the international web standards organisation covering code tables, Unicode, HTML and XML with links to other resources and forums about internationalization and localization issues relating to character sets. Resources from the Unicode Consortium: The Free Standards Group: an important standard body Li18nux: The Linux internationalization project The RFCs (Requests for Comments): the standards of the various building blocks of the Internet IDN: the Internationalized Domain Names (IDN) Committee for multilingual Internet addresses Licensing and cooperative development modelsCompare the copyleft vs. proprietary modelsIf you're serious about creating a sustainable WSI component, you need to think about the following items:
Understand and choose the appropriate licenseFSF licence review GPL FAQ: FDL howto: Beware of software patents problemsSoftware patents are an increasing threat to interoperability and independant software creation. As a WSI component developper or contributor, you must be aware of this important issue. Example of the aliasing and hinting patent from Apple and other similar problems:your project may be safe in some countries and a big hassle in some others FFI (Foundation For a Free Information Infrastructure): a non-profit association registered in Munich dedicated to the spread of data processing literacy. FFII supports the development of public information goods based on copyright, free competition about open standards. Partnerships and fundingThere are other models than commercial-only WSI components:private/public/funding has to be considered for minority-friendly low-commercial-value WSI components Look for possible partners for WSI development:
Set-up a micropayment funding model: Paypal: The street performer models: The Potlach protocol: Look into research grants and awards:
Set up your FLOSS WSI project and get the community's interest and helpUnderstand the basics of Free, Libre and Open Source software developmentThe concept of copyleft: The GNU project: The Linux kernel: A timeline of Unix and GNU/Linux: What is Free Software? What is Open Source? The GNU Coding standards: The Software Project Management Howto: The Software Release Practise Howto: The Jargon File: a comprehensive compendium of hacker slang illuminating many aspects of hackish tradition, folklore, and humor (very useful to somewhat understand the values and the mindset ot the FLOSS community) The Open Source, Free Software and copyleft classics: OpenSources: Voices from the Open Source Revolution The Cathedral and the Bazaar: Musings on Linux and Open Source by an Accidental Revolutionary Free Software, Free Society: Selected Essays of Richard M. Stallman Free as in Freedom: Richard Stallman's Crusade for Free Software Code and Other laws of Cyberspace: The Future of Ideas:
Learn and contribute to the discussion on the major i18n mailing-listsThe Mandrake-i18n mailing-list: The Redhat-i18n mailing-list: The Debian-i18n mailing-list: The GNOME-i18n mailing-list: The KDE-i18n mailing-list: The XFree86-i18n mailing-list: /' target='_blank'>Learn how to package and deploy your WSI projectsRPMS howtos: Smart package managers (with repository creation tools): Community sites that offer to host your WSI component project and set up a distributed team so that speakers and specialists can contribute: If you feel like it you can also set up your own team platform using FLOSS groupware:
Community sites and software catalogs where you can announce your project to make users and developpers aware of your project and get some potential contributors: Freshmeat: the largest FLOSS catalog Footnotes: GNOME desktop news The Dot: KDE desktop news Slashdot: the huge news portal Create and publish documentation for your WSI componentsmanedit: the ManEdit UNIX Manual Page Editor is an editor specifically tailored for UNIX manual pages. It has a preview viewer, uses the manual page XML format for easy editing, and comes with a tutorial and reference guide and templates docbook: an XML and SGML dialect that enables you to author and store document content in a presentation-neutral form that captures the logical structure of the content. Using the modular DocBook stylesheets and related resources, you can transform, format, and publish your DocBook content as HTML pages and PDF files, and in many other formats, including TeX, RTF, FrameMaker MIF, JavaHelp, Microsoft HTML Help, UNIX man pages, and TeXinfo yelp (gman): The man and info interface for GNOME khelpcenter: The man and info interface for KDE (part of the kdebase package) Create training material for your WSITypinggtypist (GNU typist): a universal typing tutor ktouch: a program for learning how to touch-type typingtrainer: a typing skills training programme. Allows teachers to examine and record student's typing speed and accuracy Language learning softwareklettres: a tool to help learning the alphabet and practise reading syllables in different languages flashkard: a vocabulary training tool. It is designed to be used in any situation where a fixed set of vocabulary (technical or linguistic) is to be learnt vocabularytrainer: a flashcard training system for learning words in foreign languages. You can use language modules to change the language of its interface and can choose one of three learning modes (Random, Often-unknown words, and Seldom-asked words) kvoctrain: a vocabulary trainer from the KDE project Minivoki: a minimal console-based vocabulary learning aid which has some interesting features. It comes with Korean, Chinese, and Japanese input parsers, 7-bit input for 8-bit characters like accents, one-direction-only word pairs, learning and test modes, and some others features zaban: a multimedia database of human languages. It provides the flash card functionality people often use for learning foreign languages. You can enter words (both text and sounds) from multiple languages and have them played back to you in multiple ways. It has support for many different alphabets, audio recording, playback, and compression, configurable auto-cycle with audio playback, a shuffle mode to rearrange the order of playback, searching with regular expressions, configurable keymaps, and support for user generated lists. The sample database contains Swahili, Farsi, Hebrew, and English hanzim ("Hanzi Master"): an interactive visual dictionary for learning and seeing relationships between Chinese radicals, characters, and compounds kanjidrill: a multiple-choice Kanji quiz program that helps people learn Japanese characters with different guess formats and history options. Dictionary with Romaji, SKIP, four-corner, cut-n-paste, radical lookup methods and English search features kanatest: a simple, GTK 2-based kana drill tool. It offers three drill modes: hiragana, katakana, and mixed mode moji: a japanese language learning dictionnary for Mozilla pauker: a java-based flashcard program Use extra WSI components to extend your language community's cyber-literacyOffice, Publishing, Typesetting and DTP (DeskTop Publishing) softwareOpenOffice.org: a Free Software and Open Source, community-developed, multi-platform office productivity suite. It includes the key desktop applications, such as a word processor, spreadsheet, presentation manager, formula editor and drawing program, with a user interface and feature set similar to other office suites. Sophisticated and flexible, OpenOffice.org also works transparently with a variety of file formats, including Microsoft Office formats GNOME Office: a meta-project, with the mission to coordinate productivity applications for the GNOME Desktop KOffice: a free, integrated office suite for KDE Abiword: a free word processing program similar to Microsoft Word. It is suitable for typing papers, letters, reports, memos, and so forth. Various plugins and templates are available Scribus: a desktop publishing program. Its goal is to provide an easy-to-use tool for simple desktop publishing Passepartout: a desktop publishing application. The goal of this project is to create a system capable of producing pre-press material of professional quality, but also to be a useful tool for any enthusiast with access to a printer. The main focus is on making it easy for the user to create publications with a flexible layout, typical examples being magazines, brochures and leaflets Graphic design softwareGIMP (GNU Image Manipulation Program): a cross-platform Photoshop equivalent for photo retouching, image composition and image authoring. Lots of plugins and scripts are available Sodipodi: a vector-based drawing program with native SVG (Scalable Vector Graphics) support Sketch: a vector drawing program for Unix GNU Paint: a small, easy-to-use paint program for GNOME Web browsing, Email, Usenet, IRC (Internet Relay Chat) and IM (Instant Messenging) softwareMozilla: the standards-compliant cross-platform Internet suite from the Mozilla Foundation. Browser, Mail&News, AddressBook, Chat, Web design, Xul application framework. Lots of plugins and extensions are available. Check out the light stand-alone versions of the browser and email clients called Firebird and Thunderbird. The Mozilla internationalization and i18n projects:
sila: a smart rendering and CTL (Complex Text Layout) enabled version of Mozilla based on the SILGraphite library Pan: a Usenet newsreader for GNOME Xchat: a multiplatform IRC (Internet Relay Chat) client Gaim: a cross-platform cross-network IM (Instant Messenging) client for GNOME Kopete: a cross-platform cross-network IM (Instant Messenging) client for KDE Jabber: a well-established and open IM client-server protocol Website creation and publication toolsbluefish: a web development studio for GNOME quanta: a web development tool for KDE screem (Site CReation and Editing EnvironMent): an integrated development environment for creating and maintaining websites Mozilla editor/composer: a HTML and plain text editor in Gecko-based applications with DOM (Document Object Model) and IME (international text input) support nvu ("new view"): the next-generation Mozilla composer CMS (Content management systems) portals: a collection of dynamic website packages based on a database and a scripting language Wiki systems: a new website publication engine for easy and open collaborative web design flp-i18n (Flaimo's Little Package): a collection of PHP classes for internationalization. It formats date and time strings, replaces keywords, converts measurement units between SI and the US Customary System, and more Amaya: a web editor to create and update documents directly on the Web with HTML XML XHTML MathML and SVG support from the W3C (World Wide Web Consortium) Text editing with Unicode supportgedit: a lightweight text editor for the GNOME desktop with full Unicode and Pango support. Various plugins are available kwrite: a text editor with QT internationalisation support for the KDE Desktop (part of the kate editor) yudit: a free (Y)unicode text editor for all unices mined98: a free cross-platform versatile unicode CJK text editor worldpad: a smart non-roman script html and xhtml editor with SILGraphite support (win32 platform) emacs: a powerful, customizable, self-documenting, modeless text editor with strong internationalisation features
vim: a highly configurable text editor for programmers Simredo: a free Java Unicode editor Bengali Writer: a set of utilities for computerized editing and typesetting in Bengali XML editorsConglomerate: an XML editor for GNOME MLView: another xml editor for GNOME kxmleditor: an XML editor for KDE XML-type: a multilingual utf-8 XML editor Document processing markup languages (What You See Is What You Mean)LaTeX: a high-quality typesetting system, with features designed for the production of technical and scientific documentation. LaTeX is the de facto standard for the communication and publication of scientific documents. Based on TeX LyX: a friendly LaTeX frontend kyle: a friendly LaTex frontend for KDE PDFTeX: an extended version of TeX that can create PDF directly from TeX source files and enhance the result of TeX typesetting with the help of PDF Omega: an extension of TeX for printing all of the world's languages and for generating XML+MathML documents Text processing and massaging toolsPerl (Practical Extraction and Report Language): The amazing do-it-all scripting language by Larry Wall GNU coreutils: text manipulation utilities from the GNU Project GNU aspell: an intelligent spellchecker from the GNU Project GNU ispell: a spellchecker from the GNU Project spellutils: a set of programs to help spellcheckers isolate the target text unac: a utility to remove accents from strings emdros: a text database engine for storage and retrieval of analyzed or annotated text compare: a text analysis and comparison tool HSpell: a Hebrew spell-checker and morphology engine bookconv: a Cyrillic converter/filter minpair: a tool to find minimal pairs in a wordlist msort: a sorting program (capable of sorting shoebox style databases) redet: GUI regular expression development and execution tool (for matching and substitution) Transcriber: a program for creating (typically orthographic) transcriptions of sound recordings, time linked (typically at the phrase level) to a digital audio file cttext: a Thai word separator program unitex: a corpus processing system Databases, thesaurus, dictionaries and library toolsOpenOffice.org dba (database access): a frontend for various data sources GNOME-DB: a free unified database access architecture for GNOME kexi: an integrated environment for managing data in KDE. It helps creating database schemas, inserting, querying and processing data The GNOME dictionary applet: a frontend to online dictionaries (part of the gnome-utils package) gidic (Gtk Inter DICtionary): a front-end for GNOME to The Internet Dictionary project from Tyler Chambers, which includes French, German, Italian, Latin, Portuguese, and Spanish dictionaries visDic: a browser and editor of dictionaries stored in XML format Gjiten: a Japanese and Kanji dictionary program for GNOME, any combination of stroke count, radicals and search key can be used for Kanji lookups. StarDict: a cross-platform and international dictionary written in Gtk2 with powerful search features such as "Glob-style pattern matching", "Scan selection word", "Fuzzy query," etc. Aiksaurus: an English-language thesaurus library oss4lib: a portal dedicated to open source software for libraries Open language data repositoriesOLAC (Open Language Archive Community): an international partnership of institutions and individuals who are creating a worldwide virtual library of language resources by: (i) developing consensus on best current practice for the digital archiving of language resources, and (ii) developing a network of interoperating repositories and services for housing and accessing such resources UNESCO - Memory of the World Programme: an initiative from the Education, Scientific and Culture United Nations organisation to preserve documentary heritage The Project Gutenberg: a big collection of multilingual public domain e-texts ABU (Association des Bibliophiles Universels): a collection of francophone public domain resources Eldrich press: a famous public domain literature archive under Creative Commons Licensing The European Language Resources Association: a non-profit organisation hosting language resources for research purposes The Project Runenberg: the public domain center for Nordic literature The Rosetta Project: a language archive seeking to become the contemporary counterpart of the historic Rosetta Stone Ibiblio: a huge conservancy of freely available information, including software, music, literature, art, history, science, politics, and cultural studies. Ibiblio.org is a collaboration of the Center for the Public Domain and The University of North Carolina - Chapel Hill (Ibiblio is formerly known as Sunsite and Metalab) Creative commons: an organisation devoted to expanding the range of creative work available for others to build upon and share: using private rights to create public goods A selection of resources compiled by the University of Columbia: A selection of resources compiled by SIL: Language data viewers and converterslv: a powerful multilingual file viewer jless (Jam less): a patched less for multi character sets using ISO 2022 code extention techniques. With support for some code conversion among Japanese encoding schemes, JIS X 0208, JIS X 0213, SJIS, and UJIS utf8conv: a tool to encode Unicode values (in hexadecimal) as UTF-8, optionally displaying the resulting character and/or the Unicode description thereof. libiconv: a Unicode converter from the GNU Project enca (Extremely Naive Charset Analyser): detects charsets and allows conversion to other encodings uniconv: a text conversion utility from the yudit project ddsk: a Kana to Kanji conversion tool kakasi (KAnji KAna Simple Inverter): a simple language processing filter to convert Kanji characters to Hiragana, Katakana or Romaji UTF8_STRING: a selection format for UTF-8 data interchange enscript: a plain ASCII to PostScript converter Accessibility toolsThe GNOME accessibility project: The KDE accessibility project: The IBM Linux accessibility team: Text to Speech toolsgnopernicus: the GNOME Text to Speech project Speaker: a text-to-speech output system for the Konqueror Web browser and the Kate text editor, both part of KDE. Speaker is a pair of plugins that allow for text and selected text to be output as speech using an external speech synthesizer program. festival: a general multi-lingual speech synthesis system developed at CSTR. It offers a full text to speech system with various APIs, as well an environment for development and research of speech synthesis techniques. Freetts: a speech synthesis system written entirely in Java. It is based upon Flite, a small runtime speech synthesis engine developed at Carnegie Mellon University. Flite is derived from the Festival Speech Synthesis System from the University of Edinburgh and the FestVox project from Carnegie Mellon University MBROLA: a set of free speech synthesizers for as many languages as possible for non-commercial and non-military applications only speechd: a speech service implementing a /dev/speech device using either the Festival or rsynth speech synthesis packages. All plaintext written to this device will be spoken aloud emacsspeak: Emacspeak turns Emacs into a complete audio desktop to provide a pleasurable eyes-free computing experience The comp.speech FAQ: Scanning and OCR (Optical Character Recognition)Kooka: a raster image scan program for KDE Clara OCR: a powerful GUI and a web interface for cooperative digitalization of books GNU ocrad: a character recognition system from the GNU project. It is implemented as a filter and based on a feature extraction method kognition: the Omnifont OCR software for KDE gocr: a character recognition system. It reads pnm, pbm, pgm, ppm, some pcx, tga, pnm.gz, pnm.bz2, png, jpg, tiff, gif, bmp and other image files and outputs a text file OCRE: a character recognition system that reads an image file and writes ASCII characters Look for further resources on related portalsAssociationsTyperight: a diverse group of type designers, developers, foundries, and aficionados who are working together to promote typefaces as creative works and to advocate their legal protection as intellectual property i18nGurus: an open directory of links to internationalization (i18n) resources and related material Software and documentation portalsThe UNESCO Free Software portal: A selection of Free/Libre and Open Source Software for Mac OS X:
O'Reilly: the respected and FLOSS-friendly technical publisher Desktop integration:The Ximian Desktop: a internationalization-aware enterprise Desktop based on GNOME technologies The Desktop Linux Consortium: a non profit organization dedicated to the advancement of the use of Linux on the desktop The Sun Desktop project: Some related entries in the Google/DMOZ directory:
Language resources:
Open hardwareThe open hardware definition: The Simputer: an open device designed to work with non-roman scripts LinuxDevices: Note The fine print: All copylefts, copyrights and trademarks mentioned here are owned by their respective owners.
Note: the opinions expressed in submitted contributions below do not necessarily reflect the opinions of our website. © 2003-2013 SIL International, all rights
reserved, unless otherwise noted elsewhere on this page. |