This site is the archived OWASP Foundation Wiki and is no longer accepting Account Requests.
To view the new OWASP Foundation website, please visit

Category:OWASP Learn About Encoding Project

Revision as of 16:37, 31 March 2009 by Federico.casani (talk | contribs)

Jump to: navigation, search

Click here to see (& edit, if wanted) the project's template.

Project Name OWASP Learn About Encoding Project
Short Project Description

This project has as its ultimate goal of demystifying the problems related to the study of character encoding (charset encoding). From charset's proper use to the issue of canonicalization, we'll try to explain and resolve the problems related to this issue so dear to professionals in the ICT world. The project consist of: a web application that explain the character life cycle and a usable textual tool and GUI tool.

Key Project Information

Project Leader
Federico Casani
Andrea Zonzin

Project Contibutors
(if any)

Mailing List
Subscribe here
Use here

Creative Commons Attribution Share Alike 3.0

Project Type

add link(s)

Release Status Main Links Related Projects

Apha Quality
Please see here for complete information.

Blog if any, add link(s)


Starting with projects such as overtime

The "OWASP Learn About Encoding Project" has not discovered anything new, but rather wants to emphasize the importance of input sanitize and output escaping. In the network there are often errors in the visualization of pages: you see question marks (?) where it should be accented letters, there are strange characters (i.e. A+tilde, A+umlauts) where this should be the "euro" character, and so way. Not only that: but there are communication channels that allow the exchange of characters not properly controlled: i.e. sms messages, chat messages, voip client, ecc.. often contain values are not consistent.

The use of proper Charset is essential for

  • integrity of the data
  • the prevention of the problem of Canonicalization


This is a project that aims to educate developers, systems analysts or anyone who writes code regarding the knowledge of proper use of Charset and Canonicalization. The project will seek to give a comprehensive response by crossing one another most scenarios highlighting the roles of key players (browser, operating system, database, etc. ..).

To achieve this goal we decided to create a tool in two different formats:

  • web application
  • shell tool


Detailed roadmap for future developments:

01/03/09 : Startup

01/03/09 - 15/03/09 : Project Goal Definition

16/03/09 - 31/03/09 : Project Architecture Definition

01/04/09 - 31/06/09 : Code Development

01/07/09 : Alpha release

05/07/09 - 30/07/09 : Bug Fixing

01/08/09 - 30/10/09 : Project Development - enhancement, new feature

01/11/09 : Beta release

02/11/09 - 30/11/09 : Bug Fixing


This category currently contains no pages or media.