The Curly Brace: Querying Extended ASCII Characters in SQL Server

22 June 2010

Querying Extended ASCII Characters in SQL Server

Part of a project requires conversion of ADABAS to SQL Server. ADABAS hearkens from the day when storage space was a precious commodity, so the "Packed" data types were invented. These compress the values stored in the column, to maximize storage utilization.

When converting packed ADABAS File fields to SQL Server relational table columns, some of the packed data was not unpacked(?) correctly, resulting in some interesting characters appearing in SQL Server. The entire data content of a field needs not be packed; ADABAS allows you to leave the first N characters unpacked, and then pack the remainder of the field, and other such options.

It was my job to find all records across the entire database (we're talking millions of records per table) that contain ASCII characters that do not appear on a standard, 108 key, US English, QWERTY keyboard. Constructing a query that iterates through all tables and columns that are varchar data type is easy. However, the SQL Management Studio query editors don't display extended ASCII characters.

The solution was pretty simple. Cast a byte value to a character type, to specify the extended character ranges.

SELECT RecordID
FROM MyTable
WHERE ((patindex('%[' + char(0) + '-' + char(31) + ']%', ColumnName COLLATE Latin1_General_BIN2) &lt;> 0)
      OR (patindex('%[' + char(127) + '-' + char(255) + ']%', ColumnName COLLATE Latin1_General_BIN2) &lt;> 0))

This selects records from the table where the number of extended ASCII (key codes 0-31, and 127-255) characters in a specific column is not 0.

13 comments:

Anonymous26 July, 2012 13:17
Works great and processes very quickly too
ReplyDelete
Replies
Omega Degalo Handra14 July, 2014 11:40
Fantastic. Just what I needed.
ReplyDelete
Replies
Unknown04 September, 2014 07:53
could you please help me, as i am trying to achieve the same using oracle SQL
ReplyDelete
Replies
Unknown04 September, 2014 08:21
Also once we find out the rows satisfying the condition, i need to replace it with # character
ReplyDelete
Replies
Kiran D S05 September, 2014 02:59
Hi all,

i'm new to SQL and working on SQL Server.

i'm struck in getting the logic for selecting records from the table where there are extended ASCII (key codes 0-31, and 127-255) characters in a specific column and replacing all the extended ASCII (key codes 0-31, and 127-255) characters with '#' character.

from above code, i am getting the rows which have extended ASCII (key codes 0-31, and 127-255) characters in a specific column.
i need an another column, to replace the same.

Could you please help in the task.

Thanks in advance.
ReplyDelete
Replies
Anonymous03 August, 2015 11:02
This is exactly what I needed. Thank You VERY MUCH!
ReplyDelete
Replies

Add comment

Please provide details, when posting technical comments. If you find an error in sample code or have found bad information/misinformation in a post, please e-mail me details, so I can make corrections as quickly as possible.

	Subscribe to ATOM
	Subscribe to RSS
	Follow me on Twitter
	Follow me on Linked In

Pages

22 June 2010

Querying Extended ASCII Characters in SQL Server

13 comments: