<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Remove Special Characters in Operating System - HP-UX</title>
    <link>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843805#M777076</link>
    <description>Generally a rigorous definition of the problem will almost by default lead to a solution. For example, when you state ASCII you seem to be implying printable ASCII but you should note that the ASCII character set includes many non-printable characters such as BEL, ESC, FF, LF, CR, HT, VT -- in addition to punctuation characters. You should really state what you want (or don't want) and then the solution becomes obvious.&lt;BR /&gt;If the answer to the last question is "I don't know" then tyou have more work to do. This lack of precision in your dataset definition may have well contributed to this situation with your input file -- or you may have completely defined the dataset and your instructions were ignored.</description>
    <pubDate>Tue, 15 Aug 2006 16:06:00 GMT</pubDate>
    <dc:creator>A. Clay Stephenson</dc:creator>
    <dc:date>2006-08-15T16:06:00Z</dc:date>
    <item>
      <title>Remove Special Characters</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843797#M777068</link>
      <description>Hi,&lt;BR /&gt;&lt;BR /&gt;We receive data file from various sources , which we load in our Datamart using shell scripts  / SQL Loader. We requested source system to send data files WITHOUT any special characters (! @ # ~ etc..)  in data. But in some instances we do receive with spl.characters. &lt;BR /&gt;&lt;BR /&gt;In SQL Loader we use ~ as delimiter and this time data came in with ~ in data and messed up the load.&lt;BR /&gt;&lt;BR /&gt;How do i check for special characters and remove them from data file before the data load ?  Data files will be huge in size (700 - 800 MB) , someitmes touches iGB. &lt;BR /&gt;&lt;BR /&gt;1) What is the best method to solve this problem.&lt;BR /&gt;2) how to identify all set of spl characters and removes them ? &lt;BR /&gt;&lt;BR /&gt;Please help. &lt;BR /&gt;&lt;BR /&gt;Note: Shell script OR perl script solution preferred.&lt;BR /&gt;&lt;BR /&gt;Thanks</description>
      <pubDate>Tue, 15 Aug 2006 13:18:47 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843797#M777068</guid>
      <dc:creator>uform</dc:creator>
      <dc:date>2006-08-15T13:18:47Z</dc:date>
    </item>
    <item>
      <title>Re: Remove Special Characters</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843798#M777069</link>
      <description>I would suggest using perl to read the file and write to a new one.  Perl could be used to remove the ~ from each line as it is writing it to the new file.  You can then rename the new file, if needed.  I think this will do the trick for you:&lt;BR /&gt;&lt;BR /&gt;#!/usr/bin/perl&lt;BR /&gt;&lt;BR /&gt;open(TEXTDUMP,"myfile.txt");&lt;BR /&gt;open(SANITIZED,"&amp;gt;mynewfile.txt");&lt;BR /&gt;&lt;BR /&gt;while( &lt;TEXTDUMP&gt; ) {&lt;BR /&gt; &lt;BR /&gt; chomp($_);&lt;BR /&gt; $line = $_ &lt;BR /&gt;&lt;BR /&gt; $line =~ s/~//g;&lt;BR /&gt; print SANITIZED "$line\n";&lt;BR /&gt;}&lt;BR /&gt;&lt;BR /&gt;close(SANITIZED);&lt;BR /&gt;close(TEXTDUMP);&lt;/TEXTDUMP&gt;</description>
      <pubDate>Tue, 15 Aug 2006 13:25:42 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843798#M777069</guid>
      <dc:creator>Sean Dale</dc:creator>
      <dc:date>2006-08-15T13:25:42Z</dc:date>
    </item>
    <item>
      <title>Re: Remove Special Characters</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843799#M777070</link>
      <description>I would use tr to strip the characters. In this example I am allowing upper and lowercase alpha + numerics + ASCII Linefeeds (Octal 012) and nothing else.&lt;BR /&gt;&lt;BR /&gt;tr -cd "[A-Za-z0-9\012]" &amp;lt; infile &amp;gt; outfile&lt;BR /&gt;&lt;BR /&gt;Note that the 'c' tr option complements so the net effect when combined with the 'd' (delete) option is to delete any characters not explicitly listed in the set.</description>
      <pubDate>Tue, 15 Aug 2006 13:29:35 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843799#M777070</guid>
      <dc:creator>A. Clay Stephenson</dc:creator>
      <dc:date>2006-08-15T13:29:35Z</dc:date>
    </item>
    <item>
      <title>Re: Remove Special Characters</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843800#M777071</link>
      <description>Shalom,&lt;BR /&gt;&lt;BR /&gt;What special characters to strip depends on what special characters cause problems.&lt;BR /&gt;&lt;BR /&gt;The best method to solve the problem is the one you tried, have correct data submitted or don't process it.&lt;BR /&gt;&lt;BR /&gt;If you think about it any website asking you to set a password can prevent input of characters that cause problems.&lt;BR /&gt;&lt;BR /&gt;That being said, your best bet on such large files is a perl or other shell script.&lt;BR /&gt;&lt;BR /&gt;You have to read the data form one file and write it to another.&lt;BR /&gt;&lt;BR /&gt;You might also want to see if there is an upgrade for your sql loader.&lt;BR /&gt;&lt;BR /&gt;Good Luck,&lt;BR /&gt;&lt;BR /&gt;SEP</description>
      <pubDate>Tue, 15 Aug 2006 13:32:52 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843800#M777071</guid>
      <dc:creator>Steven E. Protter</dc:creator>
      <dc:date>2006-08-15T13:32:52Z</dc:date>
    </item>
    <item>
      <title>Re: Remove Special Characters</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843801#M777072</link>
      <description>If you truly want to delete the special characters then Clay's solution is the best one. However, if these special characters define fields, you may want to change tr to substitute a space or other character for the special char. tr can delete or substitute.</description>
      <pubDate>Tue, 15 Aug 2006 15:11:19 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843801#M777072</guid>
      <dc:creator>Bill Hassell</dc:creator>
      <dc:date>2006-08-15T15:11:19Z</dc:date>
    </item>
    <item>
      <title>Re: Remove Special Characters</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843802#M777073</link>
      <description>ok , using tr command.. how to &lt;BR /&gt;just AVOID all ascii characters&lt;BR /&gt;instead on mentioning allowable character set.  because then i may need to include &lt;BR /&gt;. , - etc..&lt;BR /&gt;&lt;BR /&gt;i also want to know how can i get list of &lt;BR /&gt;all spl characters list.</description>
      <pubDate>Tue, 15 Aug 2006 15:31:22 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843802#M777073</guid>
      <dc:creator>uform</dc:creator>
      <dc:date>2006-08-15T15:31:22Z</dc:date>
    </item>
    <item>
      <title>Re: Remove Special Characters</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843803#M777074</link>
      <description>If I understand your last question then remove the complement option and you will see what you are filtering out:&lt;BR /&gt;&lt;BR /&gt;tr -d "[A-Za-z0-9\012]" &amp;lt; infile &amp;gt; outfile&lt;BR /&gt;&lt;BR /&gt;For readability, you may want to not delete the linefeeds so:&lt;BR /&gt;&lt;BR /&gt;tr -d "[A-Za-z0-9]" &amp;lt; infile &amp;gt; outfile&lt;BR /&gt;&lt;BR /&gt;You can use the output of this command to better tune the first version I listed.&lt;BR /&gt;</description>
      <pubDate>Tue, 15 Aug 2006 15:44:23 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843803#M777074</guid>
      <dc:creator>A. Clay Stephenson</dc:creator>
      <dc:date>2006-08-15T15:44:23Z</dc:date>
    </item>
    <item>
      <title>Re: Remove Special Characters</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843804#M777075</link>
      <description>How about pre-processing the input data loads before using SQL*Loader to do the bulk loads. Since you already know the field delimiter, search for its occurence within all the data files and remove/replace it with something other than a tilde ~. Afterwards you can insert the tilde as a field delimiter in order to provide SQL*Loader a method to differentiate the columns.&lt;BR /&gt;&lt;BR /&gt;I'm assuming that the data files come as is and you do some processing on your side in order to separate the fields within; which could be a totally wrong assumption.</description>
      <pubDate>Tue, 15 Aug 2006 15:57:25 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843804#M777075</guid>
      <dc:creator>Sandman!</dc:creator>
      <dc:date>2006-08-15T15:57:25Z</dc:date>
    </item>
    <item>
      <title>Re: Remove Special Characters</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843805#M777076</link>
      <description>Generally a rigorous definition of the problem will almost by default lead to a solution. For example, when you state ASCII you seem to be implying printable ASCII but you should note that the ASCII character set includes many non-printable characters such as BEL, ESC, FF, LF, CR, HT, VT -- in addition to punctuation characters. You should really state what you want (or don't want) and then the solution becomes obvious.&lt;BR /&gt;If the answer to the last question is "I don't know" then tyou have more work to do. This lack of precision in your dataset definition may have well contributed to this situation with your input file -- or you may have completely defined the dataset and your instructions were ignored.</description>
      <pubDate>Tue, 15 Aug 2006 16:06:00 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843805#M777076</guid>
      <dc:creator>A. Clay Stephenson</dc:creator>
      <dc:date>2006-08-15T16:06:00Z</dc:date>
    </item>
    <item>
      <title>Re: Remove Special Characters</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843806#M777077</link>
      <description>Thanks .  &lt;BR /&gt;&lt;BR /&gt;i could find a code which was in use years back in our project. later on for some reason they stopped using it.&lt;BR /&gt;&lt;BR /&gt;I'm not a perl programmer, I have no clue  what this does, but from the document i guess this does the same what we are discussing here. Also i see a note in the document saying ~ symbol is not handled using this code. How and where to include ~ symbol in this code , so that it get deleted&lt;BR /&gt;while running this script. &lt;BR /&gt;&lt;BR /&gt;Haa i see tr (atlast familar one :) ) &lt;BR /&gt;&lt;BR /&gt;more FileScan.pl&lt;BR /&gt;#!/usr/bin/perl -w&lt;BR /&gt;&lt;BR /&gt;#check the arguments and report error if invalid.&lt;BR /&gt;$numArgs = $#ARGV + 1;&lt;BR /&gt;$numArgs == 2 or  die("error usage: nr &lt;INPUT_FILE&gt; &lt;OUTPUT_FILE&gt;");&lt;BR /&gt;&lt;BR /&gt;#Open input file for reading&lt;BR /&gt;open INPUT_FILE, $ARGV[0] or die("Couldn't open input file: $ARGV[1] ");&lt;BR /&gt;&lt;BR /&gt;#Open Output file for writing&lt;BR /&gt;open OUTPUT_FILE, "&amp;gt;$ARGV[1]" or die("Couldn't open output file: $ARGV[2] ");&lt;BR /&gt;&lt;BR /&gt;# read one line at a time and write it back to the output file&lt;BR /&gt;# after replacing the non printable characters with spaces.&lt;BR /&gt;# Note: the contents of the line are held in $_ variable&lt;BR /&gt;while (&lt;INPUT_FILE&gt;) {&lt;BR /&gt;&lt;BR /&gt;tr [\176-\377] [\040-\040];&lt;BR /&gt;tr [\016-\037] [\040-\040];&lt;BR /&gt;tr [\001-\011] [\040-\040];&lt;BR /&gt;tr [\013-\014] [\040-\040];&lt;BR /&gt;print OUTPUT_FILE;&lt;BR /&gt;}&lt;/INPUT_FILE&gt;&lt;/OUTPUT_FILE&gt;&lt;/INPUT_FILE&gt;</description>
      <pubDate>Tue, 15 Aug 2006 16:28:07 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843806#M777077</guid>
      <dc:creator>uform</dc:creator>
      <dc:date>2006-08-15T16:28:07Z</dc:date>
    </item>
    <item>
      <title>Re: Remove Special Characters</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843807#M777078</link>
      <description>&amp;gt;Also i see a note in the document saying ~ symbol is not handled using this &amp;gt;code. How and where to include ~ symbol in this code , so that it get deleted &amp;gt;while running this script. &lt;BR /&gt;&lt;BR /&gt;The code you pasted does handle tilde (~) characters in the line: &lt;BR /&gt;&lt;BR /&gt;tr [\176-\377] [\040-\040];&lt;BR /&gt;&lt;BR /&gt;Note -&amp;gt; octal number \176 == ~&lt;BR /&gt;</description>
      <pubDate>Tue, 15 Aug 2006 16:46:51 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843807#M777078</guid>
      <dc:creator>Sandman!</dc:creator>
      <dc:date>2006-08-15T16:46:51Z</dc:date>
    </item>
    <item>
      <title>Re: Remove Special Characters</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843808#M777079</link>
      <description>The first place to start defining the desired characters (or undesired) is: man ascii&lt;BR /&gt; &lt;BR /&gt;That man page is really useful with both octal and hex versions of all the possible characters in ASCII (note that all 256 bit patterns are defined in ASCII). So in your code snipett:&lt;BR /&gt; &lt;BR /&gt;tr [\176-\377] [\040-\040];&lt;BR /&gt;tr [\016-\037] [\040-\040];&lt;BR /&gt;tr [\001-\011] [\040-\040];&lt;BR /&gt;tr [\013-\014] [\040-\040];&lt;BR /&gt; &lt;BR /&gt;176 through 377 will change ~ and any character with the 8th bit set to a space (040). The 016-137 takes care of codes from SO (shift out) to US (unit separator). 001-011 takes care of SOH (start of header) to HT (horizontal tab) and 013-014 are VT (vertical tab) and NP (new page, aka, formfeed). &lt;BR /&gt; &lt;BR /&gt;So your code does indeed replace (not delete) the unwanted characters with spaces -- a very important consideration.</description>
      <pubDate>Tue, 15 Aug 2006 16:52:52 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843808#M777079</guid>
      <dc:creator>Bill Hassell</dc:creator>
      <dc:date>2006-08-15T16:52:52Z</dc:date>
    </item>
    <item>
      <title>Re: Remove Special Characters</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843809#M777080</link>
      <description>And since \176 is a "~", it gets translated to a space as well.</description>
      <pubDate>Tue, 15 Aug 2006 17:33:05 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843809#M777080</guid>
      <dc:creator>A. Clay Stephenson</dc:creator>
      <dc:date>2006-08-15T17:33:05Z</dc:date>
    </item>
    <item>
      <title>Re: Remove Special Characters</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843810#M777081</link>
      <description>&lt;BR /&gt;tr [:punct:] &amp;lt; filename &amp;gt; converted&lt;BR /&gt;&lt;BR /&gt;will remove the characters listed in your example.&lt;BR /&gt;&lt;BR /&gt;tr [:punct:] "[ *]" &amp;lt; filename &amp;gt; converted&lt;BR /&gt;&lt;BR /&gt;will substitute the characters with space keeping a fixed field format.&lt;BR /&gt;</description>
      <pubDate>Tue, 15 Aug 2006 20:29:12 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843810#M777081</guid>
      <dc:creator>Rory R Hammond</dc:creator>
      <dc:date>2006-08-15T20:29:12Z</dc:date>
    </item>
    <item>
      <title>Re: Remove Special Characters</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843811#M777082</link>
      <description>Thanks everyone. OK I'm getting some idea about this perl script now. one more question though, &lt;BR /&gt;&lt;BR /&gt;man ascii  shows 3 sets of outputs&lt;BR /&gt;1) Octal - Character&lt;BR /&gt;2) Hexadecimal - Character&lt;BR /&gt;3) Decimal - Character&lt;BR /&gt;&lt;BR /&gt;i understand&lt;BR /&gt;176(oct)=126(Hex)=7E(Dec) all implies ' ~ '.&lt;BR /&gt;&lt;BR /&gt;But where in script its mentioned its octal comparison ?  And man ascii doesnt show &lt;BR /&gt;oct numbers &amp;gt; 177 , but in script i have till 377 , so what are the remaining characters ? &lt;BR /&gt;&lt;BR /&gt;Sorry if these are too basic questions abt perl scripting ...&lt;BR /&gt;&lt;BR /&gt;Thanks for help.&lt;BR /&gt;</description>
      <pubDate>Wed, 16 Aug 2006 09:35:30 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843811#M777082</guid>
      <dc:creator>uform</dc:creator>
      <dc:date>2006-08-16T09:35:30Z</dc:date>
    </item>
    <item>
      <title>Re: Remove Special Characters</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843812#M777083</link>
      <description>You are the one that mentioned you found a comment indicating that "~" was not handled but your Perl code definitely translates "~" \176 to "space" \040.&lt;BR /&gt;&lt;BR /&gt;The characters above 177 octal (127 decimal) have the 8th bit set and are undefined by ASCII. These characters are often used to display graphics (corners, lines, arrows, smiley faces, etc.) but the actual character that is displayed is entirely dependent upon the display device (terminal or printer) and the current character set. In any event, your program simply translates any character with the 8th bit set to 1 to a space.&lt;BR /&gt;</description>
      <pubDate>Wed, 16 Aug 2006 09:44:56 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843812#M777083</guid>
      <dc:creator>A. Clay Stephenson</dc:creator>
      <dc:date>2006-08-16T09:44:56Z</dc:date>
    </item>
    <item>
      <title>Re: Remove Special Characters</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843813#M777084</link>
      <description>Yes , i was the one mentioned ~ was not handled in this code (as per the document). But you people cleared &lt;BR /&gt;that out saying its handled.&lt;BR /&gt;&lt;BR /&gt;My question was , where in perl script its mentioned its a octal comparision ? meaning  if i just say    &lt;BR /&gt;&lt;BR /&gt;tr [\40] [\33];&lt;BR /&gt;&lt;BR /&gt;how can one find what comparison is this , as 40  and 33 are avbl both in  Hex and Decimal. &lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 16 Aug 2006 10:21:07 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843813#M777084</guid>
      <dc:creator>uform</dc:creator>
      <dc:date>2006-08-16T10:21:07Z</dc:date>
    </item>
    <item>
      <title>Re: Remove Special Characters</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843814#M777085</link>
      <description>&amp;gt; tr [\40] [\33];&lt;BR /&gt;&lt;BR /&gt;&amp;gt; how can one find what comparison is this , as 40 and 33 are avbl both in Hex and Decimal.&lt;BR /&gt; &lt;BR /&gt;This is not something unique to Perl but the tr command's of variables. If a number starts with 0 (zero) then the number is interpreted as octal and only the numbers 01234567 are valid. So you use the octal table. The convention of using \040 to represent octal 40 carries over from shell handling of numeric values. The man page for tr says:&lt;BR /&gt; &lt;BR /&gt;" The escape character \ can be used as in the shell to remove special&lt;BR /&gt;      meaning from any character in a string.  In addition, \ followed by 1,&lt;BR /&gt;      2, or 3 octal digits represents the character whose ASCII code is&lt;BR /&gt;      given by those digits."&lt;BR /&gt; &lt;BR /&gt;The square brackets tell tr to process the enclosed characters as a 'class' where the class can be one or more characters or a range of characters (like [a-z] or [0-9] or a class name such as [:alpha:] or [:upper:]. The man page helps a lot with the tr command (which is the same whether run from the command line or inside a Perl script).</description>
      <pubDate>Wed, 16 Aug 2006 12:04:12 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843814#M777085</guid>
      <dc:creator>Bill Hassell</dc:creator>
      <dc:date>2006-08-16T12:04:12Z</dc:date>
    </item>
    <item>
      <title>Re: Remove Special Characters</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843815#M777086</link>
      <description>Thanks everyone . Appreciate all your help.&lt;BR /&gt;one last question , i heard there is a size restriction using pearl script for this purposes.&lt;BR /&gt;&lt;BR /&gt;MEaning our input feed file could be as big as 1 GB , can this pearl script take this &lt;BR /&gt;file as input and give me an output file without special characters ? or shud i complicate it by split the input file and reconnect them later ? &lt;BR /&gt;&lt;BR /&gt;Thanks&lt;BR /&gt;</description>
      <pubDate>Fri, 18 Aug 2006 13:52:23 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843815#M777086</guid>
      <dc:creator>uform</dc:creator>
      <dc:date>2006-08-18T13:52:23Z</dc:date>
    </item>
    <item>
      <title>Re: Remove Special Characters</title>
      <link>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843816#M777087</link>
      <description>No, Perl will handle it in this case because you are not "slurping" the entire file but rather processing it one line at the time. "Slurping" (and that's a real technical term just like "magic cookies") means that you are reading the entire input file into an array and then processing it.</description>
      <pubDate>Fri, 18 Aug 2006 14:20:48 GMT</pubDate>
      <guid>https://community.hpe.com/t5/operating-system-hp-ux/remove-special-characters/m-p/3843816#M777087</guid>
      <dc:creator>A. Clay Stephenson</dc:creator>
      <dc:date>2006-08-18T14:20:48Z</dc:date>
    </item>
  </channel>
</rss>

