A script

ivy1234 · ‎03-21-2011

I have a file like the one below:

aa,bb,cc
aa,cc,dd
cc,dd,ee
ff,zz
dd,aa

The file has many lines. The items are separated by "," and may be duplicated. I want to clean up the contents of the file so that:

1) if an item is duplicated, it is output only once
2) each item in the result is on its own line.

so my desired output is

aa
bb
cc
dd
ee
ff
zz

Can you advise what I can do? Thanks.

Raj D. · ‎03-21-2011

ivy,

check this out:

$ cat file | tr "," "\n" | uniq -u

Hth,
Raj.

" If u think u can , If u think u cannot , - You are always Right . "

ivy1234 · ‎03-21-2011

Thanks,

but it does not handle the duplicate case; the output still contains duplicates.

Can you advise what I can do so that duplicated data is output only once? Thanks.

ivy1234 · ‎03-21-2011

Thanks.

The "| uniq -u" does not seem to work in this case.

Raj D. · ‎03-22-2011

ivy,
You can use uniq -c and cut the numeric first field,

$ cat file | tr "," "\n" | uniq -c | cut -c 1-2

I can't check it now, as I don't have a system at hand.
Hth,
Raj.


Steven Schweda · ‎03-22-2011

man uniq
man sort

alp$ < 1474162.txt tr ',' '\n' | sort | uniq
aa
bb
cc
dd
ee
ff
zz
alp$

Try it first without the "| uniq".
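
To see what each stage contributes, the pipeline can be built up one step at a time. This is only a sketch: it assumes the sample data from the question, written to a temporary file, and the variable names stage1, stage2 and stage3 are made up for illustration.

```shell
#!/bin/sh
# Scratch copy of the sample data from the question.
sample=$(mktemp)
printf '%s\n' 'aa,bb,cc' 'aa,cc,dd' 'cc,dd,ee' 'ff,zz' 'dd,aa' > "$sample"

# Stage 1: one item per line, still in file order, duplicates scattered.
stage1=$(tr ',' '\n' < "$sample")

# Stage 2: sorted, so identical items are now next to each other.
stage2=$(printf '%s\n' "$stage1" | sort)

# Stage 3: uniq collapses each run of adjacent identical lines into one.
stage3=$(printf '%s\n' "$stage2" | uniq)

printf '%s\n' "$stage3"
rm -f "$sample"
```

Printing stage2 instead of stage3 shows why the sort matters: the repeated items only become neighbours after sorting.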

Hein van den Heuvel · ‎03-22-2011

For 'uniq' to work, the stream has to be sorted first.

Man...
"DESCRIPTION
Discard all but one of successive identical lines from INPUT"

Here the solution with "tr | sort | uniq" probably works just fine.

For a modest dataset you may also want to play with Perl, which allows for trickier splitting, parsing, counting and printing.

In this simple example we set a hash entry for each word found and at the end ( eskimo kiss: }{ :-) print all the keys thus established:

$ perl -lne '$x{$_}=1 for split /,/ } { print for (sort keys %x) ' x.txt
aa
bb
cc
dd
ee
ff
zz

fwiw,
Hein

Raj D. · ‎03-22-2011

Ivy,
Here you go with awk,
# cat file | tr "," "\n" | awk '!x[$0]++'

aa
bb
cc
dd
ee
ff
zz
#
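
The same first-seen filter can also be written without tr by letting awk split on the commas itself. A rough sketch, assuming the sample data from the question; the function name unique_in_order and the array name seen are hypothetical, chosen only for this example.

```shell
#!/bin/sh
# Print each comma-separated item the first time it is seen, in input order.
# Reads the file named in $1; "seen" counts how often each item has occurred.
unique_in_order() {
    awk -F, '{ for (i = 1; i <= NF; i++) if (!seen[$i]++) print $i }' "$1"
}

# Scratch copy of the sample data from the question.
sample=$(mktemp)
printf '%s\n' 'aa,bb,cc' 'aa,cc,dd' 'cc,dd,ee' 'ff,zz' 'dd,aa' > "$sample"

result=$(unique_in_order "$sample")
printf '%s\n' "$result"
rm -f "$sample"
```

Unlike sort | uniq, this keeps the items in order of first appearance; for this sample that order happens to match the sorted order. It also holds every distinct item in memory, which should be fine for a modest file.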

Enjoy, Have fun! Remember to assign points to all posts,
Raj.


Mel Burslan · ‎03-22-2011

Raj is still missing the point. uniq only detects duplicates on consecutive lines.

aa
bb
aa
bb
bb

run through uniq, will generate:

aa
bb
aa
bb

not

aa
bb

the one liner should be something like this:

cat file | tr "," "\n" | sort | uniq
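
The difference can be reproduced with the five lines above. A small sketch; the variable names are made up, and the uniq -u case is included because it was the first suggestion in this thread.

```shell
#!/bin/sh
# The five unsorted lines from the example above.
input=$(printf '%s\n' aa bb aa bb bb)

# Plain uniq: only the adjacent "bb bb" pair is collapsed.
plain=$(printf '%s\n' "$input" | uniq)

# uniq -u: prints only lines that are NOT part of an adjacent repeated run,
# so the "bb bb" run is dropped entirely.
only_unrepeated=$(printf '%s\n' "$input" | uniq -u)

# sort first, then uniq: every duplicate is adjacent, so all of them collapse.
sorted_unique=$(printf '%s\n' "$input" | sort | uniq)

printf 'uniq:        %s\n' "$(printf '%s' "$plain" | tr '\n' ' ')"
printf 'uniq -u:     %s\n' "$(printf '%s' "$only_unrepeated" | tr '\n' ' ')"
printf 'sort | uniq: %s\n' "$(printf '%s' "$sorted_unique" | tr '\n' ' ')"
```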

Hope this helps

________________________________
UNIX because I majored in cryptology...

Steven Schweda · ‎03-22-2011

> cat file | tr "," "\n" | sort | uniq

Geez. Why didn't _I_ think of that. No,
wait...

And my version lacked the (much hated) "cat".
And, when picoseconds count, I figure that
'x' should be faster than "x" -- no looking
for dollar signs in 'x'.

Dennis Handly · ‎03-22-2011

>Steven: ... | sort | uniq

You can optimize this by using "sort -u".
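
A quick way to check that the two forms agree on a given file is to compare their output. This is a sketch only, assuming the sample data from the question and the default whole-line comparison (no -k, -f or similar sort options); the variable names are hypothetical.

```shell
#!/bin/sh
# Scratch copy of the sample data from the question.
sample=$(mktemp)
printf '%s\n' 'aa,bb,cc' 'aa,cc,dd' 'cc,dd,ee' 'ff,zz' 'dd,aa' > "$sample"

# Two processes after tr: sort, then uniq.
with_uniq=$(tr ',' '\n' < "$sample" | sort | uniq)

# One process after tr: sort drops the duplicates itself.
with_sort_u=$(tr ',' '\n' < "$sample" | sort -u)

if [ "$with_uniq" = "$with_sort_u" ]; then
    echo "same output"
else
    echo "outputs differ"
fi
rm -f "$sample"
```

If a count per item is wanted, a separate uniq -c after sort is still needed, since sort -u only drops the duplicates.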

Raj D. · ‎03-22-2011

Mel, thanks, it's a good thing I learned this about uniq:
it only senses duplicates on consecutive lines. I sometimes used to wonder why uniq was not working properly; now it makes sense. Thank you.


Arturo Galbiati · ‎03-23-2011

cat file | tr ',' '\n'|sort -u
HTH,
Art

Raj D. · ‎03-23-2011

# cat file|tr "," "\n" | awk '!x[$0]++' #Enjoy!.

