- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - Linux
- >
- Re: counting the number of times a word appears in...
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Forums
Discussions
Discussions
Discussions
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-26-2005 03:51 AM
09-26-2005 03:51 AM
Solved! Go to Solution.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-26-2005 03:54 AM
09-26-2005 03:54 AM
			
				
					
						
							Re: counting the number of times a word appears in a file
						
					
					
				
			
		
	
			
	
	
	
	
	
ought to do it.
Pete
Pete
- Tags:
- wc
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-26-2005 03:57 AM
09-26-2005 03:57 AM
			
				
					
						
							Re: counting the number of times a word appears in a file
						
					
					
				
			
		
	
			
	
	
	
	
	
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-26-2005 04:04 AM
09-26-2005 04:04 AM
			
				
					
						
							Re: counting the number of times a word appears in a file
						
					
					
				
			
		
	
			
	
	
	
	
	
Thanks,
BL.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-26-2005 04:05 AM
09-26-2005 04:05 AM
			
				
					
						
							Re: counting the number of times a word appears in a file
						
					
					
				
			
		
	
			
	
	
	
	
	
ARGHH! You're right of course. I remember a similar question a month or so ago but I can't find it at the moment and don't remember what the answer was.
Pete
Pete
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-26-2005 04:11 AM
09-26-2005 04:11 AM
			
				
					
						
							Re: counting the number of times a word appears in a file
						
					
					
				
			
		
	
			
	
	
	
	
	
You can still use grep & wc -w
Save the grepped lines in a tmp file
Then just read through that file in a loop and increment a counter from the wc -w output.
Should work.
Rgds,
Jeff
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-26-2005 04:19 AM
09-26-2005 04:19 AM
			
				
					
						
							Re: counting the number of times a word appears in a file
						
					
					
				
			
		
	
			
	
	
	
	
	
Jeff I do not understand your response, even if I throw all the lines with either smtp | SMTP into an output file I will still have all the other words (mail stuff that would come up in the wc command.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-26-2005 04:19 AM
09-26-2005 04:19 AM
			
				
					
						
							Re: counting the number of times a word appears in a file
						
					
					
				
			
		
	
			
	
	
	
	
	
cat file | awk '{
for ( i=1; i<=NF; i++)
num[$i]++;
}
END {
for ( word in num )
print word, num[word];
}' | grep -i smtp
- Tags:
- awk
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-26-2005 04:24 AM
09-26-2005 04:24 AM
			
				
					
						
							Re: counting the number of times a word appears in a file
						
					
					
				
			
		
	
			
	
	
	
	
	
cat file | tr "[:upper:]" "[:lower:]" |
tr -cs "[a-z0-9']" "\012" | sort |
uniq -c | sort +0nr +1d
convert all uppercase to lowercase
replace all characters not a-z0-9' with a new line. that means one word per line
sort because uniq expects sorted input
uniq counts the number of times each word appears then sort first from most to least frequent then alphabetically
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-26-2005 04:27 AM
09-26-2005 04:27 AM
			
				
					
						
							Re: counting the number of times a word appears in a file
						
					
					
				
			
		
	
			
	
	
	
	
	
exactly as typed, including the '\' !!
# sed -e 's/smtp/smtp\
/g' your_file | grep -i "smtp" | wc -l
maybe use tr in there to convert SMTP to smtp?
changes "smtp" into "smtp\n" so each smtp is on a separate line, then uses grep/wc to count them. Just a thought.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-26-2005 04:42 AM
09-26-2005 04:42 AM
			
				
					
						
							Re: counting the number of times a word appears in a file
						
					
					
				
			
		
	
			
	
	
	
	
	
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-26-2005 04:43 AM
09-26-2005 04:43 AM
			
				
					
						
							Re: counting the number of times a word appears in a file
						
					
					
				
			
		
	
			
	
	
	
	
	
Test file with 3 lines
this is a test of the smtp; but the SMTP would %smtp smtp SMTP& work
helo smtp
cat /tmp/test | tr "[:upper:]" "[:lower:]" | tr -cs "[a-z0-9]" "\012" |sort | uniq -c | sort +0nr +1d
+ cat /tmp/test
+ tr [:upper:] [:lower:]
+ tr -cs [a-z0-9] \012
+ sort
+ sort +0nr +1d
+ uniq -c
1 helo smtp
1 this is a test of the smtp; but the smtp
1 would %smtp smtp smtp& work
With your awk scrip i get the following error good ole bail out
Tahoe: /tmp ./t2.sh
#!/bin/ksh -xv
cat /tmp/test |awk '{
for (i=1, i<=NF;i++)
num[$i]++;
}
END{
for ( word in num )
print word, num[word];
}'|grep -i smtp
+ cat /tmp/test
+ awk {
for (i=1, i<=NF;i++)
num[$i]++;
}
END{
for ( word in num )
print word, num[word];
}
+ grep -i smtp
syntax error The source line is 2.
The error context is
for >>> (i=1, <<<
awk: The statement cannot be correctly parsed.
The source line is 2.
syntax error The source line is 2.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-26-2005 04:46 AM
- Tags:
- Perl
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-26-2005 04:53 AM
09-26-2005 04:53 AM
			
				
					
						
							Re: counting the number of times a word appears in a file
						
					
					
				
			
		
	
			
	
	
	
	
	
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-26-2005 04:54 AM
09-26-2005 04:54 AM
			
				
					
						
							Re: counting the number of times a word appears in a file
						
					
					
				
			
		
	
			
	
	
	
	
	
should be
| tr -cs "[a-z0-9]" "\012*"
star after \012
and
The error context is
for >>> (i=1, <<<
awk: The statement cannot be correctly parsed.
i=1, < no comma, it is a semi colon ";"
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-26-2005 04:58 AM
09-26-2005 04:58 AM
			
				
					
						
							Re: counting the number of times a word appears in a file
						
					
					
				
			
		
	
			
	
	
	
	
	
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-26-2005 05:11 AM
09-26-2005 05:11 AM
			
				
					
						
							Re: counting the number of times a word appears in a file
						
					
					
				
			
		
	
			
	
	
	
	
	
To improve thins a bit, I'd amend my expression thusly:
# perl -lne '$count++ while (m/\bsmtp\b/ig);END{print $count}' logfile
This eliminates counting the string "smtp" (in any case or mixture) *within* the bounds of another string. That is, this ould *not* count "this_was_from_smtp".
Regards!
...JRF...
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-26-2005 07:43 PM
09-26-2005 07:43 PM
			
				
					
						
							Re: counting the number of times a word appears in a file
						
					
					
				
			
		
	
			
	
	
	
	
	
Within awk set your field separator to the word smtp or SMTP and then count its occurence within the input file as:
# awk -F"smtp|SMTP" 'BEGIN{cnt=0} {cnt+=(NF-1)} END{print cnt}' input_file
cheers!
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-26-2005 10:21 PM
09-26-2005 10:21 PM
			
				
					
						
							Re: counting the number of times a word appears in a file
						
					
					
				
			
		
	
			
	
	
	
	
	
# awk -F"[sS][mM][tT][pP]" 'BEGIN{cnt=0} {cnt+=(NF-1)} END{print cnt}' input_file
cheers!
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-27-2005 01:54 AM
09-27-2005 01:54 AM
			
				
					
						
							Re: counting the number of times a word appears in a file
						
					
					
				
			
		
	
			
	
	
	
	
	
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
09-27-2005 06:40 AM
09-27-2005 06:40 AM
			
				
					
						
							Re: counting the number of times a word appears in a file
						
					
					
				
			
		
	
			
	
	
	
	
	
If "cnt" is declared in the BEGIN section of the awk construct then I don't know why it needs to be set equal to 1 and if it's defined in the ACTION section, then every new line read would clobber its contents and reset it back to 1. Could you share the awk construct you used to get the correct result?
Maybe I'm not able to understand your requirements.
thx
