English 中文(简体)
grep a tab in UNIX
原标题:
  • 时间:2009-12-01 11:23:13
  •  标签:
  • unix
  • grep

How do I grep tab ( ) in files on the Unix platform?

问题回答

If using GNU grep, you can use the Perl-style regexp:

grep -P  	  *

The trick is to use $ sign before single quotes. It also works for cut and other tools.

grep $ 	  sample.txt

I never managed to make the metacharacter work with grep. However I found two alternate solutions:

  1. Using <Ctrl-V> <TAB> (hitting Ctrl-V then typing tab)
  2. Using awk: foo | awk / /

From this answer on Ask Ubuntu:

Tell grep to use the regular expressions as defined by Perl (Perl has as tab):

grep -P "	" <file name>

Use the literal tab character:

grep "^V<tab>" <filename>

Use printf to print a tab character for you:

grep "$(printf  	 )" <filename>

One way is (this is with Bash)

grep -P  	 

-P turns on Perl regular expressions so will work.

As user unwind says, it may be specific to GNU grep. The alternative is to literally insert a tab in there if the shell, editor or terminal will allow it.

Another way of inserting the tab literally inside the expression is using the lesser-known $ quotation in Bash:

grep $ foo	bar         # matches eg.  foo<tab>bar 

(Note that if you re matching for fixed strings you can use this with -F mode.)

Sometimes using variables can make the notation a bit more readable and manageable:

tab=$ 	                # `tab=$(printf  	 )` in POSIX
id= [[:digit:]]+ 
name= [[:alpha:]_][[:alnum:]_-]* 
grep "$name$tab$id"     # matches eg. `bob2<tab>323`

There are basically two ways to address it:

  1. (Recommended) Use regular expression syntax supported by grep(1). Modern grep(1) supports two forms of POSIX 1003.2 regex syntax: basic (obsolete) REs, and modern REs. Syntax is described in details on re_format(7) and regex(7) man pages which are part of BSD and Linux systems respectively. The GNU grep(1) also supports Perl-compatible REs as provided by the pcre(3) library.

    In regex language the tab symbol is usually encoded by atom. The atom is supported by BSD extended regular expressions (egrep, grep -E on BSD compatible system), as well as Perl-compatible REs (pcregrep, GNU grep -P).

    Both basic regular expressions and Linux extended REs apparently have no support for the . Please consult UNIX utility man page to know which regex language it supports (hence the difference between sed(1), awk(1), and pcregrep(1) regular expressions).

    Therefore, on Linux:

    $ grep -P  	  FILE ...
    

    On BSD alike system:

    $ egrep  	  FILE ...
    $ grep -E  	  FILE ...
    
  2. Pass the tab character into pattern. This is straightforward when you edit a script file:

    # no tabs for Python please!
    grep -q       *.py && exit 1
    

    However, when working in an interactive shell you may need to rely on shell and terminal capabilities to type the proper symbol into the line. On most terminals this can be done through Ctrl+V key combination which instructs terminal to treat the next input character literally (the V is for "verbatim"):

    $ grep  <Ctrl>+<V><TAB>  FILE ...
    

    Some shells may offer advanced support for command typesetting. Such, in bash(1) words of the form $ string are treated specially:

    bash$ grep $ 	  FILE ...
    

    Please note though, while being nice in a command line this may produce compatibility issues when the script will be moved to another platform. Also, be careful with quotes when using the specials, please consult bash(1) for details.

    For Bourne shell (and not only) the same behaviour may be emulated using command substitution augmented by printf(1) to construct proper regex:

    $ grep "`printf  	 `" FILE ...
    

Use echo to insert the tab for you grep "$(echo -e \t)"

grep "$(printf )" worked for me on Mac OS X

A good choice is to use sed.

sed -n  /	/p  file

Examples (works in bash, sh, ksh, csh,..):

[~]$ cat testfile
12 3
1 4 abc
xa      c
        a       c2
1 23

[~]$ sed -n  /	/p  testfile 
xa      c
        a       c2
[~]$ sed -n  /	a	/p  testfile
        a       c2

(This answer has been edited following suggestions in comments. Thank you all)

use gawk, set the field delimiter to tab ( ) and check for number of fields. If more than 1, then there is/are tabs

awk -F"	"  NF>1  file

+1 way, that works in ksh, dash, etc: use printf to insert TAB:

grep "$(printf  BEGIN	END )" testfile.txt

On ksh I used

grep "[^I]" testfile

The answer is simpler. Write your grep and within the quote type the tab key, it works well at least in ksh

grep "  " *

Using the sed-as-grep method, but replacing the tabs with a visible character of personal preference is my favourite method, as it clearly shows both which files contain the requested info, and also where it is placed within lines:

sed -n  s/	/****/g  file_name

If you wish to make use of line/file info, or other grep options, but also want to see the visible replacement for the tab character, you can achieve this by

grep -[options] -P  	  file_name | sed  s/	/****/g 

As an example:

$ echo "A	B
foo	bar" > test
$ grep -inH -P  	  test | sed  s/	/****/g 
test:1:A****B
test:2:foo****bar

EDIT: Obviously the above is only useful for viewing file contents to locate tabs --- if the objective is to handle tabs as part of a larger scripting session, this doesn t serve any useful purpose.

This works well for AIX. I am searching for lines containing JOINED< >ACTIVE

voradmin cluster status | grep  JOINED$ 	 ACTIVE

 vorudb201   1       MEMBER(g) JOINED        ACTIVE
*vorucaf01   2       SECONDARY JOINED        ACTIVE

You might want to use grep "$(echo -e )"

Only requirement is echo to be capable of interpretation of backslash escapes.

These alternative binary identification methods are totally functional. And, I really like the one s using awk, as I couldn t quite remember the syntaxic use with single binary chars. However, it should also be possible to assign a shell variable a value in a POSIX portable fashion (i.e. TAB=echo "@" | tr "100" "11"), and then employ it from there everywhere, in a POSIX portable fashion; as well (i.e grep "$TAB" filename). While this solution works well with TAB, it will also work well other binary chars, when another desired binary value is used in the assignment (instead of the value for the TAB character to tr ).

The $ notation given in other answers is shell-specific -- it seems to work in bash and zsh but is not universal.

NOTE: The following is for the fish shell and does not work in bash:

In the fish shell, one can use an unquoted , for example:

grep 	 foo.txt

Or one can use the hex or unicode notations e.g.:

grep X09 foo.txt
grep U0009 foo.txt

(these notations are useful for more esoteric characters)

Since these values must be unquoted, one can combine quoted and unquoted values by concatenation:

grep "foo"	"bar"

You can also use a Perl one-liner instead of grep resp. grep -P:

perl -ne  print if /	/  FILENAME




相关问题
Really strange grep 2.5.1 bug in cat d reading long lines

Recently a peer and I discovered an interesting bug in GNU grep 2.5.1 in which standard input with lines greater than 200,000,000 characters causes grep to fail, even if the pattern is not in one of ...

grep a tab in UNIX

How do I grep tab ( ) in files on the Unix platform?

how to grep a variable in the shell program? [duplicate]

#!/bin/bash for ((var=0; var<20; var++)) do echo " Number is: $(grep Multiple_Frame = echo **$var** 20mrf.txt | wc -l)" >>statisic.txt done This shell program cannot produce correct ...

GREP - finding all occurrences of a string

I am tasked with white labeling an application so that it contains no references to our company, website, etc. The problem I am running into is that I have many different patterns to look for and ...

Grep doesn t work correctly with .as files

Here s the statement I m running: grep -i -H ConfigureControls *.as Note that I m forcing file names with the -H flag. What I get back is: } } trac} } this.chairControls.debug....

热门标签