The awk command is mainly used in Linux for text processing, and is especially good at extracting, filtering and manipulating structured data by rows and columns. Its core uses include: 1. Extract specific column content, and divide fields with spaces or tabs by default. References such as $1, $2, etc., such as who | awk '{print $1}' can display the user name; 2. Filter row content by condition, such as awk '/error/ {print}' can find log rows containing "error", or filter specific column values through $3 == "404"; 3. Custom delimiters to process complex formats, specify input delimiters with -F parameter, such as awk -F: '{print $1, $6}' to process colon-delimited /etc/passwd files, and set output delimiters through OFS variables to beautify the results.
In Linux systems, the awk
command is mainly used for text processing, especially to extract, filter and manipulate data by rows and columns. It is especially suitable for processing structured text (such as log files, CSV files, etc.), allowing you to quickly crawl key information or generate reports.

Extract specific column content
This is one of the most common uses of awk
. By default, awk
will split each line into multiple fields by space or tab, and you can reference them by $1
and $2
.
For example, view the username of the currently logged in user:

who | awk '{print $1}'
This way, only the column of username can be displayed. You can also output multiple fields at the same time, such as:
who | awk '{print $1, $2}'
You can clearly see the user name and terminal device information.

Filter row content by condition
In addition to extracting columns, awk
also supports filtering logic similar to "if" statements. For example, if you want to find the line containing "error" in a log file:
awk '/error/ {print}' /var/log/syslog
Or more specific, such as finding rows whose third column is equal to a certain value:
awk '$3 == "404" {print}' access.log
This writing method is very suitable for log analysis or data cleaning.
Custom separators handle complex formats
Sometimes data is not separated by spaces, but commas, colons, or other symbols. At this time, you can use the -F
parameter to specify the separator.
For example, when processing /etc/passwd
files, it is separated by colons:
awk -F: '{print $1, $6}' /etc/passwd
This prints out the username and their home directory path. If you want to use other symbols to connect fields when output, you can set the output separator with OFS
variables:
awk -F: 'BEGIN{OFS=" -> "} {print $1, $6}' /etc/passwd
This looks clearer.
Let's summarize
-
awk
is suitable for processing structured text data - You can extract specific columns, filter by condition, and customize separators
- Very practical in system log analysis and data report generation
Basically all that is, it is not complicated but it is easy to ignore its power.
The above is the detailed content of what is the use of awk command in linux. For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undress AI Tool
Undress images for free

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

LXD is described as the next-generation container and virtual machine manager that offers an immersive for Linux systems running inside containers or as virtual machines. It provides images for an inordinate number of Linux distributions with support

When encountering DNS problems, first check the /etc/resolv.conf file to see if the correct nameserver is configured; secondly, you can manually add public DNS such as 8.8.8.8 for testing; then use nslookup and dig commands to verify whether DNS resolution is normal. If these tools are not installed, you can first install the dnsutils or bind-utils package; then check the systemd-resolved service status and configuration file /etc/systemd/resolved.conf, and set DNS and FallbackDNS as needed and restart the service; finally check the network interface status and firewall rules, confirm that port 53 is not

If you find that the server is running slowly or the memory usage is too high, you should check the cause before operating. First, you need to check the system resource usage, use top, htop, free-h, iostat, ss-antp and other commands to check CPU, memory, disk I/O and network connections; secondly, analyze specific process problems, and track the behavior of high-occupancy processes through tools such as ps, jstack, strace; then check logs and monitoring data, view OOM records, exception requests, slow queries and other clues; finally, targeted processing is carried out based on common reasons such as memory leaks, connection pool exhaustion, cache failure storms, and timing task conflicts, optimize code logic, set up a timeout retry mechanism, add current limit fuses, and regularly pressure measurement and evaluation resources.

As a system administrator, you may find yourself (today or in the future) working in an environment where Windows and Linux coexist. It is no secret that some big companies prefer (or have to) run some of their production services in Windows boxes an

Frankly speaking, I cannot recall the last time I used a PC with a CD/DVD drive. This is thanks to the ever-evolving tech industry which has seen optical disks replaced by USB drives and other smaller and compact storage media that offer more storage

In Linux systems, 1. Use ipa or hostname-I command to view private IP; 2. Use curlifconfig.me or curlipinfo.io/ip to obtain public IP; 3. The desktop version can view private IP through system settings, and the browser can access specific websites to view public IP; 4. Common commands can be set as aliases for quick call. These methods are simple and practical, suitable for IP viewing needs in different scenarios.

Built on Chrome’s V8 engine, Node.JS is an open-source, event-driven JavaScript runtime environment crafted for building scalable applications and backend APIs. NodeJS is known for being lightweight and efficient due to its non-blocking I/O model and

Data replication is the process of copying your data across multiple servers to improve data availability and enhance the reliability and performance of an application. In MySQL replication, data is copied from a database from the master server to ot
