Broad Network


White Space in C

C Tutorials from Roots – Part 18

Forward: In this part of the series, I explain what is white space and how it is used in C.

By: Chrysanthus Date Published: 20 Aug 2012

Introduction

This is part 18 of my series, C Tutorials from Roots. In this part of the series, I explain what is white space and how it is used in C. When an ordinary man looks at a printed document or a web page, he can call the spaces that do not have text or pictures, white space. These spaces are actually blank spaces; they do not have to be white in color.  On a computer screen, some of these spaces have characters and a programmer has to be conscious of which character creates which particular type of white space. The meaning of these characters and their particular type of white spaces are listed and explained in this tutorial.

Note: If you cannot see the code or if you think anything is missing (broken link, image absent), just contact me at forchatrans@yahoo.com. That is, contact me for the slightest problem you have about what you are reading.

Type of Characters
The ordinary man considers, A, as a character, B, as another character, C, as another character, 5 as a character, 7 as character, and so on. On the computer keyboard, you see non-commonly used characters such as the asterisk, *. In many computer languages, a white space character consists of two items. It begins with a back slash followed by text. These two items effectively form the white space character. For example, \n, is a white space character for a good number of computer languages; it has been mentioned before. These two-item characters are better called Escape Sequences. They are also called Special Characters.

The Horizontal Tab
While somebody is writing with a pen on a piece of paper, if he wants to start a new paragraph he does not start at the left margin; he shifts a bit to the right (indents). That indentation can be considered as a horizontal tab. There is an escape sequence that can be used to achieve this with C. It is, \t. It begins with a back slash, followed by ‘t’ in lower case. This special character is called the horizontal tab (i.e. \t). Read and try the following code:

#include <stdio.h>

int main()
    {
        char *str = "tand the sentence begins";
                printf(str);
    
        return 0;
    }

At the output you should see a long horizontal space in front of the text. The horizontal tab character can be placed anywhere in the string and you can have more than one of them in a string. You can use the horizontal tab character (\t) to format output of a lot of text that is in table format. However, there are better ways of formatting output. Today input and output is done with windows. After this series I will write another series, which will show you how to do that.

The Vertical Tab
As the horizontal tab exists, a Vertical Tab in the vertical direction also exists. For many languages, the escape sequence for the vertical tab is ‘\v’. You have a back slash followed by ‘v’ in lower case. This character (\v) may not work with your MinGW compiler at the default settings.

Form Feed
A form feed is more of an instruction than a blank space character. It is called a white space character because it can cause a blank space. Imagine that there are about ten lines of text for a document. Also imagine that in the middle of this text, you have the escape sequence, \f, which is what many languages use as the form feed character. Now, while the page that has this text is being printed (displayed), when the printer (or screen) reaches this escape sequence, it should not print the rest of the text below on the current page; it should advance the page, leaving a blank space and then starts printing the rest of the text on the next page (paper). Form Feed means: print the rest of the text on the next page, just after feeding in the next page (paper to the printer). If the printer meets this character at the end of the current page, then no blank space would be produced, as the rest of the text would be printed (or displayed) on the next page fed.

Line Terminators
Two escape sequences are described below as white space characters, but they do not really produce blank spaces. However, they affect where the next line or text would be printed or displayed.

The horizontal tab and vertical tab white space characters are by themselves blank spaces. The form feed character can produce a blank space depending on its position in the current page; in itself, it is more of an instruction than a blank space character. The two escape sequences below, are not blank characters by themselves. They are actually line terminators, but in many forums they are called, white space characters.

Carriage Return
Imagine that a line of text is to be displayed (printed) and there is the escape sequence, \r in the middle of the line of text. \r is known as the Carriage Return character for many languages. When the printer or screen reaches this point, it sends the ink (or light) to the beginning of the current line. After this, if printing were to continue, the current line will be written over, by the right half of the line of text. The carriage return escaped sequence is normally used in conjunction with the Newline escape sequence (see below).

The Newline
Imagine that a line of text is to be displayed and there is the escape sequence, \n in the middle of the line of text. \n is known as the Newline character for many languages. When the printer or screen reaches this point, it sends the ink (or light) to the next (to-be-displayed) line. It is not clear whether the ink should go to the beginning or middle or end of the next line. If the programmer wants printing to continue at the beginning of the next line, then he has to use both \r and \n together (i.e. \r\n) at the same point in the line of text. With some languages (compilers or interpreters), \n alone serves the purpose of the presence of both \r and \n.

Read and try the following code:

#include <stdio.h>

int main()
    {
        char *str = "This is the first sentence. \r\nThis is the second sentence.";
                printf(str);

        return 0;
    }

The output should be made of two lines. The first line should have the first sentence and the second line should have the second sentence even though both sentences are in one line in the code.

Note
The escape sequences for the white spaces are not displayed as \t, \f, etc. The user sees only their effects. The space, horizontal tab and vertical tabs characters can be considered as pure white space characters.  The form feed and especially the line terminators, can be considered as indirect white space characters.

We can stop here for now. See you in the next part of the series.

Chrys
NEXT

Comments

Become the Writer's Fan
Send the Writer a Message