Systematic Errors in Fortran Programs

top_left_banner SimCon logo

Are there Systematic Errors in Fortran Programs?

We consider systematic errors to be features in the code which:

may cause a program to behave in ways which are not intended;
are not always detected by compilers (Compilation errors are eliminated in program development);
belong to recognisable classes, for example, mis-matched arguments or uninitialised variables;
are sufficiently common to be of concern.

And yes there are. We usually see of the order of one anomaly per 50 lines of commented code.

Error Detection by fpt

fpt traps many classes of error and anomaly. Some issues are always trapped, some are trapped by specific fpt commands, and some are detected by instrumenting the code so that they are exposed at run-time. The commands used are shown for each of the classes of error described below.

There are some classes of systematic error which fpt cannot yet trap. These are also listed, and we hope to trap them in the future. If you are aware of any classes of error not listed here please tell us!

The Classes of Error

Note that some of the issues listed here do not always cause errors. These are listed because they may cause errors or may cause problems in migration. Please follow the links from the headings for more detailed descriptions.

Issues in Variable Names and Declarations

Keywords used for symbols

Fortran keywords are not reserved words. Words like PROGRAM, TYPE and DATA can be used for variables. This causes surprisingly few problems. However, we have seen some errors, and it does wonders for the readability of the code. More...

Intrinsic Function Names used for Symbols

Fortran intrinsic function names are also not reserved and may be used for variables. There are hundreds of them, and it is quite easy to use one of the more obscure ones by accident. The most dangerous cases are where they are used to name arrays. More...

Inconsistent Use of Names

Two issues are of particular importance:

Different values for the same parameter name (e.g. g = 32.2 and g = 9.81 )
Different COMMON block addresses for the same name

More...

Equivalence of Different Types or Kinds

Equivalence of different real kinds usually arises by accident and will cause an error. Equivalence of different integer kinds used for bit and byte manipulation may not be portable. More...

Objects Forced to Unaligned Addresses

Sequence derived type constructs, COMMON statements and (non-standard) structure definitions may force variables to poorly aligned addresses. This can expose compiler errors and will usually lead to inefficiency. More...

Errors in Single Statements

Array References Out-of-Bounds

Compilers usually trap array references out-of-bounds where the array indices are static expressions. Array bounds can usually be checked at run-time, but run-time bounds checking may be turned off to improve run speed, and some systems only check that a reference is within the array and do not verify the bounds of the separate indices. More...

Unsafe Data Type Coercions

The most common problem of this type is the use of integer variables as if they were logical. Fortran does not define a translation between integer and logical, and different systems use different conventions. More...

Loss of Precision

The use of a single low-precision object may degrade the results from all of the high precision components in an expression. More...

Accidental Whole Array Assignments

If an unsubscripted array is assigned a scalar value, all elements of the array receive that value. If the indices of an array reference are omitted in error, the entire array is overwritten. More...

Zero Integer Expressions

In Fortran, the results of integer divisions or exponentiation of integers are integer. Sub-expressios like (2/3) and 10**(-9) have the value 0. This leads to some interesting errors. More...

Structural Errors in Multiple Sub-programs and Statements

Unreachable Code

Unreachable code does not necessarily indicate an error. Sections of code may be switched in or out of compilation by macro pre-processor switches or by the use of Fortran parameters. However, it does sometimes indicate a problem. More...

Inconsistent Order of Evaluation

If an expression contains two or more function invocations the Fortran standard explicitly states that the order of evaluation is undefined. This causes problems:

If the functions have side-effects;
If program execution is traced as part of an investigation. The trace may follow different paths in different conditions.

More...

Errors in Usage of Variables

The issues here are:

Variables read before they are assigned
Variable values assigned but not used
Variables declared but unused

More...

Unused Statement Labels

Unused labels have no effect on program execution, but they may reduce the code readability and may indicate a problem. More...

Inconsistent Sub-program Arguments

In general, the actual arguments passed to a sub-program should match the formal arguments in the sub-program definition in:

Data type
Data kind
Passing protocol - i.e. passed by reference, by value, as a label, as a pointer or as a sub-program name.
String length if a character object
Array shape
INTENT - i.e. whether input, output, input and output, a sub-program name etc.

Compilers usually make these checks if the sub-program interface is visible in the compilation unit where the call is made, and in the case of INTENT, if the INTENT is declared in the interface. Usually, no check is made if the sub-program interface is not visible, and most of the errors observed occur in this case.

A large number of errors are observed in the INTENT of sub-program arguments, and this issue is therefore treated separately in this list. More...

INTENT of Arguments

The INTENT of a sub-program argument may be declared to be IN, OUT, INOUT or may be undeclared. The Fortran standard explicitly states that the declared INTENT may be used to guide code generation. Therefore an error may occur if:

the INTENT is declared IN and the argument value is modified;
the INTENT is declared OUT and the argument may be read before it is assigned a value;
the INTENT is declared in any way and the argument is a sub-program or intrinsic function name.

More...

Optional Arguments

An error may occur if a sub-program accesses an optional argument which was not present in the current call. To protect against this, access to optional arguments may be enclosed in IF PRESENT(...) constructs. More...

Inconsistent Use of Logical Units

Errors may occur if the same logical unit number is used to access two or more files. If an OPEN statement opens a file on a unit which is already in use, the file attached to that unit is immediately closed. Some systems do not flush the file buffers. More...

Failure to Export Overloads

Sub-programs may be setup to overload operations on derived types. For example, the * operator may be overloaded. If the overloads are specified in a Fortran module and are not exported, compilation errors will occur when the overload is used, and the problem will be detected and fixed.

However, it is possible to overload the assignment operator. In this case, if the overload is not exported, no error occurs and the assignment simply copies the right-hand-side quantity to the left-hand-side. The overload fails silently. More...

System Errors

Compiler Bugs

Compilers are computer programs, and where modern Fortran is combined with all of the legacy of extended FORTRAN 77, they are asked to do a great deal. There will be bugs, and systems need to be in place to identify them. More...

Compilation Errors

These are not systematic errors because programmers remove them rapidly as they occur. They are included in this list simply to comment that they are also reported by the analysis tools.

Errors in Legacy Code

Accidental Equivalence

COMMON blocks are layed out separately in every sub-program in which they are referenced. When there are multiple text definitions of a COMMON block there is a serious risk of mis-alignment. When this occurs, objects at the same COMMON block address are accidentally made equivalent.

This is a legacy issue because COMMON blocks are disappearing from use. However, many legacy codes are important and errors from this source must be trapped. More...

Multiple DATA Initialisation of COMMON Blocks

In FORTRAN 77 there was no problem in initialising the same COMMON block in multiple BLOCK DATA sub-programs. In extended FORTRAN 77, for example, DEC VAX systems, COMMON blocks could also be initialised in multple subroutines and functions. But under Linux, OSX and Unix this will not work. Every initialisation of a COMMON block initialises the whole block, and any uninitialised components are set to zero. The COMMON block is finally populated by which ever initialisation regime happened to be linked last. All other data are lost. More...

Fixed Format Anomalies

The introduction of the free-format layout in Fortran 90 eliminated a host of potential errors. The most important cases are:

Code running past column 72 into the comment field;
Missing spaces between tokens, which may mask errors;
Spaces within symbols, keywords etc. which often did mask errors;
Exclamation as a continuation character, which could kill half a statement.

More...

Issues which fpt Can Not Yet Trap

Units and Dimensions

It is possible, by static analysis, to analyse the code and to infer the relationships between the units and dimansions of the variables. Research is in progress, and the current version of fpt (Version 4.0-e) has an experimental command to check units. More...

It is also possible to modify the code to replace all REAL and COMPLEX declarations and types by emulated types and to attach units and dimansions to each instance. These are then propagated and tested at run-time. Again, research is in progress. fpt has commands which will re-engineer the code and the run-time support is under development.

Pointer De-referencing

Fortran pointers, particularly those which are components of derived types, can be accidentally de-referenced when they are passed as sub-program arguments. A handler is needed to check for this situation.

Index Associations

Suppose we have two 2-D arrays, A and B. In a DO loop, A(i,j) is associated in some way with B(i,j). Subsequently we find an loop in which A(i,j) is associated with B(j,i). We should raise a diagnostic asking whether the assciation is the wrong way around.