Rapid Application Development toolkit for building Administrative Web Applications

RADICORE - A Development Infrastructure for PHP

By Tony Marston

2nd August 2003
Amended 1st May 2024

As of 10th April 2006 the software discussed in this article is available as open source and can be downloaded from www.radicore.org

Introduction
Background
The 3 Tier Architecture
Presentation layer
Business/Domain layer
Data Access layer
The Model-View-Controller (MVC) design pattern
The Model component
The View component
The Controller component
The combined architecture
Object Classification
Infrastructure Design
Forms Families
Structure, Behaviour, Content ...
... and Style
Infrastructure Implementation
Component scripts
Transaction Pattern (Controller) scripts
Generic (abstract) Table class
Common Table Methods
Hook Methods
Common Table Properties
Database Table (Model) classes
Default class file
Validation class
DML class
View object
Screen Structure scripts
XML Documents
XSL Stylesheets
XSL Transformation process
HTML output
CSS files
AUDIT class
Workflow Engine
Levels of Reusability
The Path to Reusability
Extending this Infrastructure
References
Amendment History
Comments

Introduction

With every programming language I have worked in it has become normal practice, after having developed an initial series of programs, to identify a common structure to which all subsequent programs should be built. This may take some time as it involves a bit of trial and error in playing with the different ways in which a task can be achieved in order to find the methodology that gives the most advantages in the long term. Eventually this development infrastructure/environment should contain the following:-

A description as to how the user interfaces should be built so as to provide a consistent look and feel.
A list of naming conventions to identify how objects, functions and programs should be named.
A description of the directory structure used in the development environment so everybody knows where each type of object is supposed to be located.
A description of the strategies to be used when building programs.
A library of standard functions that can be used to perform common tasks.

The list can be extended even further, but that will do as a starting point.

Background

The PHP development environment that I have devised was created by taking a set of sample programs which I had assembled for a previous language and rewriting them in PHP. These sample programs were used as patterns or templates for all other programs in my applications as they represented all the combinations of structure and behaviour that I had encountered. To build a new component all I had to do was identify the right template, then specify which database tables(s) and columns I wished to show on the screen for the new component. Each database table was accessed via its own separate service component which contained all the business rules associated with that table plus the code to communicate with the database. As each user screen accessed each database table by its own service component it meant that business logic was shared and not duplicated. Both user screens and service components used shared code in the form of 'include' files, so it was possible to update the shared library and have the changes automatically picked up by the various components.

From the outset my aim was to produce an environment with the following characteristics:-

To be based on the 3 Tier Architecture so that the logic for the presentation layer, the business layer and the data access layer would be contained within separate components. This architecture allows the code within any one of the layers to be changed without affecting any of the other layers.
To make use of as many standard reusable scripts as possible. I have been long familiar with the advantages of having libraries of reusable subroutines and functions in other development languages, and my previous language, a 4th generation tool called UNIFACE, had the extra ability of components being able to inherit code from component templates.
To make use of the OO capabilities of PHP, but only where I saw an advantage in doing so. I do not believe OO is a universal panacea as any old problems which it claims to solve are counter-balanced by new problems which it creates. OOP techniques do not prevent errors, they simply create a new class of error.

What I actually produced is as follows:-

As I was familiar with XML and XSL before learning PHP, and because PHP already incorporates modules for those two disciplines, I chose to generate all HTML output via XML/XSL transformations. I did read about other templating engines that are available for PHP, but as these are written in PHP and tied to PHP I rejected them as I wanted something that was totally independent of the underlying language and preferably written to 'open' standards and therefore accessible to a larger community.
After building several screens with XSL stylesheets I identified a great deal of common code and moved it to a series of smaller XSL files which can be 'included' at runtime. Now when I build a stylesheet for a component it contains a large number of calls to my library of common templates.
All HTML output is written to W3C standards and is therefore browser-independent. The output is actually XHTML 1.0 Strict with all formatting specified via Cascading Style Sheets (CSS). There is no JavaScript as this is not controlled by any W3C standard and there are too many incompatibilities with its implementation between the different browsers. This decision allows the software to work as intended on any browser, the only condition being that the browser complies to W3C standards. If anyone has a non-standard browser then any problems are theirs, not mine. Note that while none of the framework components use javascript it does support the ability for individual application subsystems to include javascript in their components.
With UNIFACE I had successfully implemented the 3 Tier Architecture by using service components in the middle layer, and as there are distinct similarities between 'components' and 'objects' (see Using PHP Objects to access your Database Tables (Part 1) for details) I found it very easy to build a class for each database table where previously I had used a service component. The ability for a component to inherit code from a component template was replaced by the ability for a class to inherit code from it's superclass.
In my first implementation I had a standard set of functions within my abstract database class which dealt with the creation and execution of all DML (Data Manipulation Language) statements. I have subsequently been able to extract all of those functions and place them in a class of their own, thus creating a totally separate DML object in the data access layer.
I have thus ended up with a 'true' 3 Tier structure as
1. Only my data access layer has any form of communication with the database.
2. All business rules are processed by objects within the business layer.
3. All interfacing with the user is done by components within the presentation layer.
4. There is no direct communication between the presentation and data access layers. All communication is routed through the business layer.
Although I had not intended to use the Model-View-Controller (MVC) design pattern in my infrastructure (a previous encounter with someone else's disastrous implementation had not convinced me of any benefits), I later realised that what I had produced was a good fit to the MVC principles. It just goes to show that it is not what you implement but how you implement it that is important.

Interestingly enough my decision to have all HTML output generated through XSL transformations instead of directly by PHP code actually paid enormous dividends by producing a great deal more reusable code than I had originally anticipated. I started by creating a script which performed an operation on a database table then wrote a second script to perform the same operation on a different table. I then compared the two scripts to see what was duplicated and could therefore be put into a sharable file, and what was different and which would have to remain in a script of its own. By careful engineering of the code I ended up with the situation where there were basically only two differences:

The name of the database table to be accessed.
The name of the XSL file to be used to transform the output.

I ended up with three types of PHP script in my presentation layer:

Unique Component scripts which identify which Model, View and Controller components to use.
Sharable Transaction Pattern (Controller) scripts to carry out the communication with the business layer objects and which produce the HTML output.
Screen structure scripts which control how the output is displayed. Some of these can be shared by several components.

When building the XSL stylesheets I came across common code which I was able to move into separate files as XSL templates (subroutines). These templates can be accessed by any number of stylesheets using the <xsl:include> command. A later improvement meant that instead of having a separate XSL stylesheet for each screen where the field names and field labels were hard-coded I could use a much smaller number of generic stylesheets and have the list of field names and field labels supplied within the XML document. The type of HTML control to be used for each field is written to the XML document as a series of field attributes, and a standard XSL template uses these attributes to generate the correct HTML code.

The 3 Tier Architecture

Any piece of software can be subdivided into the following areas:

Presentation logic = User Interface, displaying data to the user, accepting input from the user.
Business logic = Business Rules, handles data validation and task-specific behaviour.
Data Access logic = Database Communication, constructing SQL queries and executing them via the relevant API.

This topic is discussed in greater detail in What is the 3-Tier Architecture?

If you put the code which deals with presentation logic (the generation of HTML documents), business logic (the processing of business rules) and data access logic (the generation and execution of DML (SQL) statements) into a single component then what you have is a single tier structure, as shown in Figure 1.

Figure 1 - 1 Tier architecture

If you split off all the code that handles the communication with the physical database to a separate component then you have a 2 tier architecture, as shown in Figure 2.

Figure 2 - 2 Tier architecture

If you go one step further and split the presentation logic from the business logic you have a 3 Tier Architecture, as shown in Figure 3. Note that there is no direct communication between the presentation and data access layers - everything must go through the business layer in the middle.

Figure 3 - 3 Tier Architecture

When this architecture is implemented the benefits will become apparent as more code can be shared instead of being duplicated. Several components in the presentation layer can share the same component in the business layer, and all components in the business layer share the same component in the data access layer. This is shown in Figure 4. Note also that a presentation layer component can access more than one business layer component, and a business layer component can access other business layer components.

Figure 4 - 3 Tier Architecture in operation

The big advantage of a 3-tier system is that it is possible to change the contents of any one of the tiers/layers without having to make corresponding changes in any of the others. For example:

A change from one DBMS to another would only require a change to the component in the data access layer.
A change in the Use Interface, for example from desktop to the web, would only require changes to the components in the presentation layer.

You should also notice here that the Business object is only responsible for assembling data in response to a request from the Presentation object. The Business object does not know or care what the Presentation object does with that data - it may build it into a compiled form, an HTML page, a PDF document, a CSV file, an XML document in response to a web service request, or whatever.

By having separate layers with different responsibilities this architecture also makes it possible to use different teams of developers to work on each. That means I can need PHP skills for the business layer, SQL skills for the data access layer, and (X)HTML, CSS and XSL skills for the presentation layer. It may be easier to find developers with skills in one of these areas rather than all three.

The environment which I have created is based on the 3-tier architecture, as shown in Figure 5.

Figure 5 - Environment/Infrastructure Overview

Note that each object in the above diagram is a hyperlink which, when clicked, will take you to the relevant component description which is also contained in the following list:

Component scripts
Transaction Pattern (Controller) scripts
Database Table (Model) classes
Generic (abstract) table class
Validation class
DML class
View Object
Screen Structure scripts
XML documents
XSL Stylesheets
XSL Transformation process
HTML output
CSS files
AUDIT class
Workflow Engine

Note that this is proper 3-Tier Architecture, not the pseudo variety as claimed by many who seem to think that the arrangement of web browser, web server and database automatically constitutes a 3 Tier system. It is the construction of the software in the middle that decides whether the system is 1, 2 or 3 Tier. It is only when a system has separate components to deal with the different areas of logic that it can be truly described as 3-tier.

The infrastructure described in this document has the following degrees of separation:-

Presentation layer

This is part of 3 Tier Architecture. It contains a separate component script for each transaction or task within the system. This is a simple mechanism which identifies which model, view and controller components to use.

The controller script handles the transaction behaviour.

The database table class (model) identifies the entity or entities which need to be accessed.

The Controller performs operations on the Model in order to change the state of the Model after which it is injected into the View object.

The View object extracts data from the Model, transforms it into another format, usually into HTML, and sends the result back to the client. There will be different view objects for alternative formats such as PDF or CSV.

Business/Domain layer

This is part of 3 Tier Architecture. It contains a separate table class for each database table or business entity. Each table class is a subclass of a generic table class so that it can inherit as much generic code as possible.

All communication with the physical database is handled by the generic table class through a separate DML class.

Primary data validation is handled by the generic table class through a validation class. Secondary data validation is performed by customisable methods within each table class.

This layer also communicates my Workflow Engine in order to determine if a new workflow case needs to be created, and to progress each case through its various stages.

Data Access layer

This is part of 3 Tier Architecture. It consists of a one or more DML objects which issue the functions to communicate with the physical database(s). There is a separate DML class for each database engine (MySQL, PostgreSQL, Oracle, SQL Server). This is sometimes referred to as a Data Access Object (DAO).

Not only is it possible within the same transaction to access tables in different databases, it is also possible to access tables through different database engines.

This layer also communicates my AUDIT class in order to record all changes made to an application database in a separate 'audit' database so that they can be reviewed using online enquiry screens.

The Model-View-Controller (MVC) design pattern

It was some time after I had developed this infrastructure that I discovered that it also contained an implementation of the MVC design pattern. This is discussed in more detail in The Model-View-Controller (MVC) Design Pattern for PHP.

Figure 6 - The Model-View-Controller structure

Note that each object in the above diagram is a hyperlink which, when clicked, will take you to the relevant component description.

The Model component

A model is an object which directly manages the data, logic and rules of the application. In my infrastructure this is implemented as a series of table classes, one for each table in the database, which inherit a large amount of code from the abstract table class. Each class handles the data validation and business rules for a single database table, but note that all communication with the physical database is routed through a separate DML Object, with a separate DML class for each supported DBMS engine.

The View component

A view is some form of visualisation of the state of the model. In my infrastructure I can format and send data to the client in one of three possible formats:

HTML - This component is given two pieces of information: (a) a screen structure script which identifies which XSL stylesheet to use and which data elements go where on the screen, and (b) one or more database table objects which have already been filled with data. This component will then extract all the data from the database table object(s) and write it out to an XML document. Other data, such as the menu buttons, navigation buttons, pagination and scrolling details, action buttons, et cetera, is also added to the XML document, after which it is transformed into HTML output. For each database table there is typically a list view (containing multiple occurrences with data arranged horizontally) and a detail view (containing a single occurrence with data arranged vertically). The same detail view can be used by the ADD, ENQUIRE, UPDATE, DELETE and SEARCH screens.
PDF - This component is given two pieces of information: (a) a report structure script which identifies which data elements go where on the report, and (b) a single database table object which has issued an SQL "select" statement. It will extract the data one row at a time, write it out to the PDF document, then read the next row.
CSV - This component is given a single database table object which has issued an SQL "select" statement. It will extract the data one row at a time, write it out to the CSV document, then read the next row. The very first row of data is preceded by a list of the column names.

The Controller component

A controller offers facilities to change the state of the model. It accepts input from the user and instructs the model to perform actions based on that input, then updates the View to show the results of those actions.

In my infrastructure this is implemented as a series of component scripts which link to one of a series of transaction pattern (controller) scripts, one for each Transaction Pattern.

The combined architecture

When the 3 Tier Architecture is combined with the Model-View-Controller design pattern it produces the structure, which can be referred to as Model-View-Controller-DAO (MVCD), as shown in Figure 7 below:

Figure 7 - MVC and 3 Tier Architecture combined

Note that each of the above boxes is a hyperlink which will take you to a detailed description of that component.

An alternative diagram which shows the same information in a different way is shown in Figure 8 below::

Figure 8 - The MVC and 3-Tier architectures combined

You should clearly see that these two patterns do not fight each other, all they do is overlap so that a single component in one is split into two components in the other.

A more detailed structure diagram is shown in Figure 5 above.

Object Classification

In his article How to write testable code the author identifies three main categories or classifications that can be used to describe an object:

Entities	An object whose job is to hold state and associated behavior. The state (data) can be persisted to and retrieved from a database. Examples of this might be Account, Product or User. In my framework each database table has its own Model class.
Services	An object which performs an operation. It encapsulates an activity but has no encapsulated state (that is, it is stateless). Examples of Services could include a parser, an authenticator, a validator or a transformer (such as transforming raw data into HTML, CSV or PDF). In my framework all Controllers, Views and DAOs are services.
Value objects	An immutable object whose responsibility is mainly holding state but may have some behavior. Examples of Value Objects might be Color, Temperature, Price and Size. PHP does not support value objects, so I do not use them. I have written more on the topic in Value objects are worthless.

This is also discussed in When to inject: the distinction between newables and injectables.

The components in the RADICORE framework fall into the following categories:

Models are Entities, with a separate class for each table in the application database.
Views are Services, with a separate component for HTML, PDF and CSV output.
Controllers are Services, with a separate component for each Transaction Pattern.
Data Access Objects are Services, with a separate class for each supported DBMS engine (MySQL, PostgreSQL, Oracle and SQL Server).

It should also be noted that:

Services are supplied within the framework and do not have to be created or modified by the developer. Services are also application-agnostic in that they do not contain any knowledge of any part of any application, they do not contain any business rules.
Entities are generated by the application developer, one for each database table, using functions within the Data Dictionary, and are the only components which contain information regarding the application. Each concrete table class inherits standard code from an abstract table class, table metadata from a table structure file, but uses "hook" methods to define application-specific business rules.

Infrastructure Design

Some of my approaches to infrastructure design are based on experiences which I have had in previous languages. It is encouraging to know that some of my design decisions as just as valid now as they were then. It just goes to show that quality is ageless.

Forms Families

Some designers have the peculiar notion that the complexity of a system is directly proportional to the number of components it contains, therefore they try to pack as many functions as possible into a single component. In order to maintain the contents of a typical database table it is usual to provide the following functionality:

The ability to browse through all or selected occurrences (rows).
The ability to define selection criteria in order to retrieve selected occurrences.
The ability to create/insert new occurrences.
The ability to read/enquire the details of existing occurrences.
The ability to amend/update existing occurrences.
The ability to delete existing occurrences.

It is possible to put all this functionality into a single component, but the end result is a very large, very complex component. If this approach is duplicated throughout the entire system the end result is a collection of very large, very complex components. In my experience the size of a component is directly proportional to the amount of effort needed to maintain it, so smaller is better.

The alternative approach, one which I first found to be successful when COBOL was my primary language and which was just as successful when I switched to UNIFACE, is to provide each of these facilities in a separate component. This may produce a large number of components, but at least they are small and simple. The arguments for the 'small and simple' approach against the 'large and complex' are explored in more detail in my article Component Design - Large and Complex vs. Small and Simple.

When I read that when designing components for web pages the 'small and simple' approach was preferred over the 'large and complex' this did not pose a problem for me as this has been my design philosophy for 20 years.

Now when I want to write software to maintain the contents of a typical database table I build a 'family' of small components where each one performs just one of the previously mentioned functions. This produces a family of components (sometimes referred to as forms or screens) with the structure shown in Figure 9. Each of these components has its own Transaction Pattern (Controller) script. This also made it much easier to allow the Role Based Access Control (RBAC) mechanism in my framework to grant or restrict access to individual members of each family, otherwise it would require more complex code within a composite/compound component.

Figure 9 - A typical Family of Forms

Note that all the boxes in the above diagram are hyperlinks which will take you to a description of that component.

In this structure the LIST (parent) component is the only one that is available on a menu button - all the other child components can only be selected from a navigation button within a suitable parent component. In most cases the child component will need the primary key of an occurrence in the parent component before it can load any data on which it is supposed to act. In this case the required occurrence in the parent screen must be marked as selected using the relevant checkbox before the hyperlink or control button for the child component is pressed.

Another difference between these components is that the LIST component shows multiple rows of details, one occurrence per row, whereas the others will show the details for a single occurrence. As the layout for the SEARCH, INSERT, UPDATE, DELETE and ENQUIRE screens is extremely similar I have managed to provide for their construction with a single XSL stylesheet. This means that for each database table I only need 2 XSL files, as shown in Figure 10.

Figure 10 - XSL files required for a family of forms

This is made possible as one of the parameters used in the XSL transformation process is $mode. This is used within the XSL stylesheet to determine if each field can be input/amended by the user or should be display only.

In my original infrastructure each database table required its own version of the list.xsl and detail.xsl files as the table, field names and field labels had to be hard-coded inside them, but I have since enhanced my XSL library so that a small number of generic stylesheets can be used for any number of database tables. This is done by providing the list of field names and field labels which are to appear in the data area of the screen in a separate screen structure file which is then copied into the XML document, as documented in Reusable XSL Stylesheets and Templates and uses updated versions of my std.list1.xsl and std.detail1.xsl stylesheets.

Structure, Behaviour, Content ...

In my long career as a software engineer I have written countless hundreds of components, and many times I have come across the situation where I have been asked to create a new component which is "just like that one, but which works on this set of data". In this situation it is necessary to identify those parts of the original component which can be reused 'as is' and those parts which have to be altered. In order to do this I break down each component into the following areas:

Structure - how many tables or objects it deals with, arranged in different areas or zones, and how they are arranged in relation to one another.
Behaviour - what action it performs, such as listing multiple rows, or creating/reading/updating/deleting a single row.
Content - which database table(s) and field(s) does it deals with.

The trick now is to make different templates or patterns based on a particular combination of structure and behaviour so that when you build a component from a template all you have to do is specify the content. No two languages provide the same method of creating reusable templates, so what works in one particular language may be totally impossible in another. With PHP the method I have devised is to produce scripts in two categories - Screen Structure scripts which identify the content and reusable Transaction Pattern (Controller) scripts which deal with the structure and behaviour.

... and Style

A feature of HTML documents is that the visual presentation of each page can be altered quite easily. By 'visual presentation' I mean any of the following:

Fonts - different parts of the page can use different fonts of different sizes.
Colours - different parts of the page can use different foreground and/or background colours.
Images - background colours can be replaced by background images.
Positioning - elements of a page may be positioned using either absolute or relative coordinates.

Although it is possible to include all style specifications within an HTML document it is not considered to be good practice. The most efficient method is to extract all style specifications and keep them in a separate Cascading Style Sheet (CSS) file or files. In this way it is possible to update the contents of a single CSS file and have that change automatically inherited by all the pages which reference the styles defined within that CSS file. Without the use of a CSS file it would be necessary to update each page individually, which on a large site with many HTML pages could be a long and laborious process.

The term 'cascading' means that an HTML document can actually refer to a series of CSS files. These files will be scanned in the order in which they were defined and their contents merged so that a single specification is the result. In my infrastructure I use several CSS files

Infrastructure Implementation

Although this infrastructure appears to be quite complex due to the large number of components, each component is responsible for just a small area and is therefore relatively simple. The trick is to know which components have to be created by the developer, which components have already been written and are available for immediate use, and how they all hang together.

Component scripts

This is item (1) in Figure 5.

There is one of these for each task (user transaction) within the application. Each task has an entry on the MNU_TASK table in the MENU database so that it can be identified on the ROLE-TASK table (access control list) and made to appear or disappear on either a menu bar or a navigation bar.

Here is an example of one of these scripts:

<?php
$table_id = "person";                      // identify the Model
$screen   = 'person.detail.screen.inc';    // identify the View
require 'std.enquire1.inc';                // activate the Controller
?>

As you can see this is a simple script whose purpose is to identify the following:

$table_id identifies the Model part of MVC. This is one of the generated database table classes.
$screen identifies the View part of MVC. This is one of the generated screen structure scripts.
include identifies the Controller part of MVC. This is one of the pre-written controller scripts.

Note that each Controller can work with any Model, but is does not know which one until it is given a value for $table_id at run-time.

Because each task in the application has its own controller script in the file system it can be activated simply by putting the address of that script into the browser's address bar, thus avoiding the need for a front controller. However, you should only ever activate a task using the relevant menu button or navigation button otherwise the framework will disallow it.

The URL which appears in the browser's address bar will be in the format <protocol><domain>/<subsystem>/<table>(<pattern>)<suffix>.php where:

<protocol> is either 'http://' or 'https://'
<domain> is the internet domain name, such as 'www.radicore.org'
<subsystem> is the subsystem name. The entire application will be broken down into a number of separate subsystems each of which has its own directory in the file system and its own database in the DBMS. The framework itself has four subsystems - MENU, AUDIT, DICTIONARY and WORKFLOW - onto which any number of other subsystems can be added. My main ERP application, for example, contains over fifteen additional application subsystems.
<table> is any one of the table classes which represent one of the tables in that subsystem's database.
<pattern> is the identity of one of the Transaction Patterns which are supported by the framework.
<suffix> is an optional suffix where a table may have several tasks which use the same pattern but for different purposes.

Unlike the Transaction Script pattern from Martin Fowler's Patterns of Enterprise Application Architecture (PoEAA) which contains all the processing in a single script, this script does nothing but identify other components which carry out the relevant processing.

Each script in this category simply specifies the Model and View before handing control over to a particular Controller.

The component script for a transaction is automatically created when that transaction is generated from the Data Dictionary.

Sample scripts for each pattern can be found in the /radicore/default/ directory.

Transaction Pattern (Controller) scripts

This is item (2) in Figure 5.

This can also be referred to as a Transaction Controller or a Page Controller.

Each controller script contains the code to translate the HTTP request into method calls on its Model class(es). As each Model class contains the same set of common table methods this means that any Controller can be reused with any Model. These method calls will then perform the actions which are necessary to complete the request. For example:

The LIST1 controller will allow the user to browse the contents of a database table.
The SEARCH1 controller will allow the user to specify selection criteria which will be used by the LIST transaction to filter its results.
The ADD1 controller will allow the user to add a record.
The ENQUIRE1 controller will allow the user to view the contents of a selected record.
The UPDATE1 controller will allow the user to update the contents of a selected record.
The DELETE1 controller will allow the user to delete a selected record.

Please note that I do not combine these actions into a single Controller as that would require calling the controller with an argument identifying which action it is required to perform. Each Controller is hard-wired to perform only a single action, and its behaviour is only affected depending on whether it was called with the GET or POST method.

There is a separate Controller for each of my Transaction Patterns.

Each script in this category is based on a particular combination of structure and behaviour. The actual content is identified by the Screen Structure script which is set in the calling Component script. This means that one of these Controller scripts may be called by many different Component scripts, as shown in Figure 11.

Figure 11 - Many Component scripts to one Transaction Pattern (Controller) script

None of these Controller scripts generates any HTML output directly. This is done by the view object which creates an XML document containing all the relevant data, which is then transformed into HTML using a separate XSL stylesheet.

Here is an example of one of my scripts:

<?php
// name = std.enquire1.inc

// type = enquire1

// This will display a single selected database occurrence using $where
// (as supplied from the previous screen)

require 'include.general.inc';

// identify mode for xsl file
$mode = 'enquire';

// initialise session
initSession();

// look for a button being pressed
if ($_SERVER['REQUEST_METHOD'] == 'POST') {
   if (isset($_POST['finish']) or (isset($_POST['finish_x']))) {
      // cancel this screen, return to previous screen
      scriptPrevious();
   } // if
} // if

// create a class instance for the main database table
require "classes/$table_id.class.inc";
$dbobject = new $table_id;

$dbobject->sql_select  = &$sql_select;
$dbobject->sql_from    = &$sql_from;
$dbobject->sql_where   = &$sql_where;
$dbobject->sql_groupby = &$sql_groupby;
$dbobject->sql_having  = &$sql_having;
// check that primary key is complete
$dbobject->checkPrimaryKey = TRUE;

// define action buttons
$act_buttons['finish'] = 'FINISH';

// retrieve profile must have been set by previous screen
if (empty($where)) {
   scriptPrevious('Nothing has been selected yet.');
} // if
   
// get data from the database
$fieldarray = $dbobject->getData($where);
   
if ($dbobject->getErrors()) {
   // some sort of error - return to previous script
   scriptPrevious($dbobject->getErrors());
} // if
	
// check number of rows returned
if ($dbobject->getNumRows() < 1) {
   scriptPrevious('Nothing retrieved from the database.');
} // if

$fieldarray = $dbobject->getExtraData($fieldarray);
	
// build list of objects for output to XML data
$xml_objects[]['root'] = &$dbobject;

// build XML document and perform XSL transformation
$view = new radicore_view($screen_structure);
$html = $view->buildXML($xml_objects, $errors, $message);
echo $html;

?>

Please note the following points:

All files containing classes for database tables are in the format '<table_id>.class.inc', so all the Component script need do is supply a value for $table_id and the Controller script can create an object from that class.
By setting the object variable checkPrimaryKey to TRUE I will trigger the code to check that the $where string contains values for all fields which make up the primary key for this database table. The primary key details are specified in the $fieldspec array.
Each database table class contains a standard getData method which returns the $fieldarray array of rows (starting at row zero), and each row contains an associative array of fieldname=fieldvalue pairs. Note that while the default query which is generated will be SELECT * FROM $this->tablename [WHERE ...] this query can be customised to anything which is valid SQL. This includes JOINs to other tables, UNIONS, subqueries, Common Table Expressions, et cetera.
By default the getData method will select all columns from the specified table, but this can be altered to specify any number of columns from any number of tables by settings in the $sql_??? variables.
The entire contents of $fieldarray will be output to the XML document. Note that no field names need be specified in the script as they are all extracted from $fieldarray.
The $fieldspec array from the database object may contain entries which will be included in the XML output as attributes in order to affect the outcome of the XSL transformation. For example noedit will cause a field to be read-only, while nodisplay will cause the field to be excluded from the HTML output altogether.

Here is a brief explanation of my user-defined functions which are contained within file include.inc:

initSession() - carries out all processing required when a script starts. This includes reading the session data to obtain the contents of the variable $where which contains any selection criteria passed down from the previous script.
scriptPrevious() - will return processing to the previous script with an optional error message. Note that this is not the same as pressing the browser's back button.
array2where() - transforms an associative array into a string which can be used as the WHERE clause in an SQL SELECT statement. The first parameter is the array, the optional second parameter identifies the subset of fields to be included.
$view->buildXML() - takes the $xml_objects array and transfers all the data to an XML document, after which it will perform the XSL Transformation process using the XSL stylesheet which was specified in the screen structure script to produce the HTML output.

You should notice that the above script does not contain any hard-coded database, table or field names, therefore it can be used for any database table within the system. The points to consider are:

The name of the table to work on is passed down in the $table variable which is set by the component script.
That variable name is used to obtain a class definition from an 'include' file, and an object is instantiated from that class with a generic name, which in this case is $object.
The controller communicates with the object using standard method names which are common to all objects as they are inherited from the generic table class.
The data which comes out of the database object is a standard associative array of fieldname=fieldvalue pairs. This is extracted by the view object and transferred to the XML document in an equivalent fieldname=fieldvalue format.

I have some controller scripts which work on more than one database object, such as when dealing with a parent-child relationship or a many-to-many relationship, but the principles are exactly the same.

Generic (abstract) table class

This is item (4) in Figure 5.

This is an abstract class which is based on the ideas outlined in Using PHP Objects to access your Database Tables (Part 1) and (Part 2). It is called an 'abstract' class as it cannot be instantiated into an object due to missing information which is identified in the common table properties. This missing information is not supplied until a 'concrete' Model subclass is instantiated into an object. This class contains a set of common table methods which support the standard CRUD operations which operate on any database table in the application. These methods and properties are automatically inherited by every concrete subclass. Every public method called by a Controller on a Model is an instance of the Template Method Pattern which means that it executes a pre-defined sequence of steps in various sub-methods. Some of these sub-methods contain invariant/fixed code while others, which are initially empty in the abstract class, are variable/customisable and can be declared in the subclass in order to execute additional code which is specific to that subclass.

Common Table Methods
Methods called externally	Methods called internally	UML diagram
$object->insertRecord($_POST)	$fieldarray = $this->pre_insertRecord($fieldarray); if (empty($this->errors) { $fieldarray = $this->validateInsert($fieldarray); } if (empty($this->errors) { $fieldarray = $this->commonValidation($fieldarray); } if (empty($this->errors) { $fieldarray = $this->dml_insertRecord($fieldarray); $fieldarray = $this->post_insertRecord($fieldarray); }	ADD1 Pattern
$object->updateRecord($_POST)	$fieldarray = $this->pre_updateRecord(fieldarray); if (empty($this->errors) { $fieldarray = $this->validateUpdate($fieldarray); } if (empty($this->errors) { $fieldarray = $this->commonValidation($fieldarray); } if (empty($this->errors) { $fieldarray = $this->dml_updateRecord($fieldarray); $fieldarray = $this->post_updateRecord($fieldarray); }	UPDATE1 Pattern
$object->deleteRecord($_POST)	$fieldarray = $this->pre_deleteRecord(fieldarray); if (empty($this->errors) { $fieldarray = $this->validateDelete($fieldarray); } if (empty($this->errors) { $fieldarray = $this->dml_deleteRecord($fieldarray); $fieldarray = $this->post_deleteRecord($fieldarray); }	DELETE1 Pattern
$object->getData($where)	$where = $this->pre_getData($where); $fieldarray = $this->dml_getData($where); $fieldarray = $this->post_getData($fieldarray);	ENQUIRE1 Pattern

Here the methods called externally are the ones which are called from the Controller while the methods called internally are called only from within the abstract table class which is inherited by every Model. Each external method then acts as a wrapper for a group of internal methods.

Hook methods

Notice that before and after each database operation, which has the "dml_" prefix, there are pairs of "pre_" and "post_" methods. These will contain calls to "hook" methods which have the following format:

function _cm_whatever ($fieldarray)
// interrupt standard processing with custom code
// if anything is placed in $this->errors the operation will be terminated.
{
    // customisable code goes here

    return $fieldarray;

} // _cm_whatever

Although these methods are defined in the abstract class and called in specific places in the processing flow they have absolutely no effect as all they do is return the input argument $fieldarray untouched. They are designed to be overridden when necessary in a concrete subclass in order to replace the default behaviour (which is to do nothing) with custom logic which is specific to that subclass. This is how I implemented the Template Method Pattern.

Please note the following:

Instead of using separate arguments or setters/getters for each column I pass all the data around in a single $fieldarray argument, which is usually from the $_POST array. This means that none of the calling code has to mention any columns by their names as this would make the calling code tightly coupled when it should be loosely coupled.
I do not have separate methods for load(), validate() and store() as it would be possible to change a column's value after the validate() and before the store(), thus allowing the possibility of invalid data to be written to the database. As these operations always have to be processed in the same sequence I perform them in a wrapper method so that the sequence cannot be interrupted. As well being more efficient by reducing the number of calls, this approach also enables the contents of the wrapper to be modified in the future, such as by adding a new "hook" method, without having to amend the wrapper's API.
There is no single validate() method as the requirements for the INSERT, UPDATE and DELETE operations are completely different.
- During an INSERT or UPDATE operation basic validation is carried out in the validation object.
- During a DELETE operation the contents of the $child_relations array will be used to check that the record can be deleted and to update/delete all records from child tables if necessary.
There is no single store() method as the requirements for the INSERT, UPDATE and DELETE operations are completely different.
The methods with the "dml_" prefix do not actually communicate with the database themselves, instead they pass control to a separate Data Access Object where there is a separate version for each supported DBMS.
If any methods with the "dml_" prefix fail when the generated SQL query is executed then the entire script is aborted. Relevant details will be written to the application's error log as well as being emailed to the system administrator.
If any errors are detected then suitable messages will be inserted into the $this->errors array. This causes the dml_??? method to be skipped, and the Controller will perform a rollback() instead of a commit() for the current database transaction.
There is only a single getData($where) method as the $where string can contain any combination of selection criteria without the need for separate finder methods. This may return any number of rows. If multiple rows are expected the pagination limits can be controlled by the $rows_per_page and $pageno variables.
At the end each Model's processing the View object can extract all the data out of a Model with a single call to the getFieldArray() method. The entire contents of this array will be copied to an XML document before being transformed in HTML with an XSL stylesheet.

Common Table Properties

In order to turn the abstract class into a concrete class the class constructor will execute code to fill the empty common table properties with data. These properties hold metadata, not application data. This metadata is supplied in a separate <tablename>.dict.inc file which is exported from the Data Dictionary.

$this->dbname	This value is defined in the class constructor. This allows the application to access tables in more than one database. It is standard practice in the RADICORE framework to have a separate database for each subsystem.
$this->tablename	This value is defined in the class constructor and is unique within the database.
$this->fieldspec	The identifies the columns (fields) which exist in this table and their specifications (type, size, etc).
$this->primary_key	This identifies the column(s) which form the primary key. Note that this may be a compound key with more than one column. Although some modern databases allow it, it is standard practice within the RADICORE framework to disallow changes to the primary key. This is why surrogate or technical keys were invented.
$this->unique_keys	A table may have zero or more additional unique keys. These are also known as candidate keys as they could be considered as candidates for the role of primary key. Unlike the primary key these candidate keys may contain nullable columns and their values may be changed at runtime.
$this->parent_relations	This has a separate entry for each table which is the parent in a parent-child relationship with this table. This also maps foreign keys on this table to the primary key of the parent table. This array can have zero or more entries.
$this->child_relations	This has a separate entry for each table which is the child in a parent-child relationship with this table. This also maps the primary key on this table to the foreign key of the child table. This array can have zero or more entries.
$this->fieldarray	This holds all application data, usually the contents of the $_POST array, but it could also contain rows of data fetched from a database. It can either be an associative array for a single row or an indexed array of associative arrays for multiple rows. This removes the restriction of only being able to deal with one row at a time, and only being able to deal with the columns for a single table. This also avoids the need to have separate getters and setters for each individual column as this would promote tight coupling which is supposed to be a Bad Thing ™.

Note that there is not a separate class property for each data column in the table. As the data which comes into a PHP program, either from the user interface or the database, is always presented as an array I do not see any need to insert additional code to split each array into its component parts. This is why I use a standard $fieldarray argument for all application data. This produces maximum flexibility with loose coupling which is far better than the alternative, which is tight coupling.

Database Table (Model) classes

This is item (3) in Figure 5.

Each database table (or business entity) is represented by its own class which extends (is a subclass of) the generic table class. This contains a mixture of invariant methods containing default code plus a selection of "hook" methods which can be copied into each subclass and populated with custom code in order to override the default behaviour.

This component is an implementation of the Table Module pattern from Martin Fowler's Patterns of Enterprise Application Architecture (PoEAA). It also serves as the Model in the Model-View-Controller design pattern.

The class file for each database table does not have to be generated by hand - with the introduction of A Data Dictionary for PHP Applications it is possible to import the table structures directly from the database's INFORMATION SCHEMA into the data dictionary, then to export those structures into files which can be accessed directly by the application code.

When the table class file is initially generated it contains only a small amount of code, as shown in the following example:

<?php
require_once 'std.table.class.inc';
class #tablename# extends Default_Table
{
    // ****************************************************************************
    // class constructor
    // ****************************************************************************
    function __construct ()
    {
        // save directory name of current script
        $this->dirname   = dirname(__file__);
        
        $this->dbname    = '#dbname#';
        $this->tablename = '#tablename#';
        
        // call this method to get original field specifications
        // (note that they may be modified at runtime)
        $this->fieldspec = $this->loadFieldSpec();
        
    } // __construct
    
// ****************************************************************************
} // end class
// ****************************************************************************
?>

The loadFieldSpec() method will load the contents of a separate table structure file into the common table properties which were defined in the abstract table class.

Note that this class can deal with any number of database rows - I do not have one version to deal with a single row and a second version to deal with a collection of rows.

Note that none of these classes produces output in any particular format, such as HTML, PDF, CSV or whatever. All application data within each table object is held in a single untyped array called $fieldarray, and this array is not transformed into another format until it is processed by an external object. The actual formatting is performed by a dedicated View component using whatever raw data is provided by the Model.

Validation class

This is item (5) in Figure 5.

The generic validation class handles primary validation (sometimes referred to as declarative checking) of all user input. It compares the contents of the input array ($fieldarray) with the contents of the $fieldspec array to check that the input data for each field conforms to that field's specifications. It puts any error messages in the current object's $errors array. If this validation was not performed any attempt to write invalid data to the database would produce an SQL error and cause the program to terminate.

This class has two public methods - validateInsertPrimary($fieldarray, $fieldspec) and validateUpdatePrimary($fieldarray, $fieldspec).

The input array (usually the $_POST array or its equivalent) is an array of fieldname=fieldvalue pairs where every value is a string.

The $fieldspec array is an associative array of fieldname=fieldspec pairs. The fieldspec portion is another associative array of keyword=value pairs. This is obtained by reading the contents of the table structure file which is filled with information which was initially extracted from the database's INFORMATION SCHEMA and imported into the Data Dictionary before being exported as a PHP script.

The $errors array is an array of fieldname=errormsg pairs. It can therefore contain error messages for any number of fields.

Primary validation is limited to the following checks:

That all required fields have a non-null value.
That fields do not exceed their maximum size.
That date fields contain valid dates.
That time fields contain valid times.
That numeric fields contain valid numbers, with options for minimum/maximum values and number of decimal places.
A string field may have an optional subtype of email_address which causes the string to be checked against the relevant pattern.
A string field may have an optional subtype of file which causes a check to ensure that a file with that name exists.
Any nullable (optional) field will be set to NULL instead of being left as an empty string.

Secondary validation, such as comparing the contents of one field against the contents of another, must be defined within the individual table subclass using the empty classes provided in the superclass.

It is also possible to supplement the generic validation with the addition of plug-ins, as described in Extending the Validation class.

DML class (Data Access Object)

This is item (6) in Figure 5.

This is the only object in the system which carries out any communication with the physical database. It receives requests from the Generic Table class from which it generates the appropriate DML (Data Manipulation Language) or SQL (Structured Query Language) commands. It then executes these commands by calling the relevant API for the database in question. It does not have a separate version for each table, it has a single version which can handle any table within the database. As it exists in the Data Access layer it can also be referred to as the Data Access Object (DAO).

There is a separate class for each database engine as each engine has its own set of APIs. This design also allows me to isolate and deal with any differences in syntax between the various engines. The name of the class file is in the format dml.???.class.inc where '???' can be MySQL, PostgreSQL, Oracle, SQL Server or whatever. The CONFIG.INC file identifies which database engine is to be used for which database. Although it is usual to have all the databases within a single server instance, it is also possible to have those databases spread across multiple servers on different IP addresses, or even different database engines. It is possible to switch from one database engine to another simply by changing the value in the CONFIG.INC file.

This class is based on the ideas outlined in Using PHP Objects to access your Database Tables (Part 1) and (Part 2). Some of the methods it contains are as follows:

getData ($dbname, $tablename, $where) - will retrieve any number of records from the database using a SELECT statement which is constructed as required. The result is an associative array of 'name=value' pairs indexed by row number. There are pagination options which break down large volumes of data into separate pages for display purposes.
insertRecord ($dbname, $tablename, $fieldarray) - will insert a single row using the contents of $fieldarray (usually the $_POST array). A check is made before the INSERT to ensure that the primary key and any candidate keys are currently unused.
updateRecord ($dbname, $tablename, $newarray, $oldarray) - will update a single row using the contents of $newarray (usually the $_POST array). This is first compared with $oldarray (the current database values) so that only those fields which have changed are included in the DML statement. The identity of the primary key for use in the WHERE clause is extracted using the contents of the $fieldspec array. If any candidate key has changed it is first checked for uniqueness.
deleteRecord ($dbname, $tablename, $fieldarray) - will delete a single row using the contents of $fieldarray. The identity of the primary key for use in the WHERE clause is extracted using the contents of the $fieldspec array.

Note that within a single transaction it is possible to access tables in more than one database and through more than one database engine.

View Object

This is item (7) in Figure 5.

It uses as its input the contents of the screen structure script and all the database table objects which were accessed by the controller script.

The screen structure script identifies which XSL stylesheet to use for the HTML output, and a list of field names which need to be displayed in the data area.

The processing steps are as follows:

Create an XML document which will contain all the data from the database table objects.
Add to this document the structure details from the screen structure script.
Add to this document all the data for the menu bar, title bar, navigation bar, action bar, pagination and scrolling areas.
Perform an XSL transformation using the constructed XML document and specified XSL stylesheet. This will usually be one of the generic stylesheets, although it is possible to create a custom stylesheet for particular circumstances. The result of this transformation will be the HTML output.

The HTML output is the text file which is sent back to the client's web browser. This is rendered into a viewable page with the assistance of one or more CSS files which are the recommended way of specifying a standard style in a group of HTML documents.

There are different view objects for creating the output in different formats, such as PDF or CSV.

Screen Structure scripts

This is item (8) in Figure 5.

These are simple scripts which do nothing but identify the view or content for the output screen. Each one identifies the name of an XSL stylesheet and a list of table names, field names and field labels that will be used during the XSL transformation process to produce the HTML output.

The default script for a transaction is automatically created when that transaction is generated from the Data Dictionary.

Sample scripts for each pattern can be found in the /radicore/default/screens/en/ directory with the name <pattern>.screen.inc.

Scripts for each subsystem can be found in the /radicore/<subsystem>/screens/<language>/ directory. The default value for <language> is 'en' (English), but other language codes can be used - refer to Internationalisation and the Radicore Development Infrastructure for details.

Although the parent LIST screen in Figure 9 will require its own Screen Structure file, all the CHILD screens can share the same one as they all use the same structure. The differences in how the fields are displayed for each of the child components is handled by a combination of the $mode parameter within the Transaction Pattern (Controller) script (insert, update, delete, enquire) and individual field attributes within the XML document. These attributes can be specified within the $fieldspec array for that table class, or can be supplied at runtime through custom code.

Here is a sample file:

<?php
// this identifies which XSL stylesheet to use
$structure['xsl_file'] = 'std.detail1.xsl';

// this identifies which XML data is to go into which XSL zone
$structure['tables']['main'] = 'person';

// this specifies the width of each column
$structure['main']['columns'][] = array('width' => 150);
$structure['main']['columns'][] = array('width' => '*');

// the following may also be used 
$structure['main']['columns'][] = array('class' => 'classname');

// this identifies the label and field which is to be displayed in each row
$structure['main']['fields'][] = array('person_id' => 'ID');
$structure['main']['fields'][] = array('first_name' => 'First Name');
$structure['main']['fields'][] = array('last_name' => 'Last Name');
$structure['main']['fields'][] = array('initials' => 'Initials');
$structure['main']['fields'][] = array('nat_ins_no' => 'Nat. Ins. No.');
$structure['main']['fields'][] = array('pers_type_id' => 'Person Type');
$structure['main']['fields'][] = array('star_sign' => 'Star Sign');
$structure['main']['fields'][] = array('email_addr' => 'E-mail');
$structure['main']['fields'][] = array('value1' => 'Value 1');
$structure['main']['fields'][] = array('value2' => 'Value 2');
$structure['main']['fields'][] = array('start_date' => 'Start Date');
$structure['main']['fields'][] = array('end_date' => 'End Date');
$structure['main']['fields'][] = array('selected' => 'Selected');
?>

In this example there is a single data zone called main which is linked with an object called person. Some screens have two or more zones which are linked to different objects. At runtime the fields will be extracted from each object and displayed in the relevant zone. Note that a field must exist both within the object and within the screen structure file in order for it to be displayed.

The name of this file is provided by the Component script in the $screen variable. It is read in by the view object and its contents are added to the XML document to appear something like this:

<root>
  ......
  <structure>
    <main id="person">
      <columns>
        <column width="150"/>
        <column width="*"/>
      </columns>
      <row>
        <cell label="ID"/>
        <cell field="person_id" />
      </row>
      <row>
        <cell label="First Name"/>
        <cell field="first_name"/>
      </row>
      <row>
        <cell label="Last Name"/>
        <cell field="last_name"/>
      </row>
      <row>
        <cell label="Initials"/>
        <cell field="initials"/>
      </row>
      
      ....

      <row>
        <cell label="Start Date"/>
        <cell field="start_date"/>
      </row>
      <row>
        <cell label="End Date"/>
        <cell field="end_date"/>
      </row>
    </main>
  </structure>
</root>

Several different layouts are now available for displaying user data. For more details on how these can be specified please refer to XSL Structure files in The Model-View-Controller (MVC) Design Pattern for PHP.

As the screen structure file is loaded into memory at the start of each script but not used until the very end, you have the opportunity to make dynamic amendments to the structure before it is used to create and display the HTML output. This is explained in the following links:

XML documents

This is item (9) in Figure 5.

XML (Extensible Markup Language) is a simple but flexible text format. It is based on an open standard which is maintained by the World Wide Web Consortium. It is used in this infrastructure to provide the XSL transformation process with all the data it needs to produce the HTML output.

The XML document is generated automatically at runtime by the view object. The technique which is used to create this file in PHP 4 is described in Using PHP 4's DOM XML functions to create XML documents from SQL data. For PHP 5 and above please refer to Using PHP 5's DOM functions to create XML documents from SQL data instead.

Each XML document can contain any of the following data:

Values from any number of database table objects. These will use the table names and field names obtained from the database. All application data can be extracted from these objects using the standard getFieldArray() method, which avoids the need to have separate getters for each property.
Each field may also have attributes which indicate how the field should be displayed, or to hold an error message.
A list of table names, field names and field labels obtained from the Screen Structure script. For forms of type LIST this will also provide the column headings.
Data to construct the Menu bar.
Data to construct the Navigation bar.
Data to construct the Pagination or Scrolling areas.
Data for the Action bar.
Data for the Message area.
Data for any lookup (picklist) fields such as dropdown lists or radio groups.

XSL Stylesheets

This is item (10) in Figure 5.

Each component requires an XSL stylesheet in order to transform the data in the XML document into HTML output. In an earlier version of this infrastructure I used different stylesheets for each database table which had the table names, field names and field labels all hard-coded, but I have subsequently found a way to use a smaller number of generic stylesheets. Instead of having the field details hard-coded within the stylesheet I am now able to extract that information from within the XML document using information supplied in a Screen Structure script. This is is documented in Reusable XSL Stylesheets and Templates.

Although my whole web application uses fewer than ten generic stylesheets there is still some code which is needed in more than one stylesheet. This code has been extracted and placed in a library of XSL templates which can be incorporated into any stylesheet at runtime by means of an <xsl:include> command. This is, in effect, a library of standard XSL subroutines.

Using the components in Figure 9 as an example I would use a generic LIST stylesheet for the parent component and a generic DETAIL stylesheet for all the child components. Variations in how the individual fields are displayed within the various child components is handled primarily by the $mode variable which is passed as a parameter during the XSL transformation process. This is used as follows:

If $mode = 'input' or 'search' then all fields are editable.
If $mode = 'read' or 'delete' then all fields are non-editable.
if $mode = 'update' then primary key fields are non-editable.
If $mode = 'search' then any boolean fields are given a third option to emulate a tri-state checkbox (yes, no, undefined).

In addition to the $mode parameter the handling of individual fields can be affected by specific attributes in the XML document. These can either be set into the $fieldspec array or altered at runtime using custom code.

The noedit attribute will make the field non-editable.
The nodisplay attribute will make the field invisible.

The type of HTML control (textbox, dropdown, radio group, etc) to be used for each field in the HTML output is completely dynamic in nature. This is a 3 stage process:

The default HTML control is initially defined within the $fieldspec array, but this can be changed at runtime with custom code.
As the contents of each database object is written out to the XML document by the Transaction Pattern (Controller) script various details from the $fieldspec array are included with each field as XML attributes.
During the XSL transformation process a standard XSL template will use the field attributes to build the HTML control to the supplied specifications.

XSL Transformation process

This is item (11) in Figure 5.

This process will take the contents of an XML document and transform it to another document (in this case an HTML document) using rules contained within an XSL stylesheet. These are all open standards which are supervised by the World Wide Web Consortium.

It is possible to send both the XML and XSL files to the client and have the transformation performed within the client's browser (client-side transformation), but this is unreliable due to the different levels (sometimes non-existent) of XML/XSL support in different browsers. It is much safer to perform the transformation in a single place (the web server) where the software is under the control of the web developer. This is known as a server-side transformation.

The technique which I use to perform XSL transformations in PHP 4 is described in Using PHP 4's Sablotron extension to perform XSL Transformations. For PHP 5 and above please refer to Using PHP 5's XSL extension to perform XSL Transformations instead.

HTML output

This is item (12) in Figure 5.

This is the document which is sent back to the client's browser is response to the request. Its content should conform to the HTML 5 specification which is supervised by the Web Hypertext Application Technology Working Group.

In an effort to make my output viewable on as many web browsers as possible I stick to the following guidelines:

All output is HTML 5 which is structurally clean and free of any style details. All style specifications (fonts, colours and layout) are held within separate CSS files.
There is no javascript in the framework, but it does support the ability for individual application subsystems to include javascript in their components.
There are no third party controls or plugins (ActiveX or Flash).
There are no proprietary extensions.

CSS files

This is item (13) in Figure 5.

These are Cascading Style Sheets which hold all the styling information (fonts, colours, sizes, positioning, etc) for all HTML documents produced by the application. The tags within each HTML document refer to a style by a class name, and the specifications for each of these classes is held within a CSS file. In this way it becomes possible to change the style specifications for any tag in all documents simply by changing the specifications within a single CSS file.

The following CSS files are available:

global - a selection of files exist within the CSS subdirectory which set the style for the entire application. It is possible to choose any one of these using the style/theme option in the Update Session data screen. Additional CSS files may be created and copied into this subdirectory, in which case they will automatically become available for selection.
local - if it is required to change the global setting for a CSS element, or to create a new CSS element within a single subsystem, then instead of creating a complete global CSS file it is possible to insert these local modifications into a local CSS file called 'style_custom.css', which exists in every subdirectory. When the HTML document is rendered it will use the contents of both the global and local CSS files. If any setting exists in both files then the local setting will override the global setting.

AUDIT class

This is item (14) in Figure 5.

This class is responsible for detecting all database changes (INSERTs, UPDATEs and DELETEs) and recording them in a separate 'audit' database so that they can be reviewed using online enquiry screens. This is documented in Creating an Audit Log with an online viewing facility.

The only additional code required in any database table class is the setting of a class variable called $audit_logging. By default this is TRUE (the table will be logged) but it can be set to FALSE to disable logging.

Workflow Engine

This is item (15) in Figure 5.

Sometimes when a particular task is performed, such as 'Take Customer Order', this has to be followed by a series of other tasks in a particular sequence such as 'Charge Customer', 'Pack Order' and 'Ship Order'. Without a Workflow Engine these subsequent tasks must be selected and processed manually, which is where mistakes and inefficiencies can arise.

The purpose of a Workflow System is to manage these tasks in a controlled fashion. This system should have the following components:

A method whereby different Workflow processes can be defined. This must identify the triggering task and the sequence of subsequent tasks.
A mechanism which automatically creates a new workflow case when a triggering task is processed, then progresses that case through its various stages.
A method whereby outstanding tasks (workitems) in workflow cases which require human intervention appear in a list which prompts the relevant users that intervention is required. A task should be activated simply by clicking on its entry in this list.
A method whereby the status of any individual workflow case can be reviewed.

The Workflow Engine which I have created as an extension to this development infrastructure is documented in An activity based Workflow Engine for PHP. The engine is activated from within my generic table class therefore no additional programmer coding is required.

Levels of Reusability

Since a major motivation for object-oriented programming is software reuse, it should follow that the effective use of its features can be assessed by the volume of reusable code it produces. When creating an application which contains many components you should be able to compare the amount of code that you have to write with the amount of code that you don't have to write, where the latter is supplied in pre-written and reusable components that can be shared. If you find yourself writing code that has already been written then you are violating the Don't Repeat Yourself (DRY) principle. A library of reusable components provides the following advantages:

The library modules contain code which has already been written to provide certain functionality, therefore it is not necessary to waste more time in writing more code to provide the same functionality.
The library modules contain code which has already been tested, therefore this cuts down the testing phase of new components which use these modules.
By using standard library modules for standard tasks the developer does not have to waste time in reinventing the wheel (and possibly creating a square wheel).
By using standard code to provide certain functionality it means that the same functionality will be provided in a consistent manner across multiple components. This will be less confusing to the user.
By using standard library modules it becomes possible to make a change in a single module and have that change immediately incorporated into every component that references that module.

This framework contains the following reusable components:

45 Page Controller scripts, one for each Transaction Pattern.
1 Generic (abstract) table class which is inherited by every concrete table class. This implements the Template Method Pattern, which has a mixture of invariant and variable methods, for every operation than can be called by a Controller on a Model. It contains thousands of lines of code which is split across 200+ invariant methods and 100+ variable "hook" methods, which means that it provides support for a huge number of activities.
1 Validation class which checks that each piece of user input conforms to its definition within the database.
4 DML class files which generates every SQL query, with a separate class for each supported DBMS.
1 View object for all HTML output, with additional classes for CSV and PDF output.
12 reusable XSL stylesheets which work in conjunction with the generated screen structure scripts to produce the HTML output for any Model.
Several CSS files which specify how each HTML element will be displayed. Several alternatives are supplied so that the user can switch from one to another.

The RADICORE framework is not just a collection of library functions which need to be called by the developer, it is a true framework as it implements the Hollywood Principle (don't call us, we'll call you). The framework itself consists of the following subsystems:

Role Based Access Control (RBAC) to identify users, tasks (user transactions), access roles, menu hierarchies, et cetera. It also provides a standard Login screen.
Audit logging which logs all database updates to an Audit database where they can be viewed using a single set of screens.
Workflow Engine to define workflows so that the completion of one action will automatically trigger another action.
A Data Dictionary used by developers to import the table structures from the database's INFORMATION SCHEMA so that the following components can be generated:
- A separate table class file and table structure file for each database table.
- For each task: a component script and a screen structure script.

The framework is a modular system which is comprised of a number of integrated subsystems, each of which has its own database, its own directory structure in the file system, and its own entries in the framework database. New application subsystems can be added at any time by following these steps:

Create a new subsystem.
Build the directory structure for that subsystem.
After creating your database import its details into the Data Dictionary.
For each table:
1. Export those details to create the class file for each database table.
2. Define any relationships between tables and export the amended table structure file.
3. Generate tasks by selecting a table and linking it to a Transaction Pattern.
4. Modify screen labels, titles and button text (optional)
Custom processing can be added to any Model by overriding any of the "hook" methods.

An idea of the amount of reusability when creating a typical family of forms for a database table can be shown in figure 12:

Figure 12 - Levels of Reusability

This shows the following:

A business entity requires one database table class which is shared by six transactions.
All LIST transactions share the same LIST controller scripts, all INSERT transactions share the same INSERT controller scripts, etc.
All LIST screens share the same list.xsl stylesheet.
All DETAIL (search, insert, update, delete and enquire) transactions share the same screen definition.
All DETAIL (search, insert, update, delete and enquire) screens share the same detail.xsl stylesheet.

I have spent 20+ years working for software houses where the task has been to develop many different applications for many different customers in a swift and cost-effective manner. To be truly reusable an infrastructure should not only work for different components within the same application but for different applications entirely. This offers two advantages:

It avoids the necessity of constructing a new infrastructure for each new application.
It avoids the non-productive time in learning a new infrastructure when developers switch from one application to another.

I have witnessed at first hand the advantages of being able to use such infrastructures as I have used them with two entirely different languages - COBOL and UNIFACE. As I was personally responsible for creating those two infrastructures I had all the relevant experience to create a new one in PHP.

As a final example I have used this framework to build a large ERP application the first version of which was called TRANSIX which went live in 2008, but since 2014 has been extended into a product called the GM-X Application Suite. This is comprised of the following:

20 subsystems, each with its own database.
450+ Model classes, one for each database table. Because of the Template Method Pattern each table subclass need only contain table-specific code in the various "hook" methods.
1,200+ relationships, often connecting one subsystem to another.
4,000+ component scripts, one for each task (user transaction).

More information regarding the levels of reusability which I have achieved can be found in the following:

The advantage of having all this reusable code at your disposal is that you don't have to spend time writing it yourself. This also means that you don't have to spend time in designing the code that you don't have to write, as explained in How much time can be saved.

The Path to Reusability

Some of you may be wondering how exactly I achieved these levels of reusability. The answer is quite simple - I did not follow what were later known as "best practices" for the simple reason that I did not know that they existed. In the previous two decades I had encountered numerous documents called Programming Standards where each organisation, and sometimes each team within that organisation, had its own set of standards. As I gained more and more experience I began to see flaws in each of these documents, which is why I began to create my own version using a mix'n'match approach where I took the ideas I liked, ignored the ideas which I didn't like, and sprinkled in a few ideas of my own. When I came to build my own PHP software I did not start by building an application, I built a framework which would allow me to build any application I wanted. This was based on the frameworks which I had built previously in COBOL and UNIFACE, so I already knew what needed to be done. All I had to do was work out how to do it in this new language.

After reading the PHP online manual, and looking at sample code which I found in some online tutorials and books which I bought, I came to the conclusion that PHP offered the following features, which were not available in my previous languages, which could be used to generate a higher volume of reusable code:

Encapsulation - Where the data (properties) for an entity, and the operations (methods) which can be performed on that data, are placed into a single capsule known as a class. All interaction with that entity should be done through an object which is built from that class.
Inheritance - This promotes code reuse since methods and properties shared by several classes can be placed in a superclass, and new classes can start off having code available by inheriting from that superclass. Generally it is methods which are reused more often than properties.
Polymorphism - When several objects share the same method signatures it is possible to have a piece of code which calls that signature, but where it uses a different object at run time using a mechanism called Dependency Injection. In this case it is the calling code which is reused, not the inherited code.

My previous experience with database applications had taught me the following:

Every table in the database is a separate entity with its own structure and its own business rules.
Every table in the database is subject to exactly the same operations - Create, Read, Update and Delete (CRUD).
Every application is comprised of a number of tasks (user transactions) where each task performs one or more operations on one or more tables.
The program for each task requires two types of code:
- Standard boilerplate code to handle the movement of data from the screen to the database, and then from the database to the screen (or other output format).
- Unique code to handle the business rules or task-specific behaviour.

Before I started on my new framework I had already made the following decisions:

It would be based on the 3 Tier Architecture which I had encountered in UNIFACE. This was actually quite easy as programming with objects is automatically 2 tier to begin with.
Instead of creating each HTML page individually I had already noticed recurring patterns among the screen layouts, so I decided to use a templating system to generate screens from these patterns. I decided to use XML and XSL as I had already encountered them in my work with UNIFACE and had also experimented with them on my home computer.
I had already demonstrated the ability to create libraries of reusable components instead of duplicating blocks of code in numerous places, so I set out to use the object-oriented capabilities of PHP to produce as many reusable components as possible.

I decided on the following implementation details:

I would have a separate class for each database table. The class would have knowledge of the table's structure as well as being responsible for all the business rules concerning that table.
The class would have separate methods to deal with each of the four CRUD operations. While all the code samples which I saw had separate methods for load(), validate() and store() I had already learned that when a group of functions always has to be executed in the same order that a wise programmer would create a wrapper function for that group so that he could call that group with a single statement instead of a separate statement for each member of that group. These "wrapper methods" are outlined in common table methods.
I already knew enough about SQL to realise that if it was necessary to retrieve data from several related tables that it was more efficient to create a single SELECT query containing JOIN statements to the related tables instead of being forced to generate a separate query for each table. In some cases the JOIN statements can be automatically included by the framework. In other cases the SELECT query can be customised using code in the _cm_pre_getData() method.
I noticed in all the code samples that every table column was given its own property in the class, which then required a separate line of code to put data into or get data out of that property. I also noticed that when a form's data was sent to the script from the client's browser it appeared in the form of the $_POST array, and when a row was retrieved from the database it the form of the FETCH array. It struck me that all that effort in deconstructing each of those arrays into its component parts so that each part could be accessed separately and by name was unnecessary effort. I had played with PHP arrays and discovered that they were far better than anything on offer in my previous languages, so I decided to avoid those unnecessary lines of code by passing all that data from one component to another as a single $fieldarray variable. Some programmers should recognise this approach is a prime example of loose coupling which is far better than tight coupling.
In all the codes samples which I saw each Model class required its own Controller to handle all the user transactions for that Model. I was taught the same method in my early COBOL days, but I later learned that there were advantages in splitting large multi-purpose programs into smaller single-purpose units. This is why I created a separate Controller for each user transaction where each could handle no more than a single output format.

Because of the simple decisions described above I was able, using the facilities available in PHP and my own intellect, to refactor the code to increase the volume of code which I could reuse and therefore decrease the volume of code which I had to write. This refactoring took the following forms:

As each table was subject to the same set of CRUD operations which were duplicated in the corresponding set of table methods I was able to move those methods to an abstract table class which could then be inherited by each concrete table (Model) class.
Originally I coded each table's list of column names and their specifications into the $fieldspec array of each class by hand, but I later invested the time and effort in creating a Data Dictionary so that I could extract all the information I needed from the database's INFORMATION SCHEMA and store it in my own database where it could be tailored before being exported to produce a table class file and a table structure file for each table. Note that if a table's structure ever changes all I have to do is re-import the updated structure into the Data Dictionary and then re-export it to regenerate the table structure file. This process will not replace the table class file as it may have been updated to include some custom code.
I already knew that it was necessary to validate all user input before passing it to the database, but now that I had a single $fieldarray array containing all the user data and a $fieldspec array containing all the specifications for each field (column) it was a straightforward process to write a validation class to perform this validation by comparing the contents of one array with the other. It was also easy to add a call to this validation object inside the common table methods so this validation could be performed automatically without any effort from the developer. Note that the validation rules are slightly different depending on which operation is being performed.
When I wanted to perform some additional validation or add business rules to a table class I would take advantage of the wrapper methods inside the abstract class by adding calls to empty methods which can be overridden in any concrete subclass. These empty methods are known as "hook" methods and form part of the Template Method Pattern.
After I had created a set of Controllers to handle the user transactions described in this family of forms for my first database table I set about building a similar set of Controllers for my second database table. This resulted in a great deal of duplicated code, so I set about separating the similar from the different so that I could put the similar into a reusable module and isolate the different in a unique module. I quickly noticed that the only difference was the name of the table with which the Controller communicated, so I created a separate component script for each user transaction and modified each Controller so that it could be used with any Model. This was made possible by virtue of the fact that each Model shares exactly the same set of methods and all application data is passed around in a single $fieldarray variable. This means that no Controller has any hard-coded references to any table names or any column names.
Because of my decision to build all HTML pages using XSL stylesheets I was able to create a single View object to perform this task. Extracting all application data from any Model could be accomplished with a single call to the getFieldArray() method, so I did not have to worry about creating different versions for different tables because of explicit references to individual property/column names. The code to copy the contents of this array to an XML document was straightforward, as described in Using PHP 5's DOM functions to create XML files from SQL data. All that was left was to load in the relevant XSL stylesheet and perform the XSL transformation. Originally I created a separate stylesheet for each HTML page, but I was able later to replace these custom built stylesheets with a set of reusable XSL stylesheets where the specifics of each individual page are now defined in a separate screen structure script.

The end result of all these steps was a large amount of reusable code which is built into the framework and which does not have to be written by the developer:

All Models are generated by the framework using the Export Table to PHP function and inherit all their standard code from an abstract table class which is built into the framework.
All Views are pre-written and built into the framework. Each can work with any Model in the application.
All XSL stylesheets which are used in the HTML View are pre-written and built into the framework. Each is designed to work with one or more specific Controllers.
All Page Controllers are pre-written and built into the framework. Each will perform a pre-defined set of operations on any given Model.
All Data Access Objects are pre-written and built into the framework and can operate on any table in the database.

That is a HUGE amount of code which does not have to be written, but there is a small amount which is left over. However, the fact that every user transaction (task) can be fulfilled by a pre-written Controller and a pre-written View has enabled me to catalog each of the possible combinations in my library of Transaction Patterns. This in turn has made it possible for me to generate the code for a user transaction by using the Select Transaction Pattern and Generate PHP scripts functions which link a Model to a pattern.

What this means is that it is possible to create a new table in your database and generate the transactions to view and maintain the contents of that table in a matter of minutes without writing *ANY* code whatsoever - no PHP, no HTML, no SQL. If you don't believe me then I encourage you to watch this video. If you think that YOUR favourite framework is capable of matching that level of productivity then I dare you to take this challenge.

While the generated tasks will only perform the basic functionality the code to handle the more specific business rules can be added in later using any of the empty "hook" methods which are defined in the abstract class but which can be overridden in each concrete subclass.

You should be able to see that each of the above decisions I took was made with one outcome in mind - to increase the amount of code which could be reused. By increasing the amount of code that you *DON'T* have to write I am decreasing the amount of code that you *DO* have to write, and even the dumbest of programmers should realise that there is nothing smaller than no code at all.

Extending this Infrastructure

Having an infrastructure with which you can build applications is one thing, but in my long experience I have also found it useful to build utility components which may be of benefit to those applications. This type of component is not specific to any particular application, but may be used in conjunction with any number of different applications. Amongst those I have built are:

A Role Based Access Control (RBAC) system (also known as a Menu and Security system) which can act as the front end for any application. It allows tasks to be defined in a hierarchy of menus and allows user access to individual tasks to be granted or denied to individual user roles. This is documented in:
- A Role-Based Access Control (RBAC) system for PHP
- User Guide to the Menu and Security (RBAC) System
An audit facility which records all changes made to application databases in a separate 'audit' database so that they may be reviewed using online enquiry screens. This is documented in:
- Creating an Audit Log with an online viewing facility.
An activity based Workflow Engine for PHP which is built around Petri Nets. This allows workflow processes to be defined, creates workflow cases when the triggering event is fired, and adds workitems to user menu screens. This is documented in:
- An activity based Workflow Engine for PHP
A Data Dictionary which allows details of an application database to be imported, modified and exported for use in PHP scripts, thus saving the need to create these details by hand. This is documented in:
- A Data Dictionary for PHP Applications
The capability to allow program-generated text (error messages, screen titles, field labels, button text, etc) to be displayed in more than one language. This is documented in:
- Internationalisation and the Radicore Development Infrastructure
A wide-ranging set of security features. This is documented in:
- The RADICORE Security Model

References

These are reasons why I consider some ideas on how to do OOP "properly" to be complete rubbish:

Amendment History

01 May 2024	Added The Path to Reusability.
02 Oct 2023	Updated Generic (abstract) table class to include common table properties and common table methods.
04 Feb 2023	Added Object Classification.
30 April 2012	Added a description for View object.
15 July 2005	Added a reference to a new article entitled Internationalisation and the Radicore Development Infrastructure.
21 June 2005	Amended Screen Structure scripts to show the new layouts caused by the provision of more flexible options.
17 June 2005	Added a reference to a new article entitled A Data Dictionary for PHP Applications.
16 Sep 2004	Added a reference to a new article entitled An activity based Workflow Engine for PHP.
10 Sep 2004	Added section Extending this Infrastructure.
10 Aug 2004	Added better descriptions for the individual components. Moved all the Frequently Asked Questions to a separate document.
03 June 2004	Added a section on Style which explains the advantages of using Cascading Style Sheets.
02 May 2004	Added a reference to The Model-View-Controller (MVC) Design Pattern for PHP.
28 Apr 2004	Added a reference to Reusable XSL Stylesheets and Templates which describes a method which enables me to use a single generic stylesheet for many database tables instead of having a customised stylesheet for each individual database table.
10 Nov 2003	Created a sample application to demonstrate the techniques described in this document. This is described in A Sample PHP Application. The code can be run on my website here, or can be downloaded here and run locally.
08 Sep 2003	Split my abstract database class again so that the code which performs generic validation (declarative checks) is now contained within its own class. Refer to Business layer for details.
31 Aug 2003	Split my abstract database class into two so that the construction and execution of all DML statements is now contained within its own class. Refer to Data Access layer for details.

counter