452 lines
		
	
	
		
			20 KiB
		
	
	
	
		
			Plaintext
		
	
	
	
	
	
			
		
		
	
	
			452 lines
		
	
	
		
			20 KiB
		
	
	
	
		
			Plaintext
		
	
	
	
	
	
=head1 DEVELOPER NOTES
 | 
						|
 | 
						|
This document aims to help developers understand the intent and design of the
 | 
						|
code within Netdisco. Patches and feedback are always welcome :-)
 | 
						|
 | 
						|
=head1 Introduction
 | 
						|
 | 
						|
This release of Netdisco is built as a L<Dancer> application, and uses many
 | 
						|
modern technologies and techniques. Hopefully this will make the code easier
 | 
						|
to manage and maintain in the long term.
 | 
						|
 | 
						|
Although Dancer is a web application framework, it provides very useful tools
 | 
						|
for command line applications as well, namely configuration file management
 | 
						|
and database connection management. We make use of these features in the
 | 
						|
daemon and deployment scripts.
 | 
						|
 | 
						|
Overall the application tries to be as self-contained as possible without also
 | 
						|
needing an excessive number of CPAN modules to be installed. However, Modern
 | 
						|
Perl techniques have made dependency management almost a non-issue, and
 | 
						|
Netdisco can be installed by and run completely within an unprivileged user's
 | 
						|
account, apart from the PostgreSQL database setup.
 | 
						|
 | 
						|
Finally the other core component of Netdisco is now a L<DBIx::Class> layer for
 | 
						|
database access. This means there is no SQL anywhere in the code, but more
 | 
						|
important, we can re-use the same complex queries in different parts of
 | 
						|
Netdisco.
 | 
						|
 | 
						|
The rest of this document discusses each "interesting" area of the Netdisco
 | 
						|
codebase, hopefully in enough detail that you can get hacking yourself :-)
 | 
						|
 | 
						|
=head1 Versioning
 | 
						|
 | 
						|
This is Netdisco major version 2. The minor version has six digits, which are
 | 
						|
split into two components of three digits each. It's unlikely that the major
 | 
						|
version number (2) will increment. Each "feature" release to CPAN will
 | 
						|
increment the first three digits of the minor version. Each "bug fix" release
 | 
						|
will increment the second three digits of the minor version.
 | 
						|
 | 
						|
Stable releases will have an even "feature" number. Beta releases will have an
 | 
						|
odd "feature" number and also a suffix with an underscore, to prevent CPAN
 | 
						|
indexing the distribution. Some examples:
 | 
						|
 | 
						|
 2.002002     - "feature" release 2, "bug fix" release 2
 | 
						|
 2.002003     - another bug was found and fixed, hence "bug fix" release 3
 | 
						|
 2.003000_001 - first beta for the next "feature" release
 | 
						|
 2.003000_002 - second beta
 | 
						|
 2.004001     - the next "feature" release
 | 
						|
 | 
						|
 | 
						|
=head1 Global Configuration
 | 
						|
 | 
						|
Dancer uses YAML as its standard configuration file format, which is flexible
 | 
						|
enough for our needs, yet still simple to edit for the user. We no longer need
 | 
						|
a parser as in the old version of Netdisco.
 | 
						|
 | 
						|
At the top of scripts you'll usually see something like:
 | 
						|
 | 
						|
 use App::Netdisco;
 | 
						|
 use Dancer ':script';
 | 
						|
 | 
						|
First, this uses C<App::Netdisco>, which is almost nothing more than a
 | 
						|
placeholder module (contains no actual application code). What it does is set
 | 
						|
several environment variables in order to locate the configuration files.
 | 
						|
 | 
						|
Then, when we call "C<use Dancer>" these environment variables are used to
 | 
						|
load two YAML files: C<config.yml> and C<< <environment>.yml >> where
 | 
						|
C<< <environment> >> is typically either C<production> or C<development>.
 | 
						|
 | 
						|
The concept of "environments" allows us to have some shared "master" config
 | 
						|
between all instances of the application (C<config.yml>), and then settings
 | 
						|
for specific circumstances. Typically this might be logging levels, for
 | 
						|
example. The default file which C<App::Netdisco> loads is C<development.yml>
 | 
						|
but you can override it by setting the "C<DANCER_ENVIRONMENT>" environment
 | 
						|
variable.
 | 
						|
 | 
						|
Dancer loads the config using YAML, merging data from the two files. Config is
 | 
						|
made available via Dancer's C<setting('foo')> subroutine, which is exported.
 | 
						|
So now the C<foo> setting in either config file is easily accessed.
 | 
						|
 | 
						|
Another line commonly seen in scripts is this:
 | 
						|
 | 
						|
 use Dancer::Plugin::DBIC 'schema';
 | 
						|
 | 
						|
This plugin saves a lot of effort by taking some database connection
 | 
						|
parameters from the configuration file, and instantiating DBIx::Class database
 | 
						|
connections with them. The connections are managed transparently so all we
 | 
						|
need to do to access the Netdisco database, with no additional setup, is:
 | 
						|
 | 
						|
 schema('netdisco')->resultset(...)->search({...});
 | 
						|
 | 
						|
 | 
						|
=head1 DBIx::Class Layer
 | 
						|
 | 
						|
DBIx::Class, or DBIC for short, is an Object-Relational Mapper. This means it
 | 
						|
abstracts away the SQL of database calls, presenting a Perl object for each
 | 
						|
table, set of results from a query, table row, etc. The advantage is that it
 | 
						|
can generate really smart SQL queries, and these queries can be re-used
 | 
						|
throughout the application.
 | 
						|
 | 
						|
The DBIC layer for Netdisco is based at L<App::Netdisco::DB>. This is the
 | 
						|
global schema class and below that, under L<App::Netdisco::DB::Result> is a
 | 
						|
class for each table in the database. These contain metadata on the columns
 | 
						|
but also several handy "helper" queries which can be called.  There are also
 | 
						|
C<ResultSet> classes which provide additional "pre-canned" queries.
 | 
						|
 | 
						|
Netdisco's DBIx::Class layer has excellent documentation which you are
 | 
						|
encouraged to read, particularly if you find it difficult to sleep.
 | 
						|
 | 
						|
=head2 Results and ResultSets
 | 
						|
 | 
						|
In DBIC a C<Result> is a table and a C<ResultSet> is a set of rows retrieved
 | 
						|
from the table as a result of a query (which might be all the rows, of
 | 
						|
course). This is why we have two types of DBIC class.
 | 
						|
Items in the C<Result> generally relate to the single table
 | 
						|
directly, and simply. In the C<ResultSet> class are more complex search
 | 
						|
modifiers which might synthesize new "columns" of data (e.g. formatting a
 | 
						|
timestamp) or subroutines which accept parameters to customize the query.
 | 
						|
 | 
						|
However, regardless of the actual class name, you access them in the same way.
 | 
						|
For example the C<device> table has an L<App::Netdisco::DB::Result::Device>
 | 
						|
class and also an L<App::Netdisco::DB::ResultSet::Device> class. DBIC merges
 | 
						|
the two:
 | 
						|
 | 
						|
 schema('netdisco')->resultset('Device')->get_models;
 | 
						|
 | 
						|
=head2 Virtual Tables (VIEWs)
 | 
						|
 | 
						|
Where we want to simplify our application code even further we can either
 | 
						|
install a VIEW in PostgreSQL, or use DBIx::Class to synthesize the view
 | 
						|
on-the-fly. Put simply, it uses the VIEW definition as the basis of an SQL
 | 
						|
query, yet in the application we treat it as a real table like any other.
 | 
						|
 | 
						|
Some good examples are a fake table of only the active Nodes (as opposed to
 | 
						|
all nodes), or the more complex list of all ports which are connected together
 | 
						|
(C<DeviceLink>).
 | 
						|
 | 
						|
All these tables live under the
 | 
						|
L<App::Netdisco::DB::Result::Virtual> namespace, and so you
 | 
						|
access them like so (for the C<ActiveNode> example):
 | 
						|
 | 
						|
 schema('netdisco')->resultset('Virtual::ActiveNode')->count;
 | 
						|
 | 
						|
=head2 Versioning and Deployment
 | 
						|
 | 
						|
To manage the Netdisco schema in PostgreSQL we use DBIx::Class's deployment
 | 
						|
feature. This attaches a version to the schema and provides all the code to
 | 
						|
check the current version and do whatever is necessary to upgrade.
 | 
						|
The schema version is stored in a new table called
 | 
						|
C<dbix_class_schema_versions>, although you should never touch it.
 | 
						|
 | 
						|
The C<netdisco-db-deploy> script included in the distribution performs the
 | 
						|
following services:
 | 
						|
 | 
						|
 * Installs the dbix_class_schema_versions table
 | 
						|
 * Upgrades the schema to the current distribtion's version
 | 
						|
 | 
						|
This works both on an empty, new database, and a legacy database from the
 | 
						|
existing Netdisco release, in a non-destructive way. For further information
 | 
						|
see L<DBIx::Class::Schema::Versioned> and the C<netdisco-db-deploy> script.
 | 
						|
 | 
						|
The files used for the upgrades are shipped with this distribution and stored
 | 
						|
in the C<.../App/Netdisco/DB/schema_versions> directory. They are generated
 | 
						|
using the C<nd-dbic-versions> script which also ships with the distribution.
 | 
						|
 | 
						|
=head2 Foreign Key Constraints
 | 
						|
 | 
						|
We have not yet deployed any FK constraints into the Netdisco schema. This is
 | 
						|
partly because the current poller inserts and deletes entries from the
 | 
						|
database in an order which would violate such constraints, but also because
 | 
						|
some of the archiving features of Netdisco might not be compatible anyway.
 | 
						|
 | 
						|
Regardless, a lack of FK constraints doesn't upset DBIx::Class. The
 | 
						|
constraints can easily be deployed in a future release of Netdisco.
 | 
						|
 | 
						|
 | 
						|
=head1 Web Application
 | 
						|
 | 
						|
The Netdisco web app is a "classic" Dancer app, using most of the bundled
 | 
						|
features which make development really easy. Dancer is based on Ruby's Sinatra
 | 
						|
framework. Its style is for many "helper" subroutines to be exported into the
 | 
						|
application namespace, to do things such as access request parameters,
 | 
						|
navigate around the "handler" subroutines, manage response headers, and so on.
 | 
						|
 | 
						|
Pretty much anything you want to do in a web application has been wrapped up
 | 
						|
by Dancer into a neat helper routine that does the heavy lifting. This
 | 
						|
includes configuration and database connection management, as was discussed
 | 
						|
above. Also, templates can be executed and Netdisco uses the venerable
 | 
						|
L<Template::Toolkit> engine for this.
 | 
						|
 | 
						|
Like most web frameworks Dancer has a concept of "handlers" which are
 | 
						|
subroutines to which a specific web request is routed. For example if the user
 | 
						|
asks for "C</device>" with some parameters, the request ends up at the
 | 
						|
L<App::Netdisco::Web::Device> package's "C<get '/device'>" handler. All this
 | 
						|
is done automatically by Dancer according to some simple rules. There are also
 | 
						|
"wrapper" subroutines which we use to do tasks such as setting up data lookup
 | 
						|
tables, and handling authentication.
 | 
						|
 | 
						|
Dancer also supports AJAX very well, and it is used to retrieve most of the
 | 
						|
data in the Netdisco web application in a dynamic way, to respond to search
 | 
						|
queries and avoid lengthy page reloads. You will see the handlers for AJAX
 | 
						|
look similar to those for GET requests but do not use Template::Toolkit
 | 
						|
templates.
 | 
						|
 | 
						|
Compared to the current Netdisco, the handler routines are very small. This is
 | 
						|
because (a) they don't include any HTML - this is delegated to a template, and
 | 
						|
(b) they don't include an SQL - this is delegated to DBIx::Class. Small
 | 
						|
routines are more manageable, and easier to maintain. You'll also notice use
 | 
						|
of modules such as L<Net::MAC> and L<NetAddr::IP::Lite> to simplify and make
 | 
						|
more robust the handling of data.
 | 
						|
 | 
						|
=head2 Running the Web App
 | 
						|
 | 
						|
Dancer apps conform to the "PSGI" standard interface for web applications,
 | 
						|
which makes for easy deployment under many stacks such as Apache, FCGI, etc.
 | 
						|
See L<Dancer::Deployment> for more detail.
 | 
						|
 | 
						|
At a minimum Netdisco can run from within its own user area as an unprivileged
 | 
						|
user, and ships with a simple web server engine (see the user docs for
 | 
						|
instructions). The C<netdisco-web> script uses L<Daemon::Control> to daemonize
 | 
						|
this simple web server so you can fire-and-forget the Netdisco web app without
 | 
						|
much trouble at all. This script in turn calls C<netdisco-web-fg> which is the
 | 
						|
real Dancer application, that runs in the foreground if called on its own.
 | 
						|
 | 
						|
All web app code lives below L<App::Netdisco::Web>, but there are also some
 | 
						|
helper routines in L<App::Netdisco::Util::Web> (for example sorting device
 | 
						|
port names).
 | 
						|
 | 
						|
=head2 Authentication
 | 
						|
 | 
						|
Dancer includes (of course) good session management using cookies and a memory
 | 
						|
database. You should change this to a disk database if using a proper forking
 | 
						|
web server installation so that sessions are available to all instances.
 | 
						|
 | 
						|
Session and authentication code lives in L<App::Netdisco::Web::AuthN>. It is
 | 
						|
fully backwards compatible with the existing Netdisco user management, making
 | 
						|
use of the database users and their MD5 passwords.
 | 
						|
 | 
						|
There is also support for unauthenticated access to the web app (for instance
 | 
						|
if you have some kind of external authentication, or simply trust everyone).
 | 
						|
 | 
						|
=head2 Templates
 | 
						|
 | 
						|
In the C<share/views> folder of this distribution you'll find all the
 | 
						|
Template::Toolkit template files, with C<.tt> extensions. Dancer first loads
 | 
						|
C<share/views/layouts/main.tt> which is the main page wrapper, that has the HTML
 | 
						|
header and so on. It then loads other templates for sections of the page body.
 | 
						|
This is a typical Template::Toolkit "wrapper" configuration, as noted by the
 | 
						|
C<[% content %]> call within C<main.tt> that loads the template you actually
 | 
						|
specified in your Dancer handler.
 | 
						|
 | 
						|
All templates (and Javascript and Stylesheets) are shipped in the Dancer
 | 
						|
distribution and located automatically by the application (using the
 | 
						|
environment variables which L<App::Netdisco> set up). The user doesn't have to
 | 
						|
copy or install any files.
 | 
						|
 | 
						|
There's a template for the homepage called C<index.tt>, then separate
 | 
						|
templates for searching, displaying device details, and showing inventory.
 | 
						|
These are, pretty much, all that Netdisco ever does.
 | 
						|
 | 
						|
Each of these pages is designed in a deliberately similar way, with re-used
 | 
						|
features. They each can have a "sidebar" with a search form (or additional
 | 
						|
search parameters). They also can have a tabbed interface for sub-topics.
 | 
						|
 | 
						|
Here's where it gets interesting. Up till now the page content has been your
 | 
						|
typical synchronous page load (a single page comprised of many templates) in
 | 
						|
response to a GET request. However the content of the tabs is not within this.
 | 
						|
Each tab has its content dynamically retrieved via an AJAX request back to the
 | 
						|
web application. Javscript triggers this automatically on page load.
 | 
						|
 | 
						|
This feature allows the user to search and search again, each time refreshing
 | 
						|
the data they see in the tab but without reloading the complete page with all
 | 
						|
its static furniture. AJAX can, of course, return any MIME type, not only JSON
 | 
						|
but also HTML content as in this case. The templates for the tabs are
 | 
						|
organised below C<share/views/ajax/...> in the distribution.
 | 
						|
 | 
						|
=head2 Stylesheets
 | 
						|
 | 
						|
The main style for Netdisco uses Twitter Bootstrap, which is a stylish modern
 | 
						|
library of styles and javascript used on many websites. It does a lot of heavy
 | 
						|
lifting, providing simple CSS classes for all of the standard web page
 | 
						|
furniture (forms, tables, etc). Check out the documetation at the Twitter
 | 
						|
Bootstrap web site for more information.
 | 
						|
 | 
						|
These stylesheets are of course customised with our own C<netdisco.css>. We
 | 
						|
try to name all CSS classes with a prefix "C<nd_>" so as to be distinct from
 | 
						|
Twitter Bootstrap and any other active styles.
 | 
						|
 | 
						|
All stylesheets are located in the C<share/public/css> folder of the
 | 
						|
distribution and, like the templates, are automatically located and served by
 | 
						|
the Netdisco application. You can also choose to serve this content statically
 | 
						|
via Apache/etc for high traffic sites.
 | 
						|
 | 
						|
Although Twitter Bootstrap ships with its own set of icons, we use an
 | 
						|
alternative library called Fontawesome. This plugs in easily to Bootstrap and
 | 
						|
provides a wider range of scaleable vectored icons which are easy to use.
 | 
						|
 | 
						|
=head2 Javascript
 | 
						|
 | 
						|
Of course many parts of the Netdisco site use Javascript, beginning with
 | 
						|
retrieving the page tab content itself. The standard library in use is jQuery,
 | 
						|
and the latest version is shipped with this distribution.
 | 
						|
 | 
						|
Many parts of the Netdisco site have small Javscript routines. The code for
 | 
						|
these, using jQuery as mentioned, lives in two places. The main C<netdisco.js>
 | 
						|
file is loaded once in the page HTML header, and lives in
 | 
						|
C<share/public/javascripts/netdisco.js>. There's also a
 | 
						|
C<netdisco_portcontrol.js> which is included only if the current user has Port
 | 
						|
Control rights.
 | 
						|
 | 
						|
Netdisco also has Javascript routines specific to the device search or device
 | 
						|
details pages, and these files are located in C<share/views/js/...> because
 | 
						|
they're loaded within the page body by the templates. These files contain a
 | 
						|
function C<inner_view_processing> which is called each time AJAX delivers new
 | 
						|
content into a tab in the page (think of it like a callback, perhaps).
 | 
						|
 | 
						|
Also in the C<share/public/javascripts/...> folder are the other public
 | 
						|
libraries loaded by the Netdisco application:
 | 
						|
 | 
						|
The Toastr library is used for "Growl"-like notifications which appear in the
 | 
						|
corner of the web browser and then fade away. These notify the user of
 | 
						|
successful background job submission, and jos results.
 | 
						|
 | 
						|
The d3 library is a graphics toolkit used to display the NetMap feature. This
 | 
						|
works differently from the old Netdisco in that everything is generated
 | 
						|
on-the-fly using SQL queries (C<DeviceLinks> resultset) and this d3 library
 | 
						|
for rendering.
 | 
						|
 | 
						|
Finally Twitter Bootstrap also ships with a toolkit of helpful Javascript
 | 
						|
driven features such as the tooltips and collapsers.
 | 
						|
 | 
						|
 | 
						|
=head1 Job Daemon
 | 
						|
 | 
						|
The old Netdisco has a job control daemon which processes "port control"
 | 
						|
actions and also manual requests for device polling. The new Netdisco also has
 | 
						|
a daemon, although it is a true separate process and set of libraries from the
 | 
						|
web application. However, it still makes use of the Dancer configuration and
 | 
						|
database connection management features mentioned above.
 | 
						|
 | 
						|
The job daemon is backwards compatible with the old Netdisco database job
 | 
						|
requests table, although it doesn't yet log results in the same way. Most
 | 
						|
important, it cannot yet poll any devices for discovery or macsuck/arpnip,
 | 
						|
although that's next on the list!
 | 
						|
 | 
						|
All code for the job daemon lives under the L<App::Netdisco::Daemon> namespace
 | 
						|
and like the rest of Netdisco is broken down into manageable chunks.
 | 
						|
 | 
						|
=head2 Running the Job Daemon
 | 
						|
 | 
						|
Like the web application, the job daemon is fully self contained and runs via
 | 
						|
two simple scripts shipped with the distribution - one for foreground and one
 | 
						|
for background execution (see the user docs for instructions).
 | 
						|
 | 
						|
The C<netdisco-daemon> script uses L<Daemon::Control> to daemonize so you can
 | 
						|
fire-and-forget the Netdisco job daemon without much trouble at all. This
 | 
						|
script in turn calls C<netdisco-daemon-fg> which is the real application, that
 | 
						|
runs in the foreground if called on its own.
 | 
						|
 | 
						|
=head2 Daemon Engineering
 | 
						|
 | 
						|
The job daemon is based on the L<MCE> library, which handles the forking and
 | 
						|
management of child processes doing the actual work. This actually runs in the
 | 
						|
foreground unless wrapped with Daemon::Control, as mentioned above. MCE
 | 
						|
handles three flavours of "worker" for different tasks.
 | 
						|
 | 
						|
One goal that we had designing the daemon was that sites should be able to run
 | 
						|
many instances on different servers, with different processing capacities.
 | 
						|
This is both to take advantage of more processor capability, but also to deal
 | 
						|
with security zones where you might only be able to manage a subset of devices
 | 
						|
from certain locations. Netdisco has always coped well with this via its
 | 
						|
C<discover_*> and similar configuration, and the separate poller process.
 | 
						|
 | 
						|
So, the single Manager "worker" in the daemon is responsible for contacting
 | 
						|
the central Netdisco database and booking out jobs which it's able to service
 | 
						|
according to the local configuration settings. Jobs are "locked" in the
 | 
						|
central queue and then copied to a local job queue within the daemon.
 | 
						|
 | 
						|
Along with the Manager we start zero or more of two other types of worker.
 | 
						|
Some jobs such as port control are "interactive" and the user typically wants
 | 
						|
quick feedback on the results. Others such as polling are background tasks
 | 
						|
which can take more time and are less schedule sensitive. So as not to starve
 | 
						|
the "interactive" jobs of workers we have two types of worker.
 | 
						|
 | 
						|
The Interactive worker picks jobs from the local job queue relating to device
 | 
						|
and port reconfiguration only. It submits results directly back to the central
 | 
						|
Netdisco database.
 | 
						|
 | 
						|
The Poller worker (is not yet written!) and similarly picks job from the local
 | 
						|
queue, this time relating to device discovery and polling.
 | 
						|
 | 
						|
There is support in the daemon for the workers to pick more than one job at a
 | 
						|
time from the local queue, in case we decide this is worth doing. However the
 | 
						|
Manager won't ever book out more jobs from the central Netdisco job queue than
 | 
						|
it has workers available (so as not to hog jobs for itself against other
 | 
						|
daemons on other servers). The user is free to configure the number of
 | 
						|
Interactive and Poller workers in their C<config.yml> file (zero or more of
 | 
						|
each).
 | 
						|
 | 
						|
=head2 SNMP::Info
 | 
						|
 | 
						|
The daemon obviously needs to use L<SNMP::Info> for device control. All the
 | 
						|
code for this has been factored out into the L<App::Netdisco::Util> namespace.
 | 
						|
 | 
						|
The L<App::Netdisco::Util::Connect> package provides for the creation of
 | 
						|
SNMP::Info objects along with connection tests. So far, SNMPv3 is not
 | 
						|
supported. To enable trace logging of the SNMP::Info object simply set the
 | 
						|
C<INFO_TRACE> environment variable to a true value.  The Connect library also
 | 
						|
provides routines to map interface and PoE IDs.
 | 
						|
 | 
						|
Configuration for SNMP::Info comes from the YAML files, of course. This means
 | 
						|
that our C<mibhome> and C<mibdirs> settings are now in YAML format. In
 | 
						|
particular, the C<mibdirs> list is a real list within the configuration.
 | 
						|
 | 
						|
Other libraries will be added to this namespace in due course, as we add more
 | 
						|
functionality to the Job Daemon.
 | 
						|
 | 
						|
=head2 DBIx::Class Layer
 | 
						|
 | 
						|
The local job queue for each Job Daemon is actually an SQLite database running
 | 
						|
in memory. This makes the queue management code a little more elegant. The
 | 
						|
schema for this is of course DBIx::Class using Dancer connection management,
 | 
						|
and lives in L<App::Netdisco::Daemon::DB>.
 | 
						|
 | 
						|
There is currently only one table, the port control job queue, in
 | 
						|
L<App::Netdisco::Daemon::DB::Result::Admin>. It's likely this name will change
 | 
						|
in the future.
 | 
						|
 | 
						|
=cut
 | 
						|
 | 
						|
 | 
						|
 | 
						|
 | 
						|
 | 
						|
 | 
						|
 | 
						|
 | 
						|
 | 
						|
 | 
						|
 | 
						|
 | 
						|
 | 
						|
 | 
						|
 | 
						|
 | 
						|
 | 
						|
 | 
						|
 | 
						|
 | 
						|
 | 
						|
 | 
						|
 |