Project

General

Profile

Wiki » History » Version 9

John Cary, 12/19/2015 09:15 AM

1 1 Redmine Admin
# What is Bilder?
2
3
{{>toc}}
4
5
Bilder is a cross-platform (Linux, OS X, Windows), meta-build or package management system applicable to LCFs, such the IBM Blue Gene and the Cray series of computers.  It automatically downloads packages, then configures, builds and installs them. It handles updating and building of a collection of packages and projects that have dependency relationships. When one package is updated, Bilder ensures that its dependents are updated as well. It can install in common areas, so that multiple packages can make use of the same dependency builds.
6
7
As of January 16, 2012, Bilder handles multiple builds of over 150 packages, with the multiple builds being, e.g., serial, parallel, shared, or static, as needed.  The platforms include Linux, OS X, AIX, and the specialized Linuces found on the IBM Blue Gene P and the Cray XT4.  It handles the compiler sets of gcc, XL, PathScale and PGI.
8
9
Bilder is not for replacing build systems.  Instead it works with the build systems that come with each package.  It supports packages with builds systems of autotools, CMake, qmake, Distutils, and the one-off builds systems of, e.g., lapack, ATLAS, and PETSc.  In essence, Bilder acts as a repository of build knowledge.
10
11
## Bilder Characteristics
12
13
* Build workflow automation, handling interpackage dependencies, with builds triggered when a dependency has been built.
14
* Uses soft inter-package dependencies: Suppose component A depends on B, and is updated but does not build (fails or is excluded).  Attempt to build A anyway if any other dependency is rebuilt or if A is updated, as the newer A may be consistent with an existing installation of B, or A may be able to build without B.
15
* Integration with version control systems.
16
* Integration with testing.
17
* Support for multiple OSs: Linux, OS X, Windows
18
* Support for multiple compiler sets (gcc, XL, PGI, PathScale, Visual Studio)
19 3 Redmine Admin
  * LCFs have particular preferred compilers, e.g., for which some libraries have been built
20
  * Need to compare performance of code generated by different compilers
21
  * Have to use built libraries  (HDF5, Lapack) when possible for performance
22 1 Redmine Admin
* Ability to use different underlying package configuration/build systems.
23
* Support for different kinds of builds (e.g., parallel, serial, static, shared) for any package.
24
* Collection of build provenance information, including logging of all steps and notification using emails and dashboards.
25
* Allows disabling the builds of particular packages (e.g., so that a system version will be used).
26
* Parallel (multi-threaded or multi-process) builds of independent builds or packages.
27
* Out of place build and installation: with defaults and also user-specified locations.
28
* Defaults for all parameters on all supported platforms that can be overridden by users.
29
* Integration with the Jenkins continuous integration tool.
30
* Searching for packages within the installation area.
31
* Isolation of general logic from specific logic and data
32 3 Redmine Admin
  * General logic in top-level Bilder files
33
  * Package specific logic and data in package files (the files in the package subdirectory)
34
  * Machine specific logic and data in machine files (the files in the machines subdirectory)
35 1 Redmine Admin
36
## What does Bilder not handle?
37
38
* Installing compilers
39
* Probably much more
40
41
# Preparing your machine for Bilder
42
43
* [[Preparing a Windows machine for Bilder]]
44
* [[Preparing a Linux machine for Bilder]]
45
* [[Preparing a Mac machine for Bilder]]
46
47
Then check out a bilder repo and build. Below are some examples.
48
49
# EXAMPLE1: Python Packages
50
51
Build ipython, scipy, tables with one command! This will build these packages and all of their dependencies, which are ipython scipy tables tornado pyzmq pyqt matplotlib hdf5 numexpr setuptools zeromq Cython qt sip numpy Python atlas clapack_cmake chrpath sqlite bzip2 lapack cmake.
52
53
~~~~~~
54 6 John Cary
svn checkout http://ice.txcorp.com/svnrepos/code/bilder/pypkgs/trunk pypkgs
55 1 Redmine Admin
cd pypkgs
56
./mkpypkgs.sh
57
~~~~~~
58
59 2 Tech-X Corporation
# EXAMPLE2: VisIt Visual Analysis Package #
60 1 Redmine Admin
61 2 Tech-X Corporation
![https://wci.llnl.gov/codes/visit/](VisIt)
62
63
Build the [VisIt](https://wci.llnl.gov/codes/visit/) visualization tool with one command! This will build VisIt and all its dependencies, which are visit Imaging visit_vtk qt mesa hdf5 openmpi zlib cmake bzip2.
64 1 Redmine Admin
65
~~~~~~
66 6 John Cary
svn checkout http://ice.txcorp.com/svnrepos/code/bilder/visitall/trunk visitall
67 1 Redmine Admin
cd visitall
68
./mkvisitall.sh
69
~~~~~~
70
71
# Getting Bilder
72
73
Bilder is a set of shell scripts to configure software. All the configure scripts are available from a subversion repository. To access Bilder, enter:
74
75
~~~~~~
76 6 John Cary
svn co https://ice.txcorp.com/svnrepos/code/bilder/trunk bilder
77 1 Redmine Admin
~~~~~~
78
79
# Configuring Bilder
80
81
## Required configuration information 
82
83
Before running Bilder you need to tell it where its configuration information is.  This is a directory.  The value of the environment variable, BILDER_CONFDIR, is set to it.  (E.g., BILDER_CONFDIR=/etc/bilder.)
84
85
Inside that directory, there must be at least two files.  The first, _bilderrc_, defines a variable, _PACKAGE_REPOS_FILE_, that contains the name of the file containing the repositories to be searched for tarballs for packages to be built.  E.g.,
86
87
~~~~~~
88
PACKAGE_REPOS_FILE=${PACKAGE_REPOS_FILE:-"$BILDER_CONFDIR/numpkgssvn.txt"}
89
~~~~~~
90
91
This follows the standard Bilder style, that no variable with a value is overwritten.  This allows the person executing the build instruction to override any variable value on the command line, e.g., using env.
92
93
The Package Repos File, then contains the repos to be searched for packages, with the format:
94
95
~~~~~~
96
    $ cat numpkgssvn.txt 
97
    ####
98
    #
99
    # File:    numpkgssvn.sh
100
    #
101
    # Purpose: List the package repos in the format,
102
    #          subdir,method=URL
103
    #          Where subdir is the desired location for the repo,
104
    #          method = svn to get by svn, empty to get with wget
105
    #          URL is the resource locator
106
    #
107
    # Version: $Id: numpkgssvn.txt 54 2012-04-08 13:52:09Z cary $
108
    #
109
    ####
110
    PACKAGE_REPO: numpkgs,svn=https://ice.txcorp.com/svnrepos/code/numpkgs/trunk
111
~~~~~~
112
113
Each line starting with _PACKAGE_REPO:_ defines the subdir (in this case numpkgs) into which the packages are put, the method (in this case svn) for getting the packages, and after the equals sign, the URL for the directory containing all packages.
114
115
For the method (part between the command and the equals sign) of svn, this means that the svn repo will be checked out as empty, 
116
117
~~~~~~
118
svn co --depth=empty https://ice.txcorp.com/svnrepos/code/numpkgs/trunk numpkgs
119
~~~~~~
120
121
and packages will be obtained by
122
123
~~~~~~
124
svn up pkgname
125
~~~~~~
126
127
in the numpkgs subdirectory.
128
129
## Optional logic in bilderrc 
130
131
It can happen that "hostname -f" does not give the fully qualified hostname for your machine.  In this case, you can define __FQHOSTNAME__ to contain that hostname.
132
133
You can also find the following three methods:
134
135
* _bilderGetAuxData_ defines how to get any auxiliary data needed by a package
136
* _bilderFinalAction_ defines a final action (like posting to a dashboard) to be undertaken at the end of a build run
137
* _signInstaller_ to sign any installers that you create during your build
138
139
## Optional additional logic 
140
141 7 John Cary
You can provide specific logic in domainname files that also define default installation directories and such in files named with the domain name.  Examples are seen in bilder/domains: 
142 1 Redmine Admin
143
~~~~~~
144 7 John Cary
domains$ ls
145
alcf.anl.gov	hpc.mil		nersc.gov	tacc.utexas.edu
146 1 Redmine Admin
~~~~~~
147
148
# Running Bilder
149
150
## Running Bilder for the Novice User 
151
152
First you will need to check out a ''meta-project'' svn repo that includes the source that you want to build along with the bilder scripts repo.
153
154
For example, Tech-X maintains the _visitall_ repo, which can be obtained by:
155
156
~~~~~~
157
svn co https://ice.txcorp.com/svnrepos/code/visitall/trunk visitall
158
~~~~~~
159
160 7 John Cary
In the bilder'ized project, if there is a script usually named "mk<project>all-default.sh" where <project> is the project name that may be abbreviated (e.g. for visitall the script is mkvisitall.sh), then this is the easiest way to run bilder. The options of a top level "default" Bilder script can be seen by running the script with the -h flag.
161 1 Redmine Admin
162
~~~~~~
163 8 John Cary
visitall$ ./mkvisitall-default.sh -h
164
165
Usage: ./mkvisitall-default.sh [WRAPPER OPTIONS] -- [BILDER OPTIONS]
166
167
This wrapper script calls the main Bilder script with arguments
168
to set default locations for build and installation directories
169
on a domainname basis.  It also handles launching the build in a
170
queue.  This script is also meant to ease the use of non-gfortran
171
compilers.  It uses these values to form arguments for the main
172
bilder script, mkvisitall.sh, which it additionally sends the arguments
173
after the double dash, --.
174
175
WRAPPER OPTIONS
176
  -b <dir> .......... Set builds directory. If not used, bilder chooses a default.
177
  -c ................ All-user installations: for non-LCFS, goes into /contrib,
178
                        /volatile or /internal, for LCFSs, goes into group areas
179
  -C ................ Install in separate tarball and repo install dirs
180
...
181
  -- ................ End processing of args for mkall-default.sh, all remaining
182
                        args are passed to the script.
183
184
If you want to see the command line arguments for mkvisitall.sh, then you
185
should run the following command:
186
187
        ./mkvisitall.sh -h
188 1 Redmine Admin
~~~~~~
189
190
191
For this script to work, you must have defined the location of your Bilder configuration directory in the environment variable, BILDER_CONFDIR.  This will be discussed more in [ConfiguringBilder].
192
193
194
## Running Bilder for the Advanced User ... 
195
196 7 John Cary
In the bilder'ized project, there will be a script named "mk<project>all.sh" where <project> is the project name that may be abbreviated (e.g. for nautilus the script is mkvisitall.sh). The options of a top level Bilder script can be seen by running the script with the -h flag, e.g.,
197 1 Redmine Admin
198
~~~~~~
199 7 John Cary
visitall$ ./mkvisitall.sh -h
200
Usage: ./mkvisitall.sh [options]
201
BILDER OPTIONS
202
  -a ................ Update non-subversion packages
203
  -A <addl_sp> ...... Add this to the supra search path.
204
  -b <build_dir> .... Build in <build_dir>.
205
  ...
206 1 Redmine Admin
~~~~~~
207
208
209
## Notes on Installation Directories and Path Modifications 
210
211
Bilder builds all software, when possible, in ''the build directory'' or  <builddir>, which is specified by the _-b_ flag.  It also unpacks tarballs into this directory before building them.
212
213
Bilder defines two installation directories, which may be the same.
214
215 5 Redmine Admin
Tarballs are installed in ''the tarball directory'' or \<tarballdir\>, specified by the _-k_ flag. This is the _/contrib_ directory at Tech-X.
216 1 Redmine Admin
217 5 Redmine Admin
Code from repositories is installed in ''the repo directory'' or \<repodir\>, the directory specified by the _-i_ flag.  At Tech-X, this is typically _/volatile_ or _/internal_.
218 1 Redmine Admin
219
If only one of the above directories is specified, then the other directory defaults to the specified directory.  If neither directory is specified, then both directories default to _$HOME/software_.
220
221
During the build process, _/contrib/autotools/bin:/contrib/valgrind/bin:/contrib/mpi/bin:/contrib/hdf5/bin:/contrib/bin:_ is added to the front of the path so that the installed packages are use to build the packages.
222
223
224
## Debugging Bilder Errors 
225
226
Bilder is a set of bash scripts. The [https://ice.txcorp.com/svnrepos/code/bilder/trunk/ trunk version of the scripts] will tell you exactly what bilder is doing if you know bash programming.
227
228
229
# Bilder's Build Types
230
231
The standard builds of Bilder are
232
233
* ser: static, serial build
234
* par: static, parallel (MPI) build
235
* sersh: shared, serial build
236
* parsh: shared, parallel (MPI) build
237 9 John Cary
* pycsh: shared build compatible with the way Python was built
238 1 Redmine Admin
239
The Bilder standard is to install each build in its own directory.  While libtool allows shared and static builds to be done within the same build, cmake generally does not as discussed at [http://www.cmake.org/Wiki/CMake_FAQ#Library_questions].  Further, to do this on Windows, library names have to differ, as otherwise the static library and the shared interface library files would overwrite each other.  So in the end, is it simply easier to install shared and static libraries in their own directories.
240
241
In all cases, the builds are to be "as complete as possible".  E.g., for HDF5 on Darwin, shared libraries are not supported with fortran.  So in this case, sersh has to disable the fortran libraries.  However, completeness may depend on other criteria.  So, e.g., for trilinos, complete builds are provided, but so are builds that are as complete as possible and compatible with licenses that allow free reuse in commercial products.
242
243
## Static builds 
244
245
The static builds provide the most portable builds, as they eliminate or minimize the need to be compatible with any system shared libraries.  The are also the most widely supported.  For Windows, these mean libraries that import the static runtime library (libcmt).  Generally this means that, for Windows, one should not use a static dependency for a shared build of a project, as doing so typically leads to the dreaded runtime conflict, e.g., http://stackoverflow.com/questions/2360084/runtime-library-mis-matches-and-vc-oh-the-misery.
246
247
## Shared builds 
248
249
Shared builds allow one to reuse libraries among executables, but then one has the difficulty of finding those libraries at runtime.  This can be particularly difficult when moving an installation from one machine to another or when installing a package.  To minimize these headaches, Bilder, as much as possible, uses rpath on Linux.  However, packages need to figure out how to modify any executables or libraries post-build to make an installer.
250
251 9 John Cary
## pycsh builds 
252 1 Redmine Admin
253 9 John Cary
This is a special build that is just a shared build using the compiler that Python was compiled with.  This is generally gcc for Unices and Visual Studio for Windows.  One adds a pycsh build only when the serial compiler is not the compiler used to build Python.
254 1 Redmine Admin
255
# Bilder Hierarchy
256
257 9 John Cary
It is possible to specialize Bilder: per machine, per project and per person. by sourcing file(s) at each level of hierarchy:
258 1 Redmine Admin
259
## Bilder default settings 
260
261
When no specialization files are used, Bilder uses the default settings for the project.
262
263
## By Machine 
264
265
Set of machine files under bilder/machines directory to specify machine specific variables and settings. For example, to build a project on Windows platform with cygwin using Visual Studio 9, we have cygwin.vs9 machine file which sets up the environment as needed by Visual Studio 9. The machine files can be specified by "-m" option.
266
267
## By Project 
268
269
Please see [wiki:ConfiguringBilder Configuring Bilder] on how to set up per project configurations. Here, information needed for the project such as where to obtain third party dependency libraries, default installation directories, set the various variables defining where builds should take place, where installations should go, etc. can be specified.
270
271
## By Person 
272
273
### Default settings using .bilddefrc 
274
275
Every person building a project using Bilder can specify his/her own default settings by creating a .bilddefrc file in their home directory. This will be sourced in the mkXYZall-default.sh file to override any other default project settings. 
276
277
### Settings using .bilderrc 
278
279
Every person building a project using Bilder can specify his/her own settings by creating a .bilderrc file in their home directory. This will be sourced in the mkXYZall.sh file to override any other project settings. 
280
281
## Per package per person 
282
283
In cases where it is necessary to specify settings per package per person, a XYZ.conf file can be specified in the BILDER_CONFDIR/packages directory. If found, this file will be sourced in the mkXYZ.sh script to override all the other settings. If this file is modified, then Bilder will reconfigure and build the package.
284
285
# Running Bilder Through The Defaults Scripts
286
287
The full set of options for Bilder are many, and this gives rise to the potential for mistakes. To facilitate this, we have created the defaultsfcns.sh and mkall-defaults.sh, and then then associated defaults scripts include the latter and execute runBilderCmd:
288
289
~~~~~~
290
    $ cat mkfcall-default.sh 
291
    #!/bin/bash
292
    #
293
    # Determine (and possibly execute) the default Bilder command
294
    # for Facetsall.
295
    #
296
    # $Id: mkfcall-default.sh 593 2012-03-09 15:26:46Z cary $
297
    #
298
    h2.#########################################################
299
    # 
300
    # Set the default variables
301
    mydir=`dirname $0`
302
    mydir=${mydir:-"."}
303
    mydir=`(cd $mydir; pwd -P)`
304
    # Where to find configuration info
305
    BILDER_CONFDIR=$mydir/bilderconf
306
    # Subdir under INSTALL_ROOTDIR where this package is installed
307
    PROJECT_INSTSUBDIR=facets
308
    source $mydir/bilder/mkall-default.sh
309
310
    # Build the package
311
    runBilderCmd
312
    res=$?
313
    exit $res
314
~~~~~~
315
316
The options,
317
318
~~~~~~
319
    $ ./mkfcall-default.sh -h
320
    source /Users/cary/projects/facetsall/bilder/runnr/runnrfcns.sh
321
    WARNING: runnrGetHostVars unable to determine the domain name.
322
    Usage: ./mkfcall-default.sh [options]
323
    This script is meant to handle some of the vagaries that occur
324
    at LCFs and clusters in large systems (which have complicated file
325
    systems) such as those that have high performance scratch systems
326
    and NFS mounted home systems.  This script is also meant to ease
327
    the use of non-gfortran compilers.
328
    OPTIONS
329
      -c              common installations: for non-LCFS, goes into /contrib,
330
                      /volatile or /internal, for LCFSs, goes into group areas
331
      -C              Install in separate tarball and repo install dirs 
332
                      (internal/volatile) rather than software
333
      -E "<options>"  quoted list of extra options to pass to the mk script
334
      -f <file>       File that contains extra arguments to pass
335
                      Default: .extra_args
336
      -F <compiler>   Specify fortran compiler on non-LCF systems
337
      -g              Label the gnu builds the same way other builds occur.
338
      -H <host name>  use rules for this hostname (carver, surveyor, intrepid)
339
      -h              print this message
340
      -i              Software directory is labeled with "internal" if '$USER'
341
                      is member of internal install list
342
      -I              Install in $HOME instead of default location
343
                      (projects directory at LCFs, BUILD_ROOTDIR on non-LCFs)
344
      -j              Maximum allowed value of the arg of make -j
345
      -k              On non-LCFs: Try to find a tarball directory (/contrib)
346
                      On LCFs:     Install tarballs (instead of using facetspkgs)
347
      -m              force this machine file
348
      -n              invoke with a nohup and a redirect output
349
      -p              just print the command
350
      -q <timelimit>  run in queue if possible, with limit of timelimit time
351
      -t              Pass the -t flag to the  mk script (turn on testing)
352
      -v <file>       A file containing a list (without commas) of declared
353
                      environment variables to be passed to mk*.sh script
354
      -w <file>       Specify the name of a file which has a comma-delimited
355
                      list of packages not to build (e.g.,
356
                      plasma_state,nubeam,uedge) Default: .nobuild
357
      --              End processing of args for mkall-default.sh, all remaining
358
                      args are passed to the script.
359
~~~~~~
360
361
362
mostly deal with which directory is to be used for installation, what is the time limit for the build, any extra options to be passed to the build, whether on the command line or in a file.
363
364
An example invocation look like
365
366
~~~~~~
367
mkfcall-default.sh -cin -- -oXZ -E BUILD_ATLAS=true
368
~~~~~~
369
370
which will (c) install in areas common to all users, (i) using the internal rather than the volatile directory for repo installations, (n) in background via nohup, -- what follows are more args for the base script, which are (o) build openmpi if on OS X or Linux, (X) build the newer, experimental packages, (Z) do not invoke the user defined bilderFinalAction method, (E) set this comma delimited list of environment variables, in this case to build Atlas if on Linux or Windows.
371
372
# Using Jenkins with Bilder
373
374
### Setting up Jenkins for use with Bilder ###
375
376
This set of pages is intended to describe how to set up the Jenkins continuous integration tools for launching Bilder jobs (which then handle the builds and testing). It is not intended to describe the most general way to set up Jenkins, but instead it describes a way that relies on having a Linux master node.
377
378
## Starting up a Linux Jenkins master node 
379
380
Install Jenkins using the installation mechanism for your platform.  E.g., see
381
https://wiki.jenkins-ci.org/display/JENKINS/Installing+Jenkins+on+RedHat+distributions.
382
383
*IMPORTANT:* Before starting Jenkins for the first time:
384
* create the directory where Jenkins will do its builds (known as _JENKINS_HOME_, not to be confused with the Jenkins home directory in /etc/passwd, which is initially set to /var/lib/jenkins, which we will assume here)
385
* set the permissions of the Jenkins build directory (e.g., /home/bilder/jenkins)
386
* Add jenkins to any groups as needed (e.g., contrib, research, xxusers)
387
* modify {{{/etc/sysconfig/jenkins}}} as needed.  Our settings are
388
389
390
~~~~~~
391 9 John Cary
JENKINS_HOME="/home/bilder/jenkins"
392
JENKINS_PORT="8300"
393
JENKINS_AJP_PORT="8309"
394
JENKINS_ARGS="--argumentsRealm.passwd.jenkins=somepassword --argumentsRealm.roles.jenkins=admin"
395 1 Redmine Admin
~~~~~~
396
397
(_somepassword_ is not literal.)
398
399
Create an ssh key for jenkins:
400
401
~~~~~~
402 9 John Cary
sudo -u jenkins ssh-keygen
403 1 Redmine Admin
~~~~~~
404
405
It cannot have a passphrase.
406
407
408
Start the jenkins service:
409
410
~~~~~~
411 9 John Cary
sudo service jenkins start
412 1 Redmine Admin
~~~~~~
413
414
415
Set Jenkins to start on boot:
416
417
~~~~~~
418 9 John Cary
sudo chkconfig --level 35 jenkins on
419 1 Redmine Admin
~~~~~~
420
421
## Preparing a Unix Jenkins slave node 
422
423
We will have one node prepared to act as a Jenkins slave for now.  For ease, we will create a Unix slave.  Later we will add more slaves.
424
425
* On the service node, create the user who will run Jenkins.
426
* As that user create the directory where Jenkins will work
427
* Add that user to any groups needed to give it appropriate permissions (e.g., contrib, research, xxusers)
428
* For public-key authentication
429 3 Redmine Admin
  * Add the public key created above for jenkins to that user's ~/.ssh/authorized_keys
430
  * On the master, check that you can do passwordless login by trying: "_sudo -u jenkins ssh jenkins@yourhost_"
431 1 Redmine Admin
* For password authentication
432 3 Redmine Admin
  * Configure /etc/sshd_config to allow password authentication (PasswordAuthentication yes) and restart sshd
433 1 Redmine Admin
434
## Configuring the Linux Jenkins master node 
435
436
* Open a browser and go to _master.yourdomain:8300_ and log in as admin with the password that you set in the JENKINS_ARGS variable, above.
437
* Go to Manage Jenkins -> Manage Plugins -> Available and install the plugins,
438 3 Redmine Admin
  * Jenkins cross-platform shell (XShell)
439
  * Conditional Build-Step
440
  * Matrix Tie Parent
441
  * Jenkins build timeout (Build-timeout)
442 1 Redmine Admin
* Go to Manage Jenkins -> Manage Users and then use _Create User_ to create the users for your Jenkins installation. Make sure to create an administrative user perhaps yourself).
443
* Go to Manage Jenkins -> Configure system and select/set
444 3 Redmine Admin
  * Enable security
445
  * Jenkins's own user database
446
  * _If you wish_, allow users to sign up
447
  * Project-based Matrix Authorization Strategy
448 4 Redmine Admin
      * Add an administrator name with all privileges
449
      * Give anonymous user Overall Read (only)
450 3 Redmine Admin
  * Default user e-mail suffix: e.g., @yourdomain
451
  * Sender: jenkins@yourdomain
452 1 Redmine Admin
* Go to Manage Jenkins -> Manage Nodes -> New Node
453 4 Redmine Admin
  * Fill in name
454
  * Dumb Slave
455
  * You are taken to the configure form:
456
      * \# of executors = 1
457
      * Remote FS root: what you decided upon when creating the slave
458
      * Usage: Leave this machine for tied jobs onlye
459
      * Launch methog: Launch slave gents on Unix machines via SSH
460
      * Advanced:
461
          * Host 
462
          * Username (jenkins)
463 1 Redmine Admin
464
## Creating your first Bilder-Jenkins project 
465
466
We will create the first project to build on the master node. Later we will add more nodes.
467
468
* Go to Jenkins -> New Job
469
    * Build multi-configuration project
470
* Set name (here we will do visitall as our example)
471
* Enable project-based security
472
    * For open source builds, give Anonymous Job Read and Job Workspace permission
473
    * Add user/group as needed
474
* Source Code Management
475
    * Subversion
476
    * Put in your URL, e.g., https://ice.txcorp.com/svnrepos/code/visitall/trunk
477
    * Put in your svn credentials as requested
478
* Build Triggers (examples)
479
    * Build Periodically
480
        * Enter cron parameters, e.g., 0 20 * * *
481
    * Or have this build launched as a post-build step of another build
482
* Configuration Matrix
483
    * Add axis -> slaves (is this available before we add nodes?)
484
        * Add master and the above unix slave
485
* Build Environment
486
    * Abort the build if stuck (if desired)
487
        * Enter your timeout
488
    * Tie parent build to a node
489
        * Select master node
490
* Build
491
    * Add build step -> Invoke XShell command
492
        * Command line: bilder/bildtrol/unibild -d mkvisitall
493
        * Executable is in workspace dir
494
* Post-build Actions
495
    * Aggregate downstream test results
496
        * Select both
497
    * Archive the artifacts (select, see below for settings)
498
    * E-mail Notification
499
        * Set as desired
500
501
502
## Creating a Windows Slave 
503
504
* Get all tools in place on the slave machine by following the instructions at https://ice.txcorp.com/trac/bilder/wiki/BilderOnWindows
505
* Create the jenkins user account (if not already defined) as an Administrative account and log into the windows machine as the jenkins user
506
* Make sure the slave's Windows name and its domain name are consistent.
507
* Install Java (see http://www.java.com) and update the path to include `C:\Windows\SYSWOW64` if on 64 bit Windows and then `C:\Program Files (x86)\Java\jre6\bin`
508
* Create the build directory (e.g., C:\winsame\jenkins)
509
* Set the owner to that directory to the jenkins user via properties->security->advanced->owner.
510
* Install the Visual C++ redistributables from http://www.microsoft.com/download/en/details.aspx?id=5582
511
* Follow the networking, registry, and security related instructions at https://wiki.jenkins-ci.org/display/JENKINS/Windows+slaves+fail+to+start+via+DCOM
512
* (Cribbing from https://issues.jenkins-ci.org/browse/JENKINS-12820)
513
* Start a web browser on the windows slave and connect to the master jenkins web page.
514
* Manage Jenkins -> Manage Nodes -> New Node
515
* Create a new node (Slave node)
516
  * Fill in name, choose it to be the same as the Windows name of the slave
517
  * Dumb Slave
518
  * In the configure form, set
519
    * \# of executors: 1
520
    * Remote FS root: the directory created above (e.g., C:\winsame\jenkins)
521
    * Usage: Leave this machine for tied jobs only
522
    * Launch method: Launch slave agents via java web start
523
* Launch the slave
524
* Press the newly appeared button: Launch by webstart  
525
* A pop up window will be visible with a message as "Connected"
526
* In that pop up window click File-> Install as Windows Service
527
* Find jenkins service in the control panel, ensure that the owner is the jenkins user
528
* UNCHECKED: Set startup to Automatic
529
* Return to browser, take slave node off line in jenkins
530
* Set launch method to: Windows slave as a Windows Service
531
  * Advanced:
532
    * Administrative username (jenkins) ''You may need to type it as computername\jenkins if you get an invalid service account error''
533
    * Password (set as selected in slave setup)
534
    * Use Administrator account given above
535
* Relaunch slave node
536
537
### Use the Slave on the Master 
538
539
You should now be able to select this slave as a build node.
540
541
542
## Launching Bilder through Jenkins 
543
544
Jenkins runs Bilder through the scripts in the bildtrol subdir using the XShell command. The XShell command, when configured to launch _somescript_, actually invokes _somescript.bat_ on Windows and _somescript_ on unix.  The Bilder _.bat_ scripts simply translate the arguments and use them in a call to _somescript_, which is run through Cygwin.
545
546
### Building and testing: jenkinsbild 
547
548
The script, _jenkinsbild_, launches a build from Jenkins using the default scripts.  For this example, we consider building the visualization package _VisIT_, for which the repo is _https://ice.txcorp.com/svnrepos/code/visitall/trunk_.  This repo uses externals to bring in the VisIT source code.  In this
549
case, the simplest XShell command is
550
551
~~~~~~
552 9 John Cary
bilder/jenkins/jenkinsbild mkvisitall
553 1 Redmine Admin
~~~~~~
554
555
which leads to execution of
556
557
~~~~~~
558 9 John Cary
./mkvisitall-default.sh -t -Ci -b builds-internal  -- -Z -w 7
559 1 Redmine Admin
~~~~~~
560
561
which action is described in the rest of the Bilder documentation, but in particular, testing is invoked (-t), packages and repos are installed in separate areas (-C), use the ''internal'' directory for repo installation, do not do any post build action (-Z), and if a build less than 7 days old is found, do not execute the build (-w 7). The arguments after *--* are passed directly to the underlying Bilder script, mkvisitall.sh.
562
563
The _jenkinsbild_ script has very few options:
564
565
566
~~~~~~
567 9 John Cary
Usage: $0 [options]
568
GENERAL OPTIONS
569
  -b ................ Use internal/volatile build directory naming
570
  -m ................ Use mingw on cygwin
571
  -n ................ Do not add tests
572
  -p ................ Print the command only.
573
  -s step ..........  Sets which build step: 1 = internal, 2 = volatile.
574
  -2 ................ Sets build step to 2.
575 1 Redmine Admin
~~~~~~
576
577
At present, the internal/volatile build directory naming is in fact always true.  In this case, the first step (the default) builds in the subdir, *builds-internal*, and the second step (selected with -2 or -s 2) builds in the subdir, *builds-volatile*. Correspondingly, the repo installation directory is the *internal* directory on step 1 and the *volatile* directory on step 2.
578
579
Using mingw on cygwin (-m) is useful for codes that cannot build with Visual Studio.
580
581
Not adding the tests is useful in many instances where one is counting on only a few hosts to do testing.
582
583
The build step (-s2 or -2) will build in *builds-volatile* and install in the volatile directory, but it also determines several options by looking at the email subject of any step-1 build.
584
585
This is geared towards a general philosophy of having two builds, the stable (or internal) build that is done more rarely, and a volatile build that is done every night. So what is done in step 2 depends on the step 1 result, which can be determined from the email subject file, left behind. There are four cases:
586
587
* Step 1 did nothing as there was a sufficiently recent build.  Then step 2 does a full build with tests.
588
* Step 1 was fully successful, both builds and tests.  Then step 2 is not executed.
589
* Step 1 builds succeeded, but some tests failed (and so some packages were not installed).  Then step 2 is executed without testing, as that was done in step 1, and this permits installation of the built, but untested packages.
590
* Step 1 builds failed (and so corresponding tests were not attempted). Then step 2 is not executed, as it will fail as well.
591
592
The error code returned by jenkinsbild for is success (0) if even only the builds succeeded but not the tests. This way the dashboard indicates jenkinsbild build success only. A subsequent job, jenkinstest, determines whether tests passed by examining the email subjects left behind.
593
594 9 John Cary
For either build step, one wants to archive the artifacts listed in bilder/jenkins/jenkinsclean.sh.
595 1 Redmine Admin
in order to collect all results of builds and tests and any created installers.
596
597
### Posting test results: jenkinstest 
598
599
600
# Bilder Architecture
601
602
Bilder has a largely Object Oriented structure, even though it is written in Bash. But like all (even OO) programs, it has a procedural aspect.  Further it is task oriented (as opposed to event driven), with clear start and conclusion. We will break this architecture down into these three aspects: the task flow, the primary objects, and the procedures.
603
604
## Task flow 
605
606
Bilder scripts, like mkvisitall.sh, begin by setting some identifying variables, BILDER_NAME, BILDER_PACKAGE, ORBITER_NAME, and then continue by sourcing bildall.sh, which brings in the Bilder infrastructure: initializations of variables and methods used for building, testing, and installing packages.
607
608
### Global method definition 
609
610
The file, bildall.sh, brings in all of the global methods by first sourcing runr/runrfcns.sh, which contains the minimal methods for executing builds in job queues and reporting the results. It then obtains all of the more Bilder-specific methods by sourcing bildfcns.sh. These include generic methods for determining the build system, preconfiguring, configuring, building, testing (including running tests and collecting results), and installing. These files are the heart of Bilder, as they do all the heavy lifting.
611
612
A trivial, but important method is _techo_, which prints output to both stdout and to a log file.  Another is _decho_, which does the same, but only if DEBUG=true, which is set by the option _-d_.
613
614
### Option parsing 
615
616
Options are parsed through the sourcing of bildopts.sh, which is sourced by bildall.sh.  It then sets some basic command-line-argument derived variables, such as the installation directories, which it checks for writability. This file, bildopts.sh, has been written in such a way that Bilder-derived scripts (like mkvisitall.sh) can add their own arguments.
617
618
### Initialization 
619
620
Initialization is carried out by sourcing of two files, bildinit.sh and bildvars.sh (which are both sourced by bildall.sh). The purpose of bildinit.sh is to handle timing, to clear out indicating variables (like PIDLIST and configFailures), get the Bilder version, and define any path-like environment variables that might get changed in the course of the run.
621
622 9 John Cary
The purpose of bildvars.sh is to determine useful variables for the build.  The first comes from a possible machine file, then by OS (AIX, CYGWIN, Darwin, or Linux; MinGW is a work in progress). Then unset variables are set to default values. These variables contain the compilers for serial (front-end nodes), back-end nodes, parallel, and gcc (as some packages build only with gcc, and the names of the gcc compilers can vary from one system to another).  As well, the flags for all of these compilers are set. 
623 1 Redmine Admin
624
There are some packages that are so basic, that bilder defines variables for them.  These include HDF5, the linear algebra libraries (lapack, blas, atlas), and Boost. These definitions allow the locations of these libraries to be defined on a per machine basis. This is needed particularly for LCFs, which have special builds of HDF5, BLAS, and LAPACK, and for CYGWIN, which must have Boost to make up for deficiencies in the Visual Studio compilers.
625
626
Finally, bildvars.sh prints out all of the determined values.
627
628
### Package building 
629
630 9 John Cary
A Bilder-derived script, like _mkvisitall.sh_, after sourcing _bildall.sh_, specifies targets.  Bilder then uses the information in the package files to determine the dependencies, after which Bilder builds the packages in the order from least to most dependent, which all builds for a package occurring in parallel.
631 1 Redmine Admin
632 9 John Cary
One can write special package files for building multiple packages simultaneously.  That feature is rarely used so we will not discuss it further.
633 1 Redmine Admin
634 9 John Cary
This ability to have multiple builds of possibly multiple, non-interdependent packages simultaneously is a key feature of Bilder. It leads to great savings in time, especially with packages that must be built in serial due to a lack of dependency determination.
635 1 Redmine Admin
636
### Concluding 
637
638
The last part of the task flow is to install the configuration files, to summarize the build, and to email and post log files, build files, and the summary. The configuration files, which are created by _createConfigFiles_ and installed by _installConfigFiles_ into the installation directory, contain the necessary additions to environment variables to pick up the installed software.
639
640
The method, _finish_, then does the remaining tasks. It creates the summary file and emails it to the contact specified by the option parsing.  It then posts all log and build files to Orbiter.
641
642
## Package files 
643
644
Package files define how a package is acquired, how it is configured for building on the particular platform for all builds, how all builds are done, and how they are all installed.  Here we introduce an important distinction: the **tarball packages** are those obtained in the tar.gz format; the **repo packages** are obtained from a Subversion source code repo. Generic tarball packages are found in the Tech-X maintained Subversion repo at https://ice.txcorp.com/svnrepos/code/numpkgs and are available by anonymous svn. The repo packages are typically svn externals to a Bilder project, e.g., for visitall
645
646 6 John Cary
~~~~~~
647 1 Redmine Admin
visitall$ svn pg svn:externals .
648
bilder http://ice.txcorp.com/svnrepos/code/bilder/trunk
649 6 John Cary
bilderconf http://ice.txcorp.com/svnrepos/code/bilder/bilderconf/trunk
650
visit http://portal.nersc.gov/svn/visit/trunk/src
651
visitwindows/distribution http://portal.nersc.gov/svn/visit/trunk/windowsbuild/distribution
652
visittest/data http://portal.nersc.gov/svn/visit/trunk/data
653
visittest/test http://portal.nersc.gov/svn/visit/trunk/test
654
~~~~~~
655
656 1 Redmine Admin
Though written in _Bash_, Bilder uses object concepts. Each package file under packages acts an object, with instantiation, exposed (public) data members, private data members, and methods.  As in OO, these **package-build** objects have the same data members and a common interface.
657
658
Instantiation is carried out by sourcing the package file. At this point, the data associated with that package is initialized as necessary.
659
660
### Package-build data 
661
662
The public data members for a package _PKG__ are
663
664
~~~~~~
665 9 John Cary
PKG_BLDRVERSION # Either the version to install or the
666
                # version from the code repository
667
PKG_DEPS        # The dependencies of this package
668
PKG_BUILDS      # The names of the builds for this package
669
PKG_UMASK       # The "umask" that determines the permissions
670
                # for installation of this package
671 1 Redmine Admin
~~~~~~
672
673
In the syntax of C++, the first underscore would be represented by '.', i.e., pgk.DEPS. Even dynamic binding can be implemented in _Bash_.  I.e., if one
674
has _pkgname_ that holds the name of a package, one can can extract, e.g., BLDRVERSION via
675
676
~~~~~~
677
    vervar=`echo $pkgname | tr 'a-z./-' 'A-Z___'`_BLDRVERSION
678
~~~~~~
679
680
Admittedly, many of these constructs would more easily be accomplished in a language like Python that naturally supports object orientation. The trade-off is that then one does not have the nearly trivial expression of executable invocation or threading that one has in _Bash_.
681
682
In addition, there are the per-build, public variables _PKG_BUILD_OTHER_ARGS_ (e.g., _FACETS_PAR_OTHER_ARGS_ or _BABEL_STATIC_OTHER_ARGS_. These are added to the command-line when configuring a package.  In some cases, a package has more than one builds system, like HDF5, in which case one has two sets of variables, e.g., _HDF5_PAR_CMAKE_OTHER_ARGS_ and _HDF5_PAR_CONFIG_OTHER_ARGS_.
683
684
### Exposed package-build methods 
685
686
All package files are supposed to provide three methods, e.g., _buildPkg_, _testPkg_, and _installPkg_, where "Pkg" is the name of the package being built.  E.g., FACETS has buildFacets, testFacets, installFacets. For untested packages, the second method can simply be empty.
687
688
The method, _buildPkg_, is supposed to determine whether a package needs to be built.  If so, it should either acquire a tarball package or preconfigure (prepare the build system for) a repo package, then configure the package, and finally launch the builds of the package.  Preconfiguring in the example of an _autotools_ package involves invoking the _autoreconf_ and other executables for creating the various configuration scripts.  In many other cases there is no associated action.  If the Bilder infrastructure is used, then all builds are executed in a separate thread, and at the end of the _buildPkg_ method all the process IDs for these builds have been stored in both the variable PIDLIST, and the particular process ID for build "ser" of package "pkg" is stored in the variable, PKG_SER_PID.
689
690
The method, _testPkg_, is supposed to determine whether a package is being tested.  If not, it simply returns.  But if the package is being tested, then _testPkg_ executes _wait_ for each build.  Upon successful completion of
691
all builds, the tests are launched.  These are treated just like builds, so the process IDs are stored as in the case of builds.
692
693 9 John Cary
The last method, _installPkg_, in the case of a tested package, waits for the tests to complete, then installs the package if the tests completed successfully, after which is sets the tests as being installed, so that tests will not be run again unless the version or dependencies of the package change.  In the other case, where the package is not being tested, it waits for the builds to complete and installs any successful builds.
694 1 Redmine Admin
695
All three methods for any package are supposed to compensate for any errors or omissions in the build systems. Errors include fixing up library dependencies on Darwin, setting permissions of the installed software, and so forth. 
696
697
The object-oriented analogy is that each package-build object has an interface with three methods.  The syntax translation is _buildPkg_ -> _pkg.build_.
698
699
### Private package-build data 
700
701 9 John Cary
In the course of its build, any package will generate other variables with values. These are organized on a per-build basis, and so one can think of each package-build object as containing an object for each build of that package.  
702 1 Redmine Admin
703
### Internal objects 
704
705
Builds
706
707
Tests
708
709
###  Combined package objects 
710
711
###  Future directions
712
713
714
# Linear Algebra Libraries in Bilder
715
716
There are a wide variety of ways to get LAPACK and BLAS: Netlib's libraries (reference LAPACK and BLAS), CLapack (for when one does not have a Fortran compilers), ATLAS (for cpu-tuned libraries), GOTOBLAS (from TACC), and system libraries (MKL, ACML).
717
718
For numpy and all things that depend on it, Bilder uses ATLAS (if it has been built), and otherwise it uses LAPACK.
719
720
For non Python packages, the current logic is
721
722
## Darwin 
723
724
Always use -framework Accelerate
725
726
## Linux and Windows 
727
728
* SYSTEM_LAPACK_SER_LIB and SYSTEM_BLAS_SER_LIB are used if set.
729
* Otherwise, if USE_ATLAS is true, then ATLAS is used.
730
* Otherwise, use Netlib LAPACK if that is found.
731
* Otherwise
732
    * If on Windows, use CLAPACK
733
    * If on Linux, use any system blas/lapack
734
735
The results of the search are put into the variables, CMAKE_LINLIB_SER_ARGS, CONFIG_LINLIB_SER_ARGS, LINLIB_SER_LIBS.
736
737
738
# Extending Bilder
739
740
Bilder builds packages using the general logic in bildfcns.sh, the operating-system logic in bildvars.sh,
741
logic for a particular package in the _Bilder package file_ (kept in the packages subdir), logic for a
742
particular machine in the _Bilder machine file_ (kept in the machines subdir), and additional settings for a particular package on a particular machine in the Bilder machine file.  To extend Bilder, one adds the files that introduce the particular logic for a package or a machine.
743
744
* [[Adding support for a new package]]
745
* [[Adding support for a new machine or operating system/compiler combination]]
746
747
# Debugging your builds
748
749
This section describes some of the things that can go wrong and explains how to fix them.
750
751
# Bilder With IDEs
752
753
This is a page to collect notes on using IDEs to develop code while at the same time using bilder to build the project.
754
755
* Reference for using Eclipse with CMake, http://www.vtk.org/Wiki/CMake:Eclipse_UNIX_Tutorial