Project

General

Profile

Wiki » History » Version 7

John Cary, 12/19/2015 08:59 AM

1 1 Redmine Admin
# What is Bilder?
2
3
{{>toc}}
4
5
Bilder is a cross-platform (Linux, OS X, Windows), meta-build or package management system applicable to LCFs, such the IBM Blue Gene and the Cray series of computers.  It automatically downloads packages, then configures, builds and installs them. It handles updating and building of a collection of packages and projects that have dependency relationships. When one package is updated, Bilder ensures that its dependents are updated as well. It can install in common areas, so that multiple packages can make use of the same dependency builds.
6
7
As of January 16, 2012, Bilder handles multiple builds of over 150 packages, with the multiple builds being, e.g., serial, parallel, shared, or static, as needed.  The platforms include Linux, OS X, AIX, and the specialized Linuces found on the IBM Blue Gene P and the Cray XT4.  It handles the compiler sets of gcc, XL, PathScale and PGI.
8
9
Bilder is not for replacing build systems.  Instead it works with the build systems that come with each package.  It supports packages with builds systems of autotools, CMake, qmake, Distutils, and the one-off builds systems of, e.g., lapack, ATLAS, and PETSc.  In essence, Bilder acts as a repository of build knowledge.
10
11
## Bilder Characteristics
12
13
* Build workflow automation, handling interpackage dependencies, with builds triggered when a dependency has been built.
14
* Uses soft inter-package dependencies: Suppose component A depends on B, and is updated but does not build (fails or is excluded).  Attempt to build A anyway if any other dependency is rebuilt or if A is updated, as the newer A may be consistent with an existing installation of B, or A may be able to build without B.
15
* Integration with version control systems.
16
* Integration with testing.
17
* Support for multiple OSs: Linux, OS X, Windows
18
* Support for multiple compiler sets (gcc, XL, PGI, PathScale, Visual Studio)
19 3 Redmine Admin
  * LCFs have particular preferred compilers, e.g., for which some libraries have been built
20
  * Need to compare performance of code generated by different compilers
21
  * Have to use built libraries  (HDF5, Lapack) when possible for performance
22 1 Redmine Admin
* Ability to use different underlying package configuration/build systems.
23
* Support for different kinds of builds (e.g., parallel, serial, static, shared) for any package.
24
* Collection of build provenance information, including logging of all steps and notification using emails and dashboards.
25
* Allows disabling the builds of particular packages (e.g., so that a system version will be used).
26
* Parallel (multi-threaded or multi-process) builds of independent builds or packages.
27
* Out of place build and installation: with defaults and also user-specified locations.
28
* Defaults for all parameters on all supported platforms that can be overridden by users.
29
* Integration with the Jenkins continuous integration tool.
30
* Searching for packages within the installation area.
31
* Isolation of general logic from specific logic and data
32 3 Redmine Admin
  * General logic in top-level Bilder files
33
  * Package specific logic and data in package files (the files in the package subdirectory)
34
  * Machine specific logic and data in machine files (the files in the machines subdirectory)
35 1 Redmine Admin
36
## What does Bilder not handle?
37
38
* Installing compilers
39
* Probably much more
40
41
# Preparing your machine for Bilder
42
43
* [[Preparing a Windows machine for Bilder]]
44
* [[Preparing a Linux machine for Bilder]]
45
* [[Preparing a Mac machine for Bilder]]
46
47
Then check out a bilder repo and build. Below are some examples.
48
49
# EXAMPLE1: Python Packages
50
51
Build ipython, scipy, tables with one command! This will build these packages and all of their dependencies, which are ipython scipy tables tornado pyzmq pyqt matplotlib hdf5 numexpr setuptools zeromq Cython qt sip numpy Python atlas clapack_cmake chrpath sqlite bzip2 lapack cmake.
52
53
~~~~~~
54 6 John Cary
svn checkout http://ice.txcorp.com/svnrepos/code/bilder/pypkgs/trunk pypkgs
55 1 Redmine Admin
cd pypkgs
56
./mkpypkgs.sh
57
~~~~~~
58
59 2 Tech-X Corporation
# EXAMPLE2: VisIt Visual Analysis Package #
60 1 Redmine Admin
61 2 Tech-X Corporation
![https://wci.llnl.gov/codes/visit/](VisIt)
62
63
Build the [VisIt](https://wci.llnl.gov/codes/visit/) visualization tool with one command! This will build VisIt and all its dependencies, which are visit Imaging visit_vtk qt mesa hdf5 openmpi zlib cmake bzip2.
64 1 Redmine Admin
65
~~~~~~
66 6 John Cary
svn checkout http://ice.txcorp.com/svnrepos/code/bilder/visitall/trunk visitall
67 1 Redmine Admin
cd visitall
68
./mkvisitall.sh
69
~~~~~~
70
71
# Getting Bilder
72
73
Bilder is a set of shell scripts to configure software. All the configure scripts are available from a subversion repository. To access Bilder, enter:
74
75
~~~~~~
76 6 John Cary
svn co https://ice.txcorp.com/svnrepos/code/bilder/trunk bilder
77 1 Redmine Admin
~~~~~~
78
79
# Configuring Bilder
80
81
## Required configuration information 
82
83
Before running Bilder you need to tell it where its configuration information is.  This is a directory.  The value of the environment variable, BILDER_CONFDIR, is set to it.  (E.g., BILDER_CONFDIR=/etc/bilder.)
84
85
Inside that directory, there must be at least two files.  The first, _bilderrc_, defines a variable, _PACKAGE_REPOS_FILE_, that contains the name of the file containing the repositories to be searched for tarballs for packages to be built.  E.g.,
86
87
~~~~~~
88
PACKAGE_REPOS_FILE=${PACKAGE_REPOS_FILE:-"$BILDER_CONFDIR/numpkgssvn.txt"}
89
~~~~~~
90
91
This follows the standard Bilder style, that no variable with a value is overwritten.  This allows the person executing the build instruction to override any variable value on the command line, e.g., using env.
92
93
The Package Repos File, then contains the repos to be searched for packages, with the format:
94
95
~~~~~~
96
    $ cat numpkgssvn.txt 
97
    ####
98
    #
99
    # File:    numpkgssvn.sh
100
    #
101
    # Purpose: List the package repos in the format,
102
    #          subdir,method=URL
103
    #          Where subdir is the desired location for the repo,
104
    #          method = svn to get by svn, empty to get with wget
105
    #          URL is the resource locator
106
    #
107
    # Version: $Id: numpkgssvn.txt 54 2012-04-08 13:52:09Z cary $
108
    #
109
    ####
110
    PACKAGE_REPO: numpkgs,svn=https://ice.txcorp.com/svnrepos/code/numpkgs/trunk
111
~~~~~~
112
113
Each line starting with _PACKAGE_REPO:_ defines the subdir (in this case numpkgs) into which the packages are put, the method (in this case svn) for getting the packages, and after the equals sign, the URL for the directory containing all packages.
114
115
For the method (part between the command and the equals sign) of svn, this means that the svn repo will be checked out as empty, 
116
117
~~~~~~
118
svn co --depth=empty https://ice.txcorp.com/svnrepos/code/numpkgs/trunk numpkgs
119
~~~~~~
120
121
and packages will be obtained by
122
123
~~~~~~
124
svn up pkgname
125
~~~~~~
126
127
in the numpkgs subdirectory.
128
129
## Optional logic in bilderrc 
130
131
It can happen that "hostname -f" does not give the fully qualified hostname for your machine.  In this case, you can define __FQHOSTNAME__ to contain that hostname.
132
133
You can also find the following three methods:
134
135
* _bilderGetAuxData_ defines how to get any auxiliary data needed by a package
136
* _bilderFinalAction_ defines a final action (like posting to a dashboard) to be undertaken at the end of a build run
137
* _signInstaller_ to sign any installers that you create during your build
138
139
## Optional additional logic 
140
141 7 John Cary
You can provide specific logic in domainname files that also define default installation directories and such in files named with the domain name.  Examples are seen in bilder/domains: 
142 1 Redmine Admin
143
~~~~~~
144 7 John Cary
domains$ ls
145
alcf.anl.gov	hpc.mil		nersc.gov	tacc.utexas.edu
146 1 Redmine Admin
~~~~~~
147
148
# Running Bilder
149
150
## Running Bilder for the Novice User 
151
152
First you will need to check out a ''meta-project'' svn repo that includes the source that you want to build along with the bilder scripts repo.
153
154
For example, Tech-X maintains the _visitall_ repo, which can be obtained by:
155
156
~~~~~~
157
svn co https://ice.txcorp.com/svnrepos/code/visitall/trunk visitall
158
~~~~~~
159
160 7 John Cary
In the bilder'ized project, if there is a script usually named "mk<project>all-default.sh" where <project> is the project name that may be abbreviated (e.g. for visitall the script is mkvisitall.sh), then this is the easiest way to run bilder. The options of a top level "default" Bilder script can be seen by running the script with the -h flag.
161 1 Redmine Admin
162
~~~~~~
163
    $ ./mkvisitall-default.sh -h
164
    source /Users/cary/projects/visitall/bilder/runnr/runnrfcns.sh
165
    Usage: ./mkvisitall-default.sh [options]
166
    This script is meant to handle some of the vagaries that occur at LCFs and
167
    clusters in large systems (which have complicated file systems) such as those 
168
    that have high performance scratch systems and NFS mounted home systems. This 
169
    script is also meant to ease the use of non-gfortran compilers.
170
    OPTIONS
171
    -c              common installations: for non-LCFS, goes into /contrib,
172
                    /volatile or /internal, for LCFSs, goes into group areas
173
    -C              Install in separate tarball and repo install dirs
174
                    (internal/volatile) rather than in one area (software).
175
    -E <env pairs>  Comma-delimited list of environment var=value pair
176
    -f <file>       File that contains extra arguments to pass
177
                    Default: .extra_args
178
    -F <compiler>   Specify fortran compiler on non-LCF systems
179
    -g              Label the gnu builds the same way other builds occur.
180
    -H <host name>  use rules for this hostname (carver, surveyor, intrepid)
181
    -h              print this message
182
    -i              Software directory is labeled with "internal" if '$USER'
183
                    is member of internal install list
184
    -I              Install in $HOME instead of default location
185
                    (projects directory at LCFs, BUILD_ROOTDIR on non-LCFs)
186
    -j              Maximum allowed value of the arg of make -j
187
    -k              On non-LCFs: Try to find a tarball directory (/contrib)
188
                    On LCFs:     Install tarballs (instead of using facetspkgs)
189
    -m              force this machine file
190
    -n              invoke with a nohup and a redirect output
191
    -p              just print the command
192
    -q <timelimit>  run in queue if possible, with limit of timelimit time
193
    -t              Pass the -t flag to the  mk script (turn on testing)
194
    -v <file>       A file containing a list (without commas) of declared
195
                    environment variables to be passed to mk*.sh script
196
    -w <file>       Specify the name of a file which has a comma-delimited
197
                    list of packages not to build (e.g.,
198
                    plasma_state,nubeam,uedge) Default: .nobuild
199
    --              End processing of args for mkall-default.sh, all remaining
200
                    args are passed to the script.
201
~~~~~~
202
203
204
For this script to work, you must have defined the location of your Bilder configuration directory in the environment variable, BILDER_CONFDIR.  This will be discussed more in [ConfiguringBilder].
205
206
207
## Running Bilder for the Advanced User ... 
208
209 7 John Cary
In the bilder'ized project, there will be a script named "mk<project>all.sh" where <project> is the project name that may be abbreviated (e.g. for nautilus the script is mkvisitall.sh). The options of a top level Bilder script can be seen by running the script with the -h flag, e.g.,
210 1 Redmine Admin
211
~~~~~~
212 7 John Cary
visitall$ ./mkvisitall.sh -h
213
Usage: ./mkvisitall.sh [options]
214
BILDER OPTIONS
215
  -a ................ Update non-subversion packages
216
  -A <addl_sp> ...... Add this to the supra search path.
217
  -b <build_dir> .... Build in <build_dir>.
218
  ...
219 1 Redmine Admin
~~~~~~
220
221
222
## Notes on Installation Directories and Path Modifications 
223
224
Bilder builds all software, when possible, in ''the build directory'' or  <builddir>, which is specified by the _-b_ flag.  It also unpacks tarballs into this directory before building them.
225
226
Bilder defines two installation directories, which may be the same.
227
228 5 Redmine Admin
Tarballs are installed in ''the tarball directory'' or \<tarballdir\>, specified by the _-k_ flag. This is the _/contrib_ directory at Tech-X.
229 1 Redmine Admin
230 5 Redmine Admin
Code from repositories is installed in ''the repo directory'' or \<repodir\>, the directory specified by the _-i_ flag.  At Tech-X, this is typically _/volatile_ or _/internal_.
231 1 Redmine Admin
232
If only one of the above directories is specified, then the other directory defaults to the specified directory.  If neither directory is specified, then both directories default to _$HOME/software_.
233
234
During the build process, _/contrib/autotools/bin:/contrib/valgrind/bin:/contrib/mpi/bin:/contrib/hdf5/bin:/contrib/bin:_ is added to the front of the path so that the installed packages are use to build the packages.
235
236
237
## Debugging Bilder Errors 
238
239
Bilder is a set of bash scripts. The [https://ice.txcorp.com/svnrepos/code/bilder/trunk/ trunk version of the scripts] will tell you exactly what bilder is doing if you know bash programming.
240
241
242
# Bilder's Build Types
243
244
The standard builds of Bilder are
245
246
* ser: static, serial build
247
* par: static, parallel (MPI) build
248
* sersh: shared, serial build
249
* parsh: shared, parallel (MPI) build
250
* cc4py: shared build compatible with the way Python was built
251
252
The Bilder standard is to install each build in its own directory.  While libtool allows shared and static builds to be done within the same build, cmake generally does not as discussed at [http://www.cmake.org/Wiki/CMake_FAQ#Library_questions].  Further, to do this on Windows, library names have to differ, as otherwise the static library and the shared interface library files would overwrite each other.  So in the end, is it simply easier to install shared and static libraries in their own directories.
253
254
In all cases, the builds are to be "as complete as possible".  E.g., for HDF5 on Darwin, shared libraries are not supported with fortran.  So in this case, sersh has to disable the fortran libraries.  However, completeness may depend on other criteria.  So, e.g., for trilinos, complete builds are provided, but so are builds that are as complete as possible and compatible with licenses that allow free reuse in commercial products.
255
256
## Static builds 
257
258
The static builds provide the most portable builds, as they eliminate or minimize the need to be compatible with any system shared libraries.  The are also the most widely supported.  For Windows, these mean libraries that import the static runtime library (libcmt).  Generally this means that, for Windows, one should not use a static dependency for a shared build of a project, as doing so typically leads to the dreaded runtime conflict, e.g., http://stackoverflow.com/questions/2360084/runtime-library-mis-matches-and-vc-oh-the-misery.
259
260
## Shared builds 
261
262
Shared builds allow one to reuse libraries among executables, but then one has the difficulty of finding those libraries at runtime.  This can be particularly difficult when moving an installation from one machine to another or when installing a package.  To minimize these headaches, Bilder, as much as possible, uses rpath on Linux.  However, packages need to figure out how to modify any executables or libraries post-build to make an installer.
263
264
## Cc4py builds 
265
266
This is a special build that is just a shared build using the compiler that Python was compiled with.  This is generally gcc for Unices and Visual Studio for Windows.  One adds a cc4py build only when the serial compiler is not the compiler used to build Python.
267
268
# Bilder Hierarchy
269
270
It is possible to specialize Bilder: per machine, per poject and per person. by sourcing file(s) at each level of hierarchy:
271
272
## Bilder default settings 
273
274
When no specialization files are used, Bilder uses the default settings for the project.
275
276
## By Machine 
277
278
Set of machine files under bilder/machines directory to specify machine specific variables and settings. For example, to build a project on Windows platform with cygwin using Visual Studio 9, we have cygwin.vs9 machine file which sets up the environment as needed by Visual Studio 9. The machine files can be specified by "-m" option.
279
280
## By Project 
281
282
Please see [wiki:ConfiguringBilder Configuring Bilder] on how to set up per project configurations. Here, information needed for the project such as where to obtain third party dependency libraries, default installation directories, set the various variables defining where builds should take place, where installations should go, etc. can be specified.
283
284
## By Person 
285
286
### Default settings using .bilddefrc 
287
288
Every person building a project using Bilder can specify his/her own default settings by creating a .bilddefrc file in their home directory. This will be sourced in the mkXYZall-default.sh file to override any other default project settings. 
289
290
### Settings using .bilderrc 
291
292
Every person building a project using Bilder can specify his/her own settings by creating a .bilderrc file in their home directory. This will be sourced in the mkXYZall.sh file to override any other project settings. 
293
294
## Per package per person 
295
296
In cases where it is necessary to specify settings per package per person, a XYZ.conf file can be specified in the BILDER_CONFDIR/packages directory. If found, this file will be sourced in the mkXYZ.sh script to override all the other settings. If this file is modified, then Bilder will reconfigure and build the package.
297
298
# Running Bilder Through The Defaults Scripts
299
300
The full set of options for Bilder are many, and this gives rise to the potential for mistakes. To facilitate this, we have created the defaultsfcns.sh and mkall-defaults.sh, and then then associated defaults scripts include the latter and execute runBilderCmd:
301
302
~~~~~~
303
    $ cat mkfcall-default.sh 
304
    #!/bin/bash
305
    #
306
    # Determine (and possibly execute) the default Bilder command
307
    # for Facetsall.
308
    #
309
    # $Id: mkfcall-default.sh 593 2012-03-09 15:26:46Z cary $
310
    #
311
    h2.#########################################################
312
    # 
313
    # Set the default variables
314
    mydir=`dirname $0`
315
    mydir=${mydir:-"."}
316
    mydir=`(cd $mydir; pwd -P)`
317
    # Where to find configuration info
318
    BILDER_CONFDIR=$mydir/bilderconf
319
    # Subdir under INSTALL_ROOTDIR where this package is installed
320
    PROJECT_INSTSUBDIR=facets
321
    source $mydir/bilder/mkall-default.sh
322
323
    # Build the package
324
    runBilderCmd
325
    res=$?
326
    exit $res
327
~~~~~~
328
329
The options,
330
331
~~~~~~
332
    $ ./mkfcall-default.sh -h
333
    source /Users/cary/projects/facetsall/bilder/runnr/runnrfcns.sh
334
    WARNING: runnrGetHostVars unable to determine the domain name.
335
    Usage: ./mkfcall-default.sh [options]
336
    This script is meant to handle some of the vagaries that occur
337
    at LCFs and clusters in large systems (which have complicated file
338
    systems) such as those that have high performance scratch systems
339
    and NFS mounted home systems.  This script is also meant to ease
340
    the use of non-gfortran compilers.
341
    OPTIONS
342
      -c              common installations: for non-LCFS, goes into /contrib,
343
                      /volatile or /internal, for LCFSs, goes into group areas
344
      -C              Install in separate tarball and repo install dirs 
345
                      (internal/volatile) rather than software
346
      -E "<options>"  quoted list of extra options to pass to the mk script
347
      -f <file>       File that contains extra arguments to pass
348
                      Default: .extra_args
349
      -F <compiler>   Specify fortran compiler on non-LCF systems
350
      -g              Label the gnu builds the same way other builds occur.
351
      -H <host name>  use rules for this hostname (carver, surveyor, intrepid)
352
      -h              print this message
353
      -i              Software directory is labeled with "internal" if '$USER'
354
                      is member of internal install list
355
      -I              Install in $HOME instead of default location
356
                      (projects directory at LCFs, BUILD_ROOTDIR on non-LCFs)
357
      -j              Maximum allowed value of the arg of make -j
358
      -k              On non-LCFs: Try to find a tarball directory (/contrib)
359
                      On LCFs:     Install tarballs (instead of using facetspkgs)
360
      -m              force this machine file
361
      -n              invoke with a nohup and a redirect output
362
      -p              just print the command
363
      -q <timelimit>  run in queue if possible, with limit of timelimit time
364
      -t              Pass the -t flag to the  mk script (turn on testing)
365
      -v <file>       A file containing a list (without commas) of declared
366
                      environment variables to be passed to mk*.sh script
367
      -w <file>       Specify the name of a file which has a comma-delimited
368
                      list of packages not to build (e.g.,
369
                      plasma_state,nubeam,uedge) Default: .nobuild
370
      --              End processing of args for mkall-default.sh, all remaining
371
                      args are passed to the script.
372
~~~~~~
373
374
375
mostly deal with which directory is to be used for installation, what is the time limit for the build, any extra options to be passed to the build, whether on the command line or in a file.
376
377
An example invocation look like
378
379
~~~~~~
380
mkfcall-default.sh -cin -- -oXZ -E BUILD_ATLAS=true
381
~~~~~~
382
383
which will (c) install in areas common to all users, (i) using the internal rather than the volatile directory for repo installations, (n) in background via nohup, -- what follows are more args for the base script, which are (o) build openmpi if on OS X or Linux, (X) build the newer, experimental packages, (Z) do not invoke the user defined bilderFinalAction method, (E) set this comma delimited list of environment variables, in this case to build Atlas if on Linux or Windows.
384
385
# Using Jenkins with Bilder
386
387
### Setting up Jenkins for use with Bilder ###
388
389
This set of pages is intended to describe how to set up the Jenkins continuous integration tools for launching Bilder jobs (which then handle the builds and testing). It is not intended to describe the most general way to set up Jenkins, but instead it describes a way that relies on having a Linux master node.
390
391
392
## Starting up a Linux Jenkins master node 
393
394
Install Jenkins using the installation mechanism for your platform.  E.g., see
395
https://wiki.jenkins-ci.org/display/JENKINS/Installing+Jenkins+on+RedHat+distributions.
396
397
*IMPORTANT:* Before starting Jenkins for the first time:
398
* create the directory where Jenkins will do its builds (known as _JENKINS_HOME_, not to be confused with the Jenkins home directory in /etc/passwd, which is initially set to /var/lib/jenkins, which we will assume here)
399
* set the permissions of the Jenkins build directory (e.g., /home/bilder/jenkins)
400
* Add jenkins to any groups as needed (e.g., contrib, research, xxusers)
401
* modify {{{/etc/sysconfig/jenkins}}} as needed.  Our settings are
402
403
404
~~~~~~
405
    JENKINS_HOME="/home/bilder/jenkins"
406
    JENKINS_PORT="8300"
407
    JENKINS_AJP_PORT="8309"
408
    JENKINS_ARGS="--argumentsRealm.passwd.jenkins=somepassword --argumentsRealm.roles.jenkins=admin"
409
~~~~~~
410
411
(_somepassword_ is not literal.)
412
413
Create an ssh key for jenkins:
414
415
416
~~~~~~
417
    sudo -u jenkins ssh-keygen
418
~~~~~~
419
420
421
It cannot have a passphrase.
422
423
424
Start the jenkins service:
425
426
427
~~~~~~
428
    sudo service jenkins start
429
~~~~~~
430
431
432
Set Jenkins to start on boot:
433
434
~~~~~~
435
    sudo chkconfig --level 35 jenkins on
436
~~~~~~
437
438
## Preparing a Unix Jenkins slave node 
439
440
We will have one node prepared to act as a Jenkins slave for now.  For ease, we will create a Unix slave.  Later we will add more slaves.
441
442
* On the service node, create the user who will run Jenkins.
443
* As that user create the directory where Jenkins will work
444
* Add that user to any groups needed to give it appropriate permissions (e.g., contrib, research, xxusers)
445
* For public-key authentication
446 3 Redmine Admin
  * Add the public key created above for jenkins to that user's ~/.ssh/authorized_keys
447
  * On the master, check that you can do passwordless login by trying: "_sudo -u jenkins ssh jenkins@yourhost_"
448 1 Redmine Admin
* For password authentication
449 3 Redmine Admin
  * Configure /etc/sshd_config to allow password authentication (PasswordAuthentication yes) and restart sshd
450 1 Redmine Admin
451
## Configuring the Linux Jenkins master node 
452
453
* Open a browser and go to _master.yourdomain:8300_ and log in as admin with the password that you set in the JENKINS_ARGS variable, above.
454
* Go to Manage Jenkins -> Manage Plugins -> Available and install the plugins,
455 3 Redmine Admin
  * Jenkins cross-platform shell (XShell)
456
  * Conditional Build-Step
457
  * Matrix Tie Parent
458
  * Jenkins build timeout (Build-timeout)
459 1 Redmine Admin
* Go to Manage Jenkins -> Manage Users and then use _Create User_ to create the users for your Jenkins installation. Make sure to create an administrative user perhaps yourself).
460
* Go to Manage Jenkins -> Configure system and select/set
461 3 Redmine Admin
  * Enable security
462
  * Jenkins's own user database
463
  * _If you wish_, allow users to sign up
464
  * Project-based Matrix Authorization Strategy
465 4 Redmine Admin
      * Add an administrator name with all privileges
466
      * Give anonymous user Overall Read (only)
467 3 Redmine Admin
  * Default user e-mail suffix: e.g., @yourdomain
468
  * Sender: jenkins@yourdomain
469 1 Redmine Admin
* Go to Manage Jenkins -> Manage Nodes -> New Node
470 4 Redmine Admin
  * Fill in name
471
  * Dumb Slave
472
  * You are taken to the configure form:
473
      * \# of executors = 1
474
      * Remote FS root: what you decided upon when creating the slave
475
      * Usage: Leave this machine for tied jobs onlye
476
      * Launch methog: Launch slave gents on Unix machines via SSH
477
      * Advanced:
478
          * Host 
479
          * Username (jenkins)
480 1 Redmine Admin
481
## Creating your first Bilder-Jenkins project 
482
483
We will create the first project to build on the master node. Later we will add more nodes.
484
485
* Go to Jenkins -> New Job
486
    * Build multi-configuration project
487
* Set name (here we will do visitall as our example)
488
* Enable project-based security
489
    * For open source builds, give Anonymous Job Read and Job Workspace permission
490
    * Add user/group as needed
491
* Source Code Management
492
    * Subversion
493
    * Put in your URL, e.g., https://ice.txcorp.com/svnrepos/code/visitall/trunk
494
    * Put in your svn credentials as requested
495
* Build Triggers (examples)
496
    * Build Periodically
497
        * Enter cron parameters, e.g., 0 20 * * *
498
    * Or have this build launched as a post-build step of another build
499
* Configuration Matrix
500
    * Add axis -> slaves (is this available before we add nodes?)
501
        * Add master and the above unix slave
502
* Build Environment
503
    * Abort the build if stuck (if desired)
504
        * Enter your timeout
505
    * Tie parent build to a node
506
        * Select master node
507
* Build
508
    * Add build step -> Invoke XShell command
509
        * Command line: bilder/bildtrol/unibild -d mkvisitall
510
        * Executable is in workspace dir
511
* Post-build Actions
512
    * Aggregate downstream test results
513
        * Select both
514
    * Archive the artifacts (select, see below for settings)
515
    * E-mail Notification
516
        * Set as desired
517
518
519
## Creating a Windows Slave 
520
521
* Get all tools in place on the slave machine by following the instructions at https://ice.txcorp.com/trac/bilder/wiki/BilderOnWindows
522
* Create the jenkins user account (if not already defined) as an Administrative account and log into the windows machine as the jenkins user
523
* Make sure the slave's Windows name and its domain name are consistent.
524
* Install Java (see http://www.java.com) and update the path to include `C:\Windows\SYSWOW64` if on 64 bit Windows and then `C:\Program Files (x86)\Java\jre6\bin`
525
* Create the build directory (e.g., C:\winsame\jenkins)
526
* Set the owner to that directory to the jenkins user via properties->security->advanced->owner.
527
* Install the Visual C++ redistributables from http://www.microsoft.com/download/en/details.aspx?id=5582
528
* Follow the networking, registry, and security related instructions at https://wiki.jenkins-ci.org/display/JENKINS/Windows+slaves+fail+to+start+via+DCOM
529
* (Cribbing from https://issues.jenkins-ci.org/browse/JENKINS-12820)
530
* Start a web browser on the windows slave and connect to the master jenkins web page.
531
* Manage Jenkins -> Manage Nodes -> New Node
532
* Create a new node (Slave node)
533
  * Fill in name, choose it to be the same as the Windows name of the slave
534
  * Dumb Slave
535
  * In the configure form, set
536
    * \# of executors: 1
537
    * Remote FS root: the directory created above (e.g., C:\winsame\jenkins)
538
    * Usage: Leave this machine for tied jobs only
539
    * Launch method: Launch slave agents via java web start
540
* Launch the slave
541
* Press the newly appeared button: Launch by webstart  
542
* A pop up window will be visible with a message as "Connected"
543
* In that pop up window click File-> Install as Windows Service
544
* Find jenkins service in the control panel, ensure that the owner is the jenkins user
545
* UNCHECKED: Set startup to Automatic
546
* Return to browser, take slave node off line in jenkins
547
* Set launch method to: Windows slave as a Windows Service
548
  * Advanced:
549
    * Administrative username (jenkins) ''You may need to type it as computername\jenkins if you get an invalid service account error''
550
    * Password (set as selected in slave setup)
551
    * Use Administrator account given above
552
* Relaunch slave node
553
554
### Use the Slave on the Master 
555
556
You should now be able to select this slave as a build node.
557
558
559
## Launching Bilder through Jenkins 
560
561
Jenkins runs Bilder through the scripts in the bildtrol subdir using the XShell command. The XShell command, when configured to launch _somescript_, actually invokes _somescript.bat_ on Windows and _somescript_ on unix.  The Bilder _.bat_ scripts simply translate the arguments and use them in a call to _somescript_, which is run through Cygwin.
562
563
### Building and testing: jenkinsbild 
564
565
The script, _jenkinsbild_, launches a build from Jenkins using the default scripts.  For this example, we consider building the visualization package _VisIT_, for which the repo is _https://ice.txcorp.com/svnrepos/code/visitall/trunk_.  This repo uses externals to bring in the VisIT source code.  In this
566
case, the simplest XShell command is
567
568
~~~~~~
569
    bilder/jenkins/jenkinsbild mkvisitall
570
~~~~~~
571
572
which leads to execution of
573
574
~~~~~~
575
    ./mkvisitall-default.sh -t -Ci -b builds-internal  -- -Z -w 7
576
~~~~~~
577
578
which action is described in the rest of the Bilder documentation, but in particular, testing is invoked (-t), packages and repos are installed in separate areas (-C), use the ''internal'' directory for repo installation, do not do any post build action (-Z), and if a build less than 7 days old is found, do not execute the build (-w 7). The arguments after *--* are passed directly to the underlying Bilder script, mkvisitall.sh.
579
580
The _jenkinsbild_ script has very few options:
581
582
583
~~~~~~
584
    Usage: $0 [options]
585
    GENERAL OPTIONS
586
      -b ................ Use internal/volatile build directory naming
587
      -m ................ Use mingw on cygwin
588
      -n ................ Do not add tests
589
      -p ................ Print the command only.
590
      -s step ..........  Sets which build step: 1 = internal, 2 = volatile.
591
      -2 ................ Sets build step to 2.
592
~~~~~~
593
594
At present, the internal/volatile build directory naming is in fact always true.  In this case, the first step (the default) builds in the subdir, *builds-internal*, and the second step (selected with -2 or -s 2) builds in the subdir, *builds-volatile*. Correspondingly, the repo installation directory is the *internal* directory on step 1 and the *volatile* directory on step 2.
595
596
Using mingw on cygwin (-m) is useful for codes that cannot build with Visual Studio.
597
598
Not adding the tests is useful in many instances where one is counting on only a few hosts to do testing.
599
600
The build step (-s2 or -2) will build in *builds-volatile* and install in the volatile directory, but it also determines several options by looking at the email subject of any step-1 build.
601
602
This is geared towards a general philosophy of having two builds, the stable (or internal) build that is done more rarely, and a volatile build that is done every night. So what is done in step 2 depends on the step 1 result, which can be determined from the email subject file, left behind. There are four cases:
603
604
* Step 1 did nothing as there was a sufficiently recent build.  Then step 2 does a full build with tests.
605
* Step 1 was fully successful, both builds and tests.  Then step 2 is not executed.
606
* Step 1 builds succeeded, but some tests failed (and so some packages were not installed).  Then step 2 is executed without testing, as that was done in step 1, and this permits installation of the built, but untested packages.
607
* Step 1 builds failed (and so corresponding tests were not attempted). Then step 2 is not executed, as it will fail as well.
608
609
The error code returned by jenkinsbild for is success (0) if even only the builds succeeded but not the tests. This way the dashboard indicates jenkinsbild build success only. A subsequent job, jenkinstest, determines whether tests passed by examining the email subjects left behind.
610
611
For either build step, one wants to archive the artifacts,
612
613
~~~~~~
614
mk*all.sh,jenkinsbild.log,builds-*/bilderenv.txt,builds-*/*-summary.txt,\
615
builds-*/*.log,builds-*/*-chain.txt,*/*-preconfig.sh,*/preconfig.txt,\
616
builds-*/*/*/*-config.sh,builds-*/*/*/*-config.txt,\
617
builds-*/*/*/*-build.sh,builds-*/*/*/*-build.txt,\
618
builds-*/*/*/*-test.sh,builds-*/*/*/*-test.txt,\
619
builds-*/*/*/*-submit.sh,builds-*/*/*/*-submit.txt,\
620
builds-*/*/*/*-install.sh,builds-*/*/*/*-install.txt,\
621
*tests/*-config.sh,*tests/*-config.txt,*tests/*-build.sh,\
622
*tests/*-build.txt,*tests/*-install.sh,*tests/*-install.txt,\
623
*tests/runtxtest-*.txt,*tests/*-txtest.log,\
624
builds-*/*/*/*-Darwin-*.dmg,builds-*/*/*/*-win_x??-*.exe,\
625
builds-*/*/*/*-Linux-x86*-*.tar.gz
626
~~~~~~
627
628
in order to collect all results of builds and tests and any created installers.
629
630
### Posting test results: jenkinstest 
631
632
633
# Bilder Architecture
634
635
Bilder has a largely Object Oriented structure, even though it is written in Bash. But like all (even OO) programs, it has a procedural aspect.  Further it is task oriented (as opposed to event driven), with clear start and conclusion. We will break this architecture down into these three aspects: the task flow, the primary objects, and the procedures.
636
637
## Task flow 
638
639
Bilder scripts, like mkvisitall.sh, begin by setting some identifying variables, BILDER_NAME, BILDER_PACKAGE, ORBITER_NAME, and then continue by sourcing bildall.sh, which brings in the Bilder infrastructure: initializations of variables and methods used for building, testing, and installing packages.
640
641
### Global method definition 
642
643
The file, bildall.sh, brings in all of the global methods by first sourcing runr/runrfcns.sh, which contains the minimal methods for executing builds in job queues and reporting the results. It then obtains all of the more Bilder-specific methods by sourcing bildfcns.sh. These include generic methods for determining the build system, preconfiguring, configuring, building, testing (including running tests and collecting results), and installing. These files are the heart of Bilder, as they do all the heavy lifting.
644
645
A trivial, but important method is _techo_, which prints output to both stdout and to a log file.  Another is _decho_, which does the same, but only if DEBUG=true, which is set by the option _-d_.
646
647
### Option parsing 
648
649
Options are parsed through the sourcing of bildopts.sh, which is sourced by bildall.sh.  It then sets some basic command-line-argument derived variables, such as the installation directories, which it checks for writability. This file, bildopts.sh, has been written in such a way that Bilder-derived scripts (like mkvisitall.sh) can add their own arguments.
650
651
### Initialization 
652
653
Initialization is carried out by sourcing of two files, bildinit.sh and bildvars.sh (which are both sourced by bildall.sh). The purpose of bildinit.sh is to handle timing, to clear out indicating variables (like PIDLIST and configFailures), get the Bilder version, and define any path-like environment variables that might get changed in the course of the run.
654
655
The purpose of bildvars.sh is to determine useful variables for the build.  The first comes from a possible machine file, then by OS (AIX, CYGWIN, Darwin, or Linux; MinGW is a work in progress). Then unset variables are set to default
656
values. These variables contain the compilers for serial (front-end nodes), back-end nodes, parallel, and gcc (as some packages build only with gcc, and the names of the gcc compilers can vary from one system to another).  As well,
657
the flags for all of these compilers are set. 
658
659
There are some packages that are so basic, that bilder defines variables for them.  These include HDF5, the linear algebra libraries (lapack, blas, atlas), and Boost. These definitions allow the locations of these libraries to be defined on a per machine basis. This is needed particularly for LCFs, which have special builds of HDF5, BLAS, and LAPACK, and for CYGWIN, which must have Boost to make up for deficiencies in the Visual Studio compilers.
660
661
Finally, bildvars.sh prints out all of the determined values.
662
663
### Package building 
664
665
A Bilder-derived script, like _mkvisitall.sh_, after sourcing _bildall.sh_, then builds packages in groups. In the simplest case, a package is built in a straight-through sequence, like
666
667
~~~~~~
668
    source $BILDER_TOPDIR/bilder/packages/facets.sh
669
    buildFacets
670
    testFacets
671
    installFacets
672
~~~~~~
673
674
(The call to _testFacets_ can be ignored if thepackage is not tested.)  The methods for building, testing, and installing a package are defined in the ppropriate file under the packages subdirectory.
675
676
Bilder, however, has the capability of doing threaded builds, such as in the sequence,
677
678
~~~~~~
679
    source $BILDER_TOPDIR/bilder/packages/trilinos.sh
680
    buildTrilinos
681
    source $BILDER_TOPDIR/bilder/packages/txphysics.sh
682
    buildTxphysics
683
    source $BILDER_TOPDIR/bilder/packages/txbase.sh
684
    buildTxbase
685
    installTxbase
686
    installTxphysics
687
    installTrilinos
688
~~~~~~
689
690
In this case, all of the builds for _Trilinos_, _TxPhysics_, and _TxBase_ are launched and so are occurring simultaneously. Then _installTxbase_ waits for
691
the _TxBase_ build to complete, then it installs it. Then it waits on and installs _TxPhysics_ and _Trilinos_.
692
693
This ability to build multiple, non-interdependent packages simultaneously is a key feature of Bilder. It leads to great savings in time, especially with packages that must be built in serial due to a lack of dependency determination.
694
695
### Concluding 
696
697
The last part of the task flow is to install the configuration files, to summarize the build, and to email and post log files, build files, and the summary. The configuration files, which are created by _createConfigFiles_ and installed by _installConfigFiles_ into the installation directory, contain the necessary additions to environment variables to pick up the installed software.
698
699
The method, _finish_, then does the remaining tasks. It creates the summary file and emails it to the contact specified by the option parsing.  It then posts all log and build files to Orbiter.
700
701
## Package files 
702
703 6 John Cary
Package files define how a package is acquired, how it is configured for building on the particular platform for all builds, how all builds are done, and how they are all installed.  Here we introduce an important distinction: the **tarball packages** are those obtained in the tar.gz format; the **repo packages** are obtained from a Subversion source code repo. Generic tarball packages are found in the Tech-X maintained Subversion repo at https://ice.txcorp.com/svnrepos/code/numpkgs and are available by anonymous svn. The repo packages are typically svn externals to a Bilder project, e.g., for visitall
704 1 Redmine Admin
705
~~~~~~
706 6 John Cary
visitall$ svn pg svn:externals .
707
bilder http://ice.txcorp.com/svnrepos/code/bilder/trunk
708
bilderconf http://ice.txcorp.com/svnrepos/code/bilder/bilderconf/trunk
709
visit http://portal.nersc.gov/svn/visit/trunk/src
710
visitwindows/distribution http://portal.nersc.gov/svn/visit/trunk/windowsbuild/distribution
711
visittest/data http://portal.nersc.gov/svn/visit/trunk/data
712
visittest/test http://portal.nersc.gov/svn/visit/trunk/test
713 1 Redmine Admin
~~~~~~
714
715
Though written in _Bash_, Bilder uses object concepts. Each package file under packages acts an object, with instantiation, exposed (public) data members, private data members, and methods.  As in OO, these **package-build** objects have the same data members and a common interface.
716
717
Instantiation is carried out by sourcing the package file. At this point, the data associated with that package is initialized as necessary.
718
719
### Package-build data 
720
721
The public data members for a package _PKG__ are
722
723
~~~~~~
724
    PKG_BLDRVERSION # Either the version to install or the
725
                    # version from the code repository
726
    PKG_DEPS        # The dependencies of this package
727
    PKG_BUILDS      # The names of the builds for this package
728
    PKG_UMASK       # The "umask" that determines the permissions
729
                    # for installation of this package
730
~~~~~~
731
732
In the syntax of C++, the first underscore would be represented by '.', i.e., pgk.DEPS. Even dynamic binding can be implemented in _Bash_.  I.e., if one
733
has _pkgname_ that holds the name of a package, one can can extract, e.g., BLDRVERSION via
734
735
~~~~~~
736
    vervar=`echo $pkgname | tr 'a-z./-' 'A-Z___'`_BLDRVERSION
737
~~~~~~
738
739
Admittedly, many of these constructs would more easily be accomplished in a language like Python that naturally supports object orientation. The trade-off is that then one does not have the nearly trivial expression of executable invocation or threading that one has in _Bash_.
740
741
In addition, there are the per-build, public variables _PKG_BUILD_OTHER_ARGS_ (e.g., _FACETS_PAR_OTHER_ARGS_ or _BABEL_STATIC_OTHER_ARGS_. These are added to the command-line when configuring a package.  In some cases, a package has more than one builds system, like HDF5, in which case one has two sets of variables, e.g., _HDF5_PAR_CMAKE_OTHER_ARGS_ and _HDF5_PAR_CONFIG_OTHER_ARGS_.
742
743
### Exposed package-build methods 
744
745
All package files are supposed to provide three methods, e.g., _buildPkg_, _testPkg_, and _installPkg_, where "Pkg" is the name of the package being built.  E.g., FACETS has buildFacets, testFacets, installFacets. For untested packages, the second method can simply be empty.
746
747
The method, _buildPkg_, is supposed to determine whether a package needs to be built.  If so, it should either acquire a tarball package or preconfigure (prepare the build system for) a repo package, then configure the package, and finally launch the builds of the package.  Preconfiguring in the example of an _autotools_ package involves invoking the _autoreconf_ and other executables for creating the various configuration scripts.  In many other cases there is no associated action.  If the Bilder infrastructure is used, then all builds are executed in a separate thread, and at the end of the _buildPkg_ method all the process IDs for these builds have been stored in both the variable PIDLIST, and the particular process ID for build "ser" of package "pkg" is stored in the variable, PKG_SER_PID.
748
749
The method, _testPkg_, is supposed to determine whether a package is being tested.  If not, it simply returns.  But if the package is being tested, then _testPkg_ executes _wait_ for each build.  Upon successful completion of
750
all builds, the tests are launched.  These are treated just like builds, so the process IDs are stored as in the case of builds.
751
752
The last method, _installPkg_, in the case of a tested package, waits for the tests to complete, then installs the package if the tests completed successfully, after which is sets the tests as being installed, so that tests will not be
753
run again unless the version or dependencies of the package change.  In the other case, where the package is not being tested, it waits for the builds to complete and installs any successful builds.
754
755
All three methods for any package are supposed to compensate for any errors or omissions in the build systems. Errors include fixing up library dependencies on Darwin, setting permissions of the installed software, and so forth. 
756
757
The object-oriented analogy is that each package-build object has an interface with three methods.  The syntax translation is _buildPkg_ -> _pkg.build_.
758
759
### Private package-build data 
760
761
In the course of its build, any package will generate other variables with values. These are organized on a per-build basis, and so one can think of each 
762
package-build object as containing an object for each build of that package.  
763
764
### Internal objects 
765
766
Builds
767
768
Tests
769
770
###  Combined package objects 
771
772
###  Future directions
773
774
Dependency determination.
775
776
777
# Linear Algebra Libraries in Bilder
778
779
There are a wide variety of ways to get LAPACK and BLAS: Netlib's libraries (reference LAPACK and BLAS), CLapack (for when one does not have a Fortran compilers), ATLAS (for cpu-tuned libraries), GOTOBLAS (from TACC), and system libraries (MKL, ACML).
780
781
For numpy and all things that depend on it, Bilder uses ATLAS (if it has been built), and otherwise it uses LAPACK.
782
783
For non Python packages, the current logic is
784
785
## Darwin 
786
787
Always use -framework Accelerate
788
789
## Linux and Windows 
790
791
* SYSTEM_LAPACK_SER_LIB and SYSTEM_BLAS_SER_LIB are used if set.
792
* Otherwise, if USE_ATLAS is true, then ATLAS is used.
793
* Otherwise, use Netlib LAPACK if that is found.
794
* Otherwise
795
    * If on Windows, use CLAPACK
796
    * If on Linux, use any system blas/lapack
797
798
The results of the search are put into the variables, CMAKE_LINLIB_SER_ARGS, CONFIG_LINLIB_SER_ARGS, LINLIB_SER_LIBS.
799
800
801
# Extending Bilder
802
803
Bilder builds packages using the general logic in bildfcns.sh, the operating-system logic in bildvars.sh,
804
logic for a particular package in the _Bilder package file_ (kept in the packages subdir), logic for a
805
particular machine in the _Bilder machine file_ (kept in the machines subdir), and additional settings for a particular package on a particular machine in the Bilder machine file.  To extend Bilder, one adds the files that introduce the particular logic for a package or a machine.
806
807
* [[Adding support for a new package]]
808
* [[Adding support for a new machine or operating system/compiler combination]]
809
810
# Debugging your builds
811
812
This section describes some of the things that can go wrong and explains how to fix them.
813
814
# Bilder With IDEs
815
816
This is a page to collect notes on using IDEs to develop code while at the same time using bilder to build the project.
817
818
* Reference for using Eclipse with CMake, http://www.vtk.org/Wiki/CMake:Eclipse_UNIX_Tutorial