root@raspberrypi:~# apt-get install omd-1.00
This blog will be continued on blog.consol.de.
Read great stories and news from our colleagues over there.
labs.consol.de will continue to host the OMD repository and static pages. Existing blog entries will also stay here for archival reasons.
One of our customers is in the process of decommissioning their OpenShift v3.11 cluster. This cluster is currently still used for building customer-specific base images. Over time, quite a few elaborate pipeline builds (based on Jenkins) have been developed for that purpose.
The customer wanted me to migrate the existing pipeline builds on their v3.11 cluster to Tekton (aka OpenShift Pipeline) builds running on their new v4.9 cluster. This task turned out to be quite pesky. Tekton is a beast in many aspects.
Documenting software is part of the everyday business of a software developer and engineer. Especially for integration scenarios, a diagram of the flow of a message through the system or the whole landscape is an essential illustration. Fortunately, there are standardised messaging patterns which can be used. Unfortunately, however, there is no tool which can create such visualizations out of the box directly from source code. In this article we will have a look at Apache Camel and how it is possible to get a graphical representation of an integration route. We will also discuss debugging it, as some tools offer this feature.
Today, software often needs to run in cloud environments. Newly developed software, especially microservices, is built with cloud readiness in mind.
But we not only have microservices in business environments, we also have integration software. This type of software is designed and developed to connect external services to internal ones.
This article is the author’s opinion on similarities and differences between Streaming and Messaging.
The first time I dealt with the terms messaging and streaming was during my master’s thesis in 2016. Among other things, the thesis was about different strategies of microservices integration. During that time, the term messaging was popular. Moreover, Kafka, which is a streaming platform, was popular, too. From a high-level perspective, messaging, Kafka and streaming seem to be the same thing… but I never understood why we have these two terms which are used synonymously in many contexts: messaging and streaming. This article is my answer to that question.
Some time ago, I started a project to create a Helm based operator for an OpenShift application. I used the Operator SDK to create the Helm operator. The Operator SDK documentation describes the parameters pretty well, and it contains a simple tutorial. It does not, however, describe the complete development cycle. This article aims to describe everything from creating the operator to the point where you can upload it to OperatorHub.io. We start with a basic Helm chart, with which you can install Nginx as a StatefulSet. You can find the source code in my GitHub repo. Before we can start creating an operator, we need to fulfill some prerequisites.
The first version of RabbitMQ was released in 2007. Back in those days, the goal was to provide a complete open source implementation of the Advanced Message Queuing Protocol (AMQP), aiming at modern messaging needs such as high availability, high performance, scalability and security.
Nowadays, RabbitMQ is one of the most popular message brokers and can be found in several domains.
This article highlights RabbitMQ’s core concepts and compares it with ActiveMQ Artemis and AWS SQS.
Last summer I watched the Red Hat master course about Kafka by Sébastien Blanc. The Kafka setup in Kubernetes presented in the course looked pretty easy. The Kafka client implementation for Java seemed to be easy as well. Furthermore, I had wanted to use Kafka for a long time, so I got the idea to extend my Istio example: each time a service is called, a message is sent to a topic. The service (implemented in Quarkus), as well as the Kafka cluster, should be in an Istio service mesh and secured with mTLS. I found descriptions by Joel Takvorian showing that Kafka works with Istio, so I knew (or at least hoped) that my plan should work.
This article will describe the overall architecture of the example and which obstacles I encountered during deployment.
AWS Comprehend is a great tool when you want to extract information from textual data. As a managed service, it is really easy to set up and can be used with next to no prior knowledge of machine learning. But there is one minor thing that bugs me about Comprehend: the output.
TL;DR: output.tar.gz bad, flat JSON file good.
See the Python code below for the transformation.
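A minimal sketch of that transformation (assuming the archive contains a file named `output` with one JSON object per line, as Comprehend’s batch jobs produce; adjust the names if yours differ):

```python
import json
import tarfile

def flatten_comprehend_output(archive_path, result_path):
    """Flatten Comprehend's output.tar.gz (JSON-lines inside) into one flat JSON array."""
    records = []
    with tarfile.open(archive_path, "r:gz") as tar:
        for member in tar.getmembers():
            f = tar.extractfile(member)
            if f is None:  # skip directory entries
                continue
            # each non-empty line is one JSON result object
            for line in f.read().decode("utf-8").splitlines():
                if line.strip():
                    records.append(json.loads(line))
    with open(result_path, "w") as out:
        json.dump(records, out, indent=2)
    return records
```

The resulting file holds a single JSON array, which is much friendlier to tools like pandas or jq than the raw JSON-lines archive.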
Automatic integration tests as part of the development life cycle can save a lot of time and money - not only when dealing with other service APIs or offering some, but also if the application uses a database or other infrastructure services.
We at ConSol have had a lot of good experience developing integration tests as part of the life cycle from the beginning of a project. For this, the Citrus framework is often a good choice to automate them.
But there are other frameworks and libraries which can be useful. In this article, we’ll have a look at Testcontainers. By using a sample microservice, we will show how Testcontainers can be used and what chances it provides.
So you have this nifty web application deployed on your OpenShift cluster and you want to make it accessible to the whole world with HTTPS under the name coolapp.<mydomain>. Unfortunately you face several issues:
Exposing the service of your web application leaves you with a route using the self-signed certificate that was generated during setup of the cluster. No browser in the wild will trust this certificate.
The self-signed certificate dictates URLs of the form https://<appname>.apps.<clustername>.<mydomain> (or whatever domain suffix you configured). Not very nice.
You might mitigate the previous issues by getting an official certificate signed by a generally trusted institution. But you will have to pay for it.
And you will have to pay for it not just once but every year (at the latest every 398 days) thanks to recently tightened certificate policies introduced by all major browser vendors.
Worst of it all: You must not (by any means) forget to apply for a new certificate in a timely manner and replace the certificate in your route before the old one expires. Otherwise some people might get pretty angry with you.
Let’s Encrypt to the rescue!
During a discussion with a customer, we talked about which steps are necessary to add an application to a service mesh - which should be no big deal. Unfortunately, there is no simple guideline on how to do that for the Red Hat OpenShift Service Mesh. Furthermore, I was not sure how the requests of the application would look in Jaeger. To clarify these points, I created a small application, deployed it on OpenShift and added it to a service mesh control plane. This is the documentation of the steps I have taken.
During this year’s Red Hat Summit I had the chance to get a glimpse of the latest version of Kiali. This version has some nice features, like showing the traffic flow of the application graph during a time period (graph replay). It also contains wizards to create destination rules and virtual services. This demo piqued my curiosity to get my hands on this Kiali version. One obstacle for me was that my Kiali is running in Red Hat OpenShift Service Mesh and is controlled by the Kiali operator. Currently, it is using version 1.12. The version that I wanted to try was the latest release version (1.17), which the Red Hat OpenShift Service Mesh does not support. This article describes what we need to do in order to replace the Kiali version of a Red Hat OpenShift Service Mesh with the latest version of Kiali.
Some time ago, I did a webinar about the Red Hat Service Mesh, which is based on Istio. For this webinar, I prepared a demo application. Among other things, I wanted to show how to do authentication with a JWT token in general and, more specifically, with Keycloak. This article will describe how to configure Keycloak. In the second article, I will show you what problems I encountered running the application in Istio and how I figured out what was wrong in my configuration. You can find that article here
In this article, I’m going to describe what we can do if we have configured our application to use Istio, but it is not working as intended. Originally, I wanted to give a detailed description of the problems I encountered during the creation of my webinar and how I fixed them. However, I came to a point where this would have become a very long article. I hope you don’t mind that I shortened it and just describe which tools are available to debug the Istio configuration. In my previous article I described how to configure Keycloak for my webinar. So without further ado, let’s start.
In this article, I will show you how to install Red Hat OpenShift Container Platform 4.3 (OCP) on VMware vSphere with static IP addresses, using the OpenShift installer in UPI mode and Terraform. In contrast to the official OpenShift 4.3 install documentation, we will not use DHCP for the nodes and will not set them up manually - instead we will use static IP addresses and Terraform to set up the virtual machines in our vCenter.
So here is another installment of our series Installing Blahblahblah on OpenShift. This time it is about getting MongoDB to run on OpenShift - in the way recommended and promoted by the MongoDB guys. The whole setup is still in beta stage, as indicated by these two entries in Red Hat’s container image catalog. You can get your MongoDB instance up and running on OpenShift, but most of the required steps have to be performed on the command line - contrary to the impression given by MongoDB, Inc. that, once you get the MongoDB Operations Manager up and running, everything can be achieved via this tool’s GUI. Some operations in the Operations Manager simply do not work (yet) on OpenShift.
With the release of OpenShift 4.x, Red Hat left no stone unturned (compared to the previous 3.x versions). Among many other things, Minishift became Red Hat CodeReady Containers. Having been a big fan of Minishift, I recently wanted to give CodeReady Containers (aka CRC) a try.
It turned out this is not that easy - at least if you want to run CRC on a Linux distribution that does not come from Red Hat (or its community). This article gives instructions for all those people out there who want to run CodeReady Containers on Ubuntu.
Update 2020-12-17: According to this comment on GitHub by one of the maintainers / developers of Red Hat CodeReady Containers the issues with Ubuntu have been resolved in the latest version of CRC.
AWS Cloud Development Kit (CDK) is a relatively new kid on the block. It is a tool for defining Infrastructure as Code (IaC) and is considered to be the future successor of AWS CloudFormation.
This article gives an overview of the IaC approach, introduces the reader to the AWS CDK, shows what problems it aims to solve, and presents a simple example application implemented with it.
GraphQL is a nice way to publish a highly customizable API. In combination with Spring Boot, which makes development really easy and offers features like database integration and security, you can quickly build your API service from scratch.
This is the second part of the series, in which we create a REST service based on Spring Boot that will be turned into a GraphQL service in the third part of this little series.
We recently had to install a bunch of applications on a customer’s shiny new OpenShift 3.11 cluster - among others, GitLab. It turned out getting GitLab up and running on OpenShift is not so easy. What I found on the Internet about installing GitLab on OpenShift was partly outdated and not 100% accurate. Most information was about getting GitLab into a Kubernetes cluster, so I had to adapt this information to the situation in an OpenShift cluster.
This article is the conclusion of all these findings and efforts and gives a step-by-step recipe on how to install GitLab on OpenShift.
One of the most challenging questions in cloud environments is how secure my application is when it is deployed in the public cloud.
It’s no secret that security aspects are much more important in a public cloud than they were in classic environments.
But don’t be surprised that many applications, even in the public cloud, don’t follow best-practice security patterns.
This has several reasons - for example, the time and costs of achieving a high security level can be considerable.
But in fact AWS and Kubernetes offer many options which let you improve your security level without too much effort.
I would like to share some of the possibilities that you have when creating a secure AWS EKS cluster.
GraphQL is a nice way to publish a highly customizable API. In combination with Spring Boot, which makes development really easy and offers features like database integration and security, you can quickly build your API service from scratch.
This is the start of a series of articles showing you the way to a Spring Boot powered REST service with a GraphQL API.
Under the name “Managed Kubernetes for AWS”, or EKS for short, Amazon offers its own dedicated solution for running Kubernetes on its cloud platform. The way this is provided is quite interesting: while the Kubernetes master infrastructure is offered “as a service” (and also billed separately), the Kubernetes worker nodes are simply EC2 instances for which Amazon provides a special setup procedure. These also offer the potential to use well-known AWS features like autoscaling for Kubernetes workloads.
However, manually setting up this infrastructure is still quite a complex process with multiple steps. To be able to quickly get an EKS Kubernetes cluster up and running, and also to deploy a software project on it, we created a small helper project that offers the creation of a “turnkey ready” EKS cluster that can be quickly pulled up and also torn down after usage.
AWS offers a great service called “Amazon Elastic Container Service for Kubernetes” (AWS EKS).
The setup guide can be found here: Official AWS EKS getting started guide
If you overload such a cluster, it can easily happen that your Kubelet runs into “Out of Memory” (OOM) errors and stops working.
Once the Kubelet is down, kubectl get nodes shows that the node is in state “NotReady”.
In addition, if you describe your node with kubectl describe $NODE, you can see the status description “System OOM encountered”.
If you look at your pods with kubectl get pods --all-namespaces, you can see that pods are in state “Unknown” or “NodeLost”.
Kubelet OOM errors should be avoided at all costs.
They cause all pods on that node to stop, and in some cases it is quite complicated for K8s to maintain high availability for applications.
For example, for stateful sets with a single replica, K8s cannot immediately move that pod to another node.
The reason is that K8s does not know how long the node with all its pods will stay unavailable.
Therefore I would like to share some best practices to avoid OOM problems in your AWS EKS clusters.
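One best practice along these lines is to give every container explicit resource requests and limits, so the scheduler never over-commits a node’s memory; a minimal sketch (all names and numbers are illustrative, not from the original post):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: example              # hypothetical pod name
spec:
  containers:
  - name: app
    image: example/app:1.0   # hypothetical image
    resources:
      requests:
        memory: "256Mi"      # what the scheduler reserves on the node
        cpu: "250m"
      limits:
        memory: "512Mi"      # the container is OOM-killed above this, instead of starving the Kubelet
        cpu: "500m"
```

With memory limits in place, a misbehaving pod is killed by the kernel’s cgroup OOM handling instead of pushing the whole node - and with it the Kubelet - out of memory.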
In the first article of this series, Getting started with AWS Lambda, we used a CloudFormation template to provision and deploy all the parts needed for our REST application.
In this and the following articles, we are going to explore components used in the template. The focus of this article is the network infrastructure components.
Recently, I stumbled upon a situation where I wanted to add a couple of values to an OpenShift deployment configuration. Previously I had modified or added a single attribute in a YAML file with oc patch. So I started to wonder whether it is possible to update multiple attributes with oc patch as well. To get right to the result: yes, it is possible. This article will show you which features oc patch and likewise kubectl patch really have, besides the simple modification of one attribute.
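As a taste of what is possible, a single merge patch can already carry several attributes at once. A sketch (resource and field names are hypothetical), applied with e.g. oc patch dc/myapp --type merge -p "$(cat patch.json)":

```json
{
  "spec": {
    "replicas": 3,
    "paused": false
  }
}
```

Both attributes are updated in one API call; the same patch document works with kubectl patch against plain Kubernetes resources.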
After some time, let’s move on to another topic around making OpenShift environments more developer-friendly. This time we are going to look at what happens when a system test actually fails, and how to enable developers to react properly.
Kubernetes and OpenShift have a lot in common. Actually, OpenShift is more or less Kubernetes with some additions. But what exactly is the difference?
It’s not so easy to tell, as both products are moving targets. The delta changes with every release - be it of Kubernetes or OpenShift. I tried to find out and stumbled across a few blog posts here and there. But they were all based on not-so-recent versions - thus not really up to date.
So I took the effort to compare the most recent versions of Kubernetes and OpenShift - at the time of writing, v1.13 of Kubernetes and v3.11 of OpenShift. I plan to update this article as new versions become available.
The license change to Java SE 8, as well as the new license for Java SE 9 and onwards, led to confusion within the Java community. Looking for information on the web, one finds results in the spectrum from “Is Java in Jeopardy?” to “Java is still free!”. The good news is: yes, Java is still free. The bad news: not necessarily Oracle’s Java distribution.
In this article, we discuss the situation revolving around Oracle’s license change and its consequences. For this, we need to understand how the Oracle JDK is connected to OpenJDK. Furthermore, we take a look at some alternatives to Oracle’s Java distribution and how divergence between the different distributions is avoided.
What you will need:
Lambda is AWS’ realization of a serverless architecture. Instead of deploying instances and scaling them manually, developers deploy only their code and AWS executes it. Different triggers for code execution can be defined, e.g. when a new event is published to an AWS Kinesis stream or when a REST endpoint is accessed.
Since AWS takes care of Lambda execution, Lambdas automatically scale in and out according to current needs. Coupled with its “pay only for what you use” pricing and the fact that Lambda execution can scale to zero when no Lambda is running, AWS Lambda is an interesting technology.
The OpenShift command line interface is a very powerful tool which is quite useful for beginners and advanced users of OpenShift alike. Some of its features are not well documented, or not documented at all. In this article I would like to shed some light on commands that I personally find useful and that are, from my observation, not widely in use. So without further ado, let’s start with the commands:
Our world is full of various processes: tracking of goods delivery, currencies trading, monitoring of server resources, hotel bookings, selling goods or services, etc. Since these processes occur over time, they can be described by time series data.
Successful businesses always take advantage of their data by analyzing it and then making predictions (e.g. predicting the volume of sales for the next month) and business decisions (e.g. if the volume of sales grows, then additional goods need to be delivered to a warehouse).
There are a number of technologies for analysing time series data. This article gives an introduction to one of them, TimescaleDB - an open source solution for time series data analysis based on the battle-tested PostgreSQL DBMS.
OMD Labs Edition 2.80 has been released today. The OMD Labs Edition is based on the standard OMD but adds some more useful addons like Grafana and Prometheus, as well as additional cores like Icinga 2 and Naemon. This release updates many of the shipped components and adds some more useful features.
The Prometheus monitoring tool follows a white-box monitoring approach: Applications actively provide metrics about their internal state, and the Prometheus server pulls these metrics from the applications using HTTP.
If you can modify the application’s source code, it is straightforward to instrument an application with Prometheus metrics: Add the Prometheus client library as a dependency, call that library to maintain the metrics, and use the library to expose the metrics via HTTP.
However, DevOps teams do not always have the option to modify the source code of the applications they are running.
At this year’s JavaZone conference, Fabian Stäber did a talk on how to instrument Java Web Applications with Prometheus metrics without modifying the application’s source code.
As the number of microservice based architectures continues to grow, development teams are facing new challenges when choosing the adequate tools for the job. At the technical level, decisions need to be made considering the features of both the cloud or container platform that is going to be used for the deployment and the runtime that will be used by the software. The infrastructure needs to be aware of the health and metrics of the software, and the software itself must make the most of the infrastructure by tolerating failures and being able to handle configuration changes. There are numerous solutions for the individual challenges, but the lack of an enterprise-level blueprint actually paved the way for Eclipse MicroProfile.
Let’s move on with this little series about how OpenShift environments may fall short in terms of developer experience.
Today we focus on the role that system tests have in an OpenShift infrastructure and what might possibly go wrong here testdata-wise.
The new release also brings a bunch of enhancements and bug fixes; a detailed changelog is included in this post.
Once again, we want to say THANK YOU for the great support of our contributors, our valued supporting companies and of course ConSol!
In some OpenShift environments for building and delivering software, we notice that the needs of developers, arguably a group of people who will have a great deal of contact with the platform, are not met as thoroughly as would have been possible.
Especially when it comes to software testing, there is often much room for improvement. The usage of container platforms can improve testing techniques a lot but might also be a major blocker when it comes to the provided infrastructure. Good testing is already hard. Everything that makes it even harder by forcing your developers into workarounds or compromises on testing quality will result in larger round trips, more testing effort, less valid testing - in short: wasted time.
So in this mini series of blog posts we will have a look into some possible fields of improvement and give recommendations on how to fix the respective situation.
Today we evaluate the fact that some CI/CD setups for OpenShift may spoil the simplest type of testing a developer uses: just running the software locally - in OpenShift.
This report is about the experience I’ve made with Arch Linux as the operating system for a developer’s workstation. You’ll be introduced to the concepts of Arch Linux, followed by an introduction to the main tasks such as package installation and OS maintenance. At the end, I’ll discuss why I think Arch Linux is a great OS for developers, and finish with a conclusion.
Prometheus is a popular monitoring tool based on time series data. One of the strengths of Prometheus is its deep integration with Kubernetes. Kubernetes components provide Prometheus metrics out of the box, and Prometheus’s service discovery integrates well with dynamic deployments in Kubernetes.
There are multiple ways to set up Prometheus in a Kubernetes cluster. There’s an official Prometheus Docker image, so you could use that and create the Kubernetes YAML files from scratch (which, according to Joe Beda, is not totally crazy). There is also a Helm chart. And there is the Prometheus Operator, which is built on top of the CoreOS operator framework.
This blog post shows how to get the Prometheus Operator up and running in a Kubernetes cluster set up with kubeadm. We use Ansible to automate the deployment.
Kubeadm is a basic toolkit that helps you bootstrap a simple Kubernetes cluster. It is intended as a basis for higher-level deployment tools, like Ansible playbooks. A typical Kubernetes cluster set up with kubeadm consists of a single Kubernetes master, which is the machine coordinating the cluster, and multiple Kubernetes nodes, which are the machines running the actual workload.
Dealing with node failure is simple: when a node fails, the master will detect the failure and re-schedule the workload to other nodes. To get back to the desired number of nodes, you can simply create a new node and add it to the cluster. In order to add a new node to an existing cluster, you first create a token on the master with kubeadm token create, then you use that token on the new node to join the cluster with kubeadm join.
Dealing with master failure is more complicated. Good news is: Master failure is not as bad as it sounds. The cluster and all workloads will continue running with exactly the same configuration as before the failure. Applications running in the Kubernetes cluster will still be usable. However, it will not be possible to create new deployments or to recover from node failures without the master.
This post shows how to back up and restore a Kubernetes master in a kubeadm cluster.
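The heart of the backup is preserving /etc/kubernetes (certificates and static pod manifests), plus an etcd snapshot. The archiving step can be sketched like this - demonstrated against a mock directory so it can be tried anywhere (paths and file contents are illustrative):

```shell
# mock the directory layout of a kubeadm master
mkdir -p /tmp/mock-master/etc/kubernetes/pki
echo "fake-ca-certificate" > /tmp/mock-master/etc/kubernetes/pki/ca.crt

# archive everything under etc/kubernetes
# (on a real master you would run: tar -czf backup.tar.gz -C / etc/kubernetes
#  and additionally take an etcd snapshot)
tar -czf /tmp/k8s-master-backup.tar.gz -C /tmp/mock-master etc/kubernetes

# verify that the archive contains the CA certificate
tar -tzf /tmp/k8s-master-backup.tar.gz | grep 'pki/ca.crt'
```

Restoring is the reverse: unpack the archive onto the fresh master before running kubeadm, so the cluster keeps its original certificates and identity.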
This blog post shows how to use CIFS (a.k.a. SMB, Samba, Windows Share) network filesystems as Kubernetes volumes.
Docker containers running in Kubernetes have an ephemeral file system: Once a container is terminated, all files are gone. In order to store persistent data in Kubernetes, you need to mount a Persistent Volume into your container. Kubernetes has built-in support for network filesystems found in the most common cloud providers, like Amazon’s EBS, Microsoft’s Azure disk, etc. However, some cloud hosting services, like the Hetzner cloud, provide network storage using the CIFS (SMB, Samba, Windows Share) protocol, which is not natively supported in Kubernetes.
Fortunately, Kubernetes provides Flexvolume, which is a plugin mechanism enabling users to write their own drivers. There are a few flexvolume drivers for CIFS out there, but for different reasons none of them seemed to work for me. So I wrote my own, which can be found on github.com/fstab/cifs.
This blog post shows how to use the fstab/cifs plugin for mounting CIFS volumes in Kubernetes.
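With the driver installed, a pod can reference the CIFS share through a flexVolume entry along these lines (server path and secret name are placeholders; see the README at github.com/fstab/cifs for the authoritative format):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: cifs-example          # hypothetical pod name
spec:
  containers:
  - name: app
    image: busybox
    command: ["sleep", "3600"]
    volumeMounts:
    - name: share
      mountPath: /data        # the CIFS share appears here inside the container
  volumes:
  - name: share
    flexVolume:
      driver: "fstab/cifs"    # the driver installed on each node
      fsType: "cifs"
      secretRef:
        name: cifs-secret     # Secret holding the CIFS username/password
      options:
        networkPath: "//example-server/backup"
        mountOptions: "dir_mode=0755,file_mode=0644,noperm"
```

The credentials live in a regular Kubernetes Secret, so they never appear in the pod spec itself.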
ConSol CM brings BPM to a CRM system. In-house ConSol CM is used to process cases of a wide range of types. Amongst others it also maps the sales process. For that purpose a new sales case is created automatically or manually every time a sales opportunity or lead comes up. To these cases, information can be added concerning the communication with the customer, the origin of the opportunities and others.
Within a research and development project, the scope was to predict the success of open sales cases using machine learning algorithms. This way, sales employees would already know at an early stage whether an opportunity will most probably be successful, or how to adapt their strategy during the sales process to increase the chances of success.
Docker Headless VNC Container 1.3.0 has been released today. The different Docker images contain a complete VNC-based, headless UI environment for test automation (as Sakuli does) or simply for web browsing and temporary work in a throw-away UI container. The functionality is pretty close to a VM-based image, but a container can be started in seconds instead of minutes. Each Docker image therefore has the following components installed:
In this article I will show you how to use Apache FreeMarker to implement dynamic and complex configurations in Java projects that can be configured from outside the application.
Database communication is an essential part of many applications when persistent data storage is required. Be it orders, customer data, product recommendations or product information - if persistent storage is in place, the data carries a certain business value. Therefore it’s important that your software handles your persistent storage the right way.
In this blog post you’ll learn how to test your database communication using Citrus.
At this year’s FOSDEM conference I did a 30-minute presentation on Monitoring Legacy Java Applications with Prometheus. The talk gives an overview of some of the options you have for monitoring Java applications with Prometheus when you cannot modify the application’s source code:
The video is available below.
This blog article shows how the monitoring plugin check_nwc_health can be adapted or extended to your own needs.
Originally, only the logic of the ha-role mode was supposed to be modified, in order to merely report the status of cluster nodes instead of alerting on it. The result was a status display in the Thruk frontend based on host macros…
There are a lot of articles that show how to monitor an OpenShift cluster (including the monitoring of nodes and the underlying hardware) with Prometheus running in the same OpenShift cluster. This article, however, is based on a different scenario: you are responsible for an application on an OpenShift cluster and want to monitor just this application, but you don’t have any administrative permissions on the cluster. The reason for this can be that you are working in a big company where the operation of the OpenShift environment is outsourced, or the process to introduce a new monitoring solution takes way too long, or the current monitoring solution doesn’t match your requirements, and so on.
In this article I’m going to show you how to set up the monitoring of a demo application in 6 easy steps. The example is built in such a manner that it will be easy for you to do the same for your application. A side note: if the OpenShift cluster that you are using will be monitored with a different Prometheus setup in the future, you don’t need to start from scratch. You might need to tweak your scraping configuration a bit and move your dashboard to a different Grafana, but that should be it.
Imagine you’re working on a bigger feature in a complex piece of software. Your implementation is complete, all tests in scope have turned green and you push your changes for integration testing. Then some integration tests from a completely different module fail, and you have no clue which change may have caused this. Now you start analyzing the issue. Probing your commits by hand would surely end up being a very tedious process. Thankfully, git can do all the work for you while you enjoy a cup of coffee.
The high-level command git bisect allows you to automatically run a specified test procedure while it crawls through your commit history to find the bad revision.
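A self-contained way to try this out (throwaway repository in /tmp; the “bug” is just a marker string, and the grep is a stand-in for your real test suite):

```shell
# throwaway demo repository: commits 1-3 are good, commit 4 introduces the bug
rm -rf /tmp/bisect-demo
git init -q /tmp/bisect-demo
cd /tmp/bisect-demo
git config user.email demo@example.com
git config user.name demo
for i in 1 2 3 4 5; do
  echo ok > app.txt
  if [ "$i" -ge 4 ]; then echo bug >> app.txt; fi
  git add app.txt
  git commit -qm "commit $i"
done

# mark HEAD as bad and the revision four commits back as good
git bisect start HEAD HEAD~4

# the test command must exit 0 for good revisions and non-zero for bad ones
git bisect run sh -c '! grep -q bug app.txt'

# refs/bisect/bad now points at the first bad commit
first_bad=$(git show -s --format=%s refs/bisect/bad)
git bisect reset
echo "first bad: $first_bad"
```

git bisect performs a binary search, so even in a history of thousands of commits only a handful of test runs are needed.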
We also want to say a big THANK YOU for the great support of our contributors, our valued supporting companies and, not least, ConSol for making this possible as open source software. Double thumbs up!!!
The tutorial “Docker based E2E application monitoring with Xfce UI and OMD Labs” describes how to:
Sources: see github.com/ConSol/sakuli-examples
The Prometheus monitoring tool follows a white-box monitoring approach: Applications actively provide metrics about their internal state to the Prometheus server. In order to instrument an application with Prometheus metrics, you have to add a metrics library and call that library in the application’s source code. However, DevOps teams do not always have the option to modify the source code of the applications they are running.
At this year’s Devoxx conference, Fabian Stäber did a talk on how to instrument Java Web Applications with Prometheus metrics without modifying the application’s source code.
For unit testing purposes you can use mocks that help to simulate proper responses. There will be times when your software is deployed to a test environment in order to perform some acceptance tests with your stakeholders before going to a final release. Usually this is also done with the customer exploring the software through manual testing. In these situations traditional service mocking is not a good option and you need a real simulator instance that receives requests and responds with proper test data.
This is exactly what the Citrus simulator project provides for you. Standalone simulation and complex request/response processing with solid validation capabilities. The Citrus simulator provides a very easy and reliable definition of inbound and outbound messages for different scenarios.
The good news is that this works not only for HTTP REST interfaces but also for SOAP web services, JMS, RMI, mail messaging and many more. So you can use the simulator whenever you need to integrate with another service that is simply not available on your local machine or in your test environment.
Docker Headless VNC Container 1.2.0 has been released today. Each of the Docker images contains a complete VNC-based, headless UI environment for test automation (as done by Sakuli) or simply for web browsing and temporary work in a throw-away UI container. The functionality comes pretty close to a VM-based image, but a container starts in seconds instead of minutes. Each Docker image therefore has the following components installed:
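For reference, a typical way to start such a container looks like this (image name and port numbers as documented in the ConSol/docker-headless-vnc-container project; treat them as assumptions here):

```shell
# Start a throw-away Xfce desktop in the background; 5901 is the VNC port,
# 6901 serves the noVNC web client.
docker run -d -p 5901:5901 -p 6901:6901 consol/centos-xfce-vnc
# connect with any VNC viewer on port 5901, or point a browser at
# http://localhost:6901/ for the in-browser client
```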
]]>Which programming language should we use to write monitoring check plugins? This question raised some discussion, and this post tries to give some hints.
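For illustration, here is a minimal check plugin written in plain shell; the monitored filesystem and the thresholds are arbitrary examples, but the single output line with performance data and the exit codes follow the usual plugin API conventions (0 = OK, 1 = WARNING, 2 = CRITICAL):

```shell
# Minimal check plugin sketch: disk usage of / against warn/crit thresholds.
check_disk() {
  warn=$1; crit=$2
  # take the "Capacity" column of the POSIX df output and strip the %
  usage=$(df -P / | awk 'NR==2 { sub(/%/, "", $5); print $5 }')
  if [ "$usage" -ge "$crit" ]; then
    state=2; label=CRITICAL
  elif [ "$usage" -ge "$warn" ]; then
    state=1; label=WARNING
  else
    state=0; label=OK
  fi
  # one line of human-readable output, then perfdata after the pipe
  echo "$label - / is ${usage}% full | usage=${usage}%;${warn};${crit};0;100"
  return $state
}

if check_disk 95 98; then
  echo "plugin would exit 0 (OK)"
else
  echo "plugin would exit non-zero: $?"
fi
```

A real plugin would take the thresholds and the filesystem from command-line options, but the shape above is all a monitoring core needs.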
]]>I recently had to deal with two projects that have a common origin but separated at some point in time. I now had to try to bring them back together again - basically merging the changes. Sounds like a pretty standard git merge
or git rebase
job.
Unfortunately the separation was done in a not so clever way. Someone cloned the original repository, checked out some branch, made some first refactoring steps, got rid of the git stuff (probably rm -rf .git) and started a new git repository from this state. Rumor has it that the situation at the time was so tense that people wanted to make a clean cut - which they did, in a technical way.
Quite some time later it was my task to try to get the projects together again. The only input I had was two git URLs and the above story.
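Since git 2.9, merging two repositories that share no common commit has to be forced explicitly. A self-contained sketch of such a reunion (repository paths, branch names and file contents are made up for the demo):

```shell
set -e
work=$(mktemp -d)
# repository "a" plays the original, "b" the clone that lost its history
git init -q "$work/a"
cd "$work/a"
git config user.email a@example.com; git config user.name a
git checkout -qb trunk
echo one > a.txt; git add a.txt; git commit -qm "repo a"
git init -q "$work/b"
cd "$work/b"
git config user.email b@example.com; git config user.name b
git checkout -qb trunk
echo two > b.txt; git add b.txt; git commit -qm "repo b"
# fetch the original repository into "b" and merge its unrelated history
git remote add old "$work/a"
git fetch -q old trunk
git merge -q --no-edit --allow-unrelated-histories FETCH_HEAD
ls    # both a.txt and b.txt are now present in one history
```

In the real case the two histories contained diverged versions of the same files, so the merge produces conflicts instead of a clean union; the mechanism, however, is the same.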
]]>Have you ever written a NEB (Nagios Event Broker) module? This article explains a tool which makes this a lot easier, especially if you have shied away from it because you are not familiar with C or C++. In that case the “Go NEB Wrapper” can come in very handy, and if you are new to the topic it is a good place to start.
]]>OMD Labs Edition 2.60 has been released today. The OMD Labs Edition is based on the standard OMD but adds some more useful addons like Grafana and Prometheus, or additional cores like Icinga 2 and Naemon. This release updates many of the shipped components and adds some interesting options for resolving update conflicts.
]]>The GitHub repository toschneck/openshift-example-bakery-ci-pipeline contains the source code for the examples of the talk Continuous Testing: Integration- und UI-Testing mit OpenShift-Build-Pipelines at the Redhat/ConSol OpenShift-Day:
]]>At ConSol we use GitLab as our central Git server and I am quite happy with its functionality. Lately, I have been playing around with GitLab CI with the objective of finding out if we can use it instead of Jenkins, our current CI server of choice.
Since most of our projects use Maven, I was particularly interested in setting up a simple Maven build job.
To cut a long story short, yes, I would use GitLab CI in my next project. We’ll later see why, but first I want to give a quick walkthrough of GitLab CI.
]]>Users often complained about the complexity of having to learn all about Citrus and the Spring framework in particular, as Citrus uses Spring for configuration and dependency injection.
Especially non-developers had problems mastering the learning curve for Citrus and Spring when starting to use the framework. People also asked for a user interface for managing components and tests.
We heard you and introduced a new administration user interface for Citrus! There is a detailed Citrus Admin documentation (which is still ongoing).
However I would like to outline the main features of that web UI here in a short post for you.
The Prometheus monitoring tool follows a white-box monitoring approach: Applications actively provide metrics about their internal state to the Prometheus server. In order to instrument an application with Prometheus metrics, you have to add a metrics library and use that library in the application’s source code. However, DevOps teams do not always have the option to modify the source code of the applications they are running.
Promagent is a Java agent using bytecode manipulation to instrument Java web applications without modifying their source code. Promagent allows you to get white-box metrics for Java web applications even if these applications do not implement any metrics library out of the box.
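A startup sketch of the general approach, with the agent jar name, application jar and port as assumptions rather than verified parameters:

```shell
# Hypothetical invocation: the agent is attached at JVM startup via
# -javaagent, so the application itself stays untouched.
java -javaagent:promagent.jar=port=9300 -jar my-webapp.jar &
# the agent then exposes the collected metrics for Prometheus to scrape:
curl -s http://localhost:9300/metrics
```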
]]>OMD Labs Edition 2.40 for the Raspberry Pi has been released today. A month and a broken SD card (excessive use of /var/swap during the builds) after the release of the x86 version it is now possible to run a full-blown monitoring system on your ARM boards. It was tested on Raspberry 2 and Raspberry 3. If you want to run OMD on one of the older models, you might experience performance problems, especially when you enable InfluxDB and Grafana.
]]>OMD Labs Edition 2.40 has been released today. The OMD Labs Edition is based on the standard OMD but adds some more useful addons like Grafana and InfluxDB, or additional cores like Icinga 2 and Naemon. This release's focus is on security and maintenance, and it removes some recently discovered CVEs in Nagios, Icinga and Naemon.
]]>End-2-end testing and end-2-end monitoring both follow the same paradigm: they look at an application from the end user's perspective. It must not matter which UI technology the application is written in or in which way it interacts with the end user. This is exactly where the open-source tool Sakuli comes in.
]]>Typical Java backend applications need to integrate with existing 3rd party services. In most cases, calls to these 3rd party services are authenticated. Frequently, Java applications are required to use login credentials for authenticated calls: A username and a password.
This scenario raises a problem: How can we store the password needed for calling the 3rd party service? We could store it in a properties file, but then everyone with access to the properties file learns the password. We could provide the password as a command line parameter or environment variable, but then everyone with access to the startup script learns the password. We could hard-code it in our application, but then everyone with access to the JAR file learns the password. We could encrypt the password using a master key, but then we have the same problem again: How to store the master key?
The common solution is to use a secure data store provided by the operating system. Our application runs on Windows Server, so we use the Windows Data Protection API (DPAPI) for protecting our secret passwords. This blog post shows how to use the DPAPI in Java applications.
]]>DevoxxUS has been my first Devoxx outside of Europe so far. It was a totally different Devoxx experience compared to the six editions in Antwerp, Belgium, that I have been to in past years.
Different as it was, it has been a great conference! I would like to share some of my adventures and thoughts in this post.
As the number of hosts maintained in the Ansible inventory grows, the runtime of a playbook increases. Ansible does recognize which tasks need not be executed (e.g. because certain packages are already installed), but even this check costs time. Sooner or later you will therefore reach for the playbook parameter --limit|-l - and wonder why parts of the playbook suddenly no longer work. This blog post shows which problems you can run into, and how to avoid and solve them.
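A typical invocation of the parameter in question (playbook, inventory and group names are hypothetical):

```shell
# Only hosts matching the limit pattern are touched in this run.
ansible-playbook -i inventory site.yml --limit 'webservers'
# Caveat: facts of the excluded hosts are not gathered in such a run,
# which is a common reason why templates reading hostvars of other
# hosts suddenly fail.
```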
If you ever needed to request HTTP resources with Java, you probably came across several solutions put together from a surprising number of lines. And you probably ended up with using a third party library to achieve your goal in a reasonable manner.
Good news: besides Java 9 modules, the next JDK version comes with a brand new HTTP client, and it not only brings support for HTTP/2, but also a sleek and comprehensive API. Let’s have a closer look at the new features.
]]>Documentation certainly occupies one of the last places in the ranking of an administrator's favorite tasks. Apart from the popularity of the task, keeping the documentation up to date also becomes more and more laborious as the number of systems grows. A classic case for automation, then.
The goal of this blog post is to automatically generate a DokuWiki page for each system. In addition, each page should still offer the possibility to insert individual documentation.
]]>Prometheus is an open-source monitoring and alerting tool. It is built around a time-series database whose data can be accessed with a built-in, very powerful query language.
Prometheus follows the so-called “white-box monitoring” approach: applications either expose metrics natively, or an “exporter” makes application or device metrics queryable for Prometheus.
In this article I want to show how to gain insights into the performance of your home network and your internet connection using the fritzbox_exporter and the speedtest_exporter together with Grafana. The hardware basis for this project is a Raspberry Pi.
]]>The core of most ELK applications is the Logstash configuration. A user defines here which data is processed (inputs), how the data is processed (filters) and where it goes afterwards (outputs). Especially this configuration contains a lot of logic, which is unfortunately not easy to test. In this article I want to show you how to set up a testing environment for your Logstash configuration.
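As a first building block of such an environment, Logstash itself can at least validate the configuration syntax; the flag shown is the 5.x+ form, older versions use --configtest, and the config file name is hypothetical:

```shell
# Parse the pipeline configuration and exit without processing any events;
# a non-zero exit code means the configuration is syntactically broken.
bin/logstash -f pipeline.conf --config.test_and_exit
```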
]]>In about three weeks DevoxxUS will take place in San Jose, California on March 21-23. After having visited Devoxx Belgium six
consecutive times this will be my first Devoxx conference outside of Europe. Once again I am honored
to be a speaker at that conference! After my Devoxx BE talk in 2015 (Testing Microservices with a Citrus twist) this is my second time speaking
in front of Devoxxians from all around the world. Fantastic!
This time I am going to talk about behavior driven integration with Cucumber and Citrus.
]]>Have you ever wondered what kind of patterns .gitignore
allows? Was it **/*/target
, target/*
or *target*
? Read on and find out!
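One way to find out empirically is `git check-ignore`, which reports the `.gitignore` line that matches a given path; a throwaway demo (only git needed, paths made up):

```shell
set -e
repo=$(mktemp -d)
cd "$repo"
git init -q
printf '%s\n' '**/target/' > .gitignore
mkdir -p module/target
touch module/target/app.jar module/keep.txt
# -v shows file, line number and the pattern that matched:
git check-ignore -v module/target/app.jar
# exit code 1 means the path is NOT ignored:
git check-ignore module/keep.txt || echo "keep.txt is not ignored"
```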
So, 2017 has arrived - this is the year when Java 9 will finally be released, and with it the brand new module system called Jigsaw. In January, Mark Reinhold announced that JDK 9 is feature complete, so we have every reason to be optimistic that the final release will actually be ready in July. So it is about time to get acquainted with project Jigsaw, also known as Java 9 modules.
]]>Getting started with Kubernetes can be intimidating at first. Installing Kubernetes is not the easiest of tasks and can get quite frustrating.1 Luckily, there is an out-of-the-box distribution called Minikube which makes toying around with Kubernetes a breeze.
As mentioned on Twitter by Roland Huß (Red Hat developer and former ConSol employee), if you are on Linux you can try kubeadm for a light-weight installation. ↩
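A hypothetical first session might look like this (command availability depends on your Minikube and kubectl versions and on the VM driver installed on your system):

```shell
minikube start                      # boots a single-node cluster in a VM
kubectl get nodes                   # the node "minikube" should show up
kubectl run hello --image=nginx     # deploy something to play with
minikube dashboard                  # opens the Kubernetes dashboard
```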
Probably the best-written tests are those which can be understood by anyone who understands some English, right?
Hamcrest is an anagram of the word “Matchers” and a paradigm of encapsulating matching logic and the corresponding error messages in objects we can use and reuse in tests. They hide the matching implementation details and get self-explanatory names we can seamlessly integrate into our tests. And of course we are also able to write tests for our matchers!
Hamcrest itself isn’t only intended to be used in the context of tests. It’s available for: Java, Python, Ruby, Objective-C, PHP, Erlang, Swift.
]]>At this year’s FOSDEM conference I did a 20 minutes presentation on how to implement tail -f
in Go. The video is available below.
Abstract: As part of a log file monitoring tool, I implemented a file tailer that keeps reading new lines from log files. This turned out to be much more challenging than I thought, especially because it should run on multiple operating systems and be robust against logrotate. In this 20-minute talk I present the lessons learned, the pitfalls and dead ends I ran into.
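The logrotate pitfall from the abstract can be simulated in the shell with GNU tail: plain `tail -f` follows the open file descriptor, while `tail -F` re-opens the path after a rename, which is essentially the behavior a robust tailer has to implement itself. File names below are made up:

```shell
set -e
dir=$(mktemp -d)
cd "$dir"
: > app.log
tail -F app.log > captured.txt 2>/dev/null &
tailpid=$!
sleep 1
mv app.log app.log.1              # what logrotate would do
echo "after rotation" >> app.log  # written to a brand-new file
sleep 2                           # give tail -F time to re-open the path
kill "$tailpid"
cat captured.txt                  # contains "after rotation"
```

With `tail -f` instead of `tail -F`, captured.txt would stay empty, because tail would keep reading the renamed file.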
]]>Undertow is an open-source lightweight, flexible and performant Java server, they say. I can confirm that it’s
- lightweight: just have a look at the few lines of code needed to start a server, and the 1MB core JAR
- flexible: always feel free to provide your own implementations or use Undertow helpers to delegate usual server glue code to a more specific implementation you provide
I didn’t check or compare performance. It is the default server implementation of the Wildfly Application Server and is sponsored by JBoss.
]]>The full article can be found in Java aktuell 01-2017:
]]>OMD, the Open Monitoring Distribution, today forms the backbone of the monitoring of a wide variety of IT components and services in many companies. For beginners, OMD is a comprehensive starter package; for consultants, a solid platform for custom monitoring landscapes. Since its founding year 2010, OMD has been continuously improved; with the OMD Labs Edition, modern elements such as InfluxDB and Grafana were introduced in 2015. The topic of automation has meanwhile been addressed as well, with Ansible and Coshsh. The IT world's shift towards cloud-based services and short-lived containers poses a particular challenge. The talk shows how OMD will meet it in the future.
]]>Recently, two Nagios vulnerabilities were published, among others at heise.de. We use Nagios as one of several possible cores within the OMD monitoring framework. OMD installations are not at risk. The vulnerabilities in question are:
CVE-2016-9565 - Affects the Nagios web frontend. After login it displays an RSS feed from the vendor Nagios Enterprises, whose content can be manipulated in such a way that injected commands are executed in the context of the www-data/nagios user. However, the attacker has to impersonate www.nagios.org (via a DNS attack) or manipulate the data stream as a man-in-the-middle. Apart from the fact that nobody seriously uses the original Nagios web GUI anymore since far more modern interfaces such as Thruk appeared - the RSS functionality was disabled or patched out of OMD from the very beginning. It simply no longer exists, and therefore neither does the vulnerability.
CVE-2016-9566 - This exploit takes advantage of the fact that Nagios, if the process is started under the root account, first opens the log file /usr/local/nagios/var/nagios.log (or similar) with root privileges before dropping them via the setgid() system call (switching to the nagios user). An attacker with access to the monitoring server who is able to replace the log file with a symlink to system-critical files such as /etc/ld.so.preload can create the preconditions for manipulating them. He also has to make Nagios write malicious content into the file. One way would be to send a (suitably crafted) external command into the command pipe, which results in an entry in the log file (and thus in /etc/ld.so.preload). This form of attack is also impossible under OMD, since a Nagios process never runs with root privileges at any point in time. Monitoring with OMD takes place exclusively in the context of perfectly ordinary users.
Ergo: everything is OK and green.
]]>Accordingly, the two devrooms have combined CfPs, so that you can submit your container cloud talk in just one place. These devrooms are interested in talks about:
Submit Talk Proposals by November 26th on our CfP Page:
]]>Prometheus is an open source monitoring tool, which is conceptually based on Google’s internal Borgmon monitoring system. Unlike traditional tools like Nagios, Prometheus implements a white-box monitoring approach: Applications actively provide metrics, these metrics are stored in a time-series database, the time-series data is used as a source for generating alerts. Prometheus comes with a powerful query language allowing for statistical evaluation of metrics.
]]>Sakuli is already widely used for end-to-end tests with Linux and Windows applications. But what about Android, the most widespread mobile operating system? Here is an example.
]]>If you want to monitor a service that you do not maintain yourself, you usually lack the experience of how it should behave and what counts as “normal”. The following describes how to have (ir)regularities detected automatically.
]]>The JUG Saxony Day took place on 30.09.2016 at the Radisson Blu Park Hotel conference center in Dresden. The good and relaxed atmosphere, present from the very beginning, was as impressive as the selection of talks. In total there were over 30 talks in 5 parallel tracks, covering current trends in container technology, giving an overview of the latest testing concepts, and offering a preview of the upcoming JDK 9.
]]>The question often comes up whether the performance data collected by Nagios and similar systems can also be used for predictions, for instance of how the systems will develop over the next days and weeks. The following therefore presents how this can be achieved.
]]>Kiel, 24 degrees, 50 people on board. In unexpectedly beautiful summer weather, the eleventh workshop of the monitoring community took place at the Kiel University of Applied Sciences on September 7 and 8. The ConSol monitoring team contributed eight talks to the success of the event. A short summary:
With the very first talk after the welcome, “E2E monitoring with Sakuli”, Simon Meggle provided a worthy and technically demanding start to the event. The possibility of running Sakuli in Docker containers, and thus parallelizing end-to-end tests almost arbitrarily, provided plenty to talk about.
So that everyone can reproduce it at home, Simon then walked the participants through his tutorial “Sakuli tests in Docker containers” in a live demo on the second day.
]]>PromCon 2016 was the first conference around the Prometheus monitoring system. It took place on August 25-26, 2016 at Google Berlin as a single-track event with space for 80 attendees.
We took the opportunity and did a lightning talk introducing grok_exporter, which is a tool for extracting Prometheus metrics from application logs.
]]>Counting the number of error messages in log files and providing the counters to Prometheus is one of the main uses of grok_exporter, a tool that we introduced in the previous post.
The counters are collected by the Prometheus server, and are evaluated using Prometheus’ query language. The query results can be visualized in Grafana dashboards, and they are the basis for defining alerts.
We found that evaluating error counters in Prometheus has some unexpected pitfalls, especially because Prometheus’ increase() function is somewhat counterintuitive for that purpose. This post describes our lessons learned when using increase() for evaluating error counters in Prometheus.
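For example, a query like the following can be issued against a local Prometheus server (URL and metric name are hypothetical); because `increase()` extrapolates the counter delta to the full range window, the result for a counter of discrete error events is often not a whole number, which is one of the pitfalls in question:

```shell
# Ask the Prometheus HTTP API how much the error counter grew over
# the last 5 minutes; --data-urlencode handles the query escaping.
curl -s 'http://localhost:9090/api/v1/query' \
  --data-urlencode 'query=increase(errors_total[5m])'
```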
]]>On July 27, the summer meetup of the “Münchner Monitoring-Stammtisch” group took place at ConSol. This time the topic was “Ansible in the monitoring environment”.
Ansible is a framework typically used to configure servers after the base installation and provision them with selected software packages, or to roll out patches and other updates during operation. In a so-called Ansible playbook, only the desired state is described, and Ansible takes care of the actions required behind the scenes. In principle this has nothing to do with monitoring, but since we look beyond our own nose and do not install an island at our customers, but are part of a corporate IT with all kinds of interdependencies, Ansible has long been part of the ConSol monitoring team's toolbox. By the way, there is also a dedicated Ansible meetup group, which kindly announced our event on their page as well.
The shop talk over Augustiner and pizza was interrupted time and again by a talk, namely:
Prometheus is an open-source systems monitoring and alerting toolkit. At its core, Prometheus uses time-series data, and provides a powerful query language to analyze that data. Most Prometheus deployments integrate Grafana dashboards and an alert manager.
Prometheus is mainly intended for white box monitoring: Applications either provide Prometheus metrics natively, or they are instrumented with an exporter to make application-specific metrics available.
For some applications, parsing log files is the only way to acquire metrics. The grok_exporter is a generic Prometheus exporter extracting metrics from arbitrary unstructured log data.
This post shows how to use grok_exporter to extract metrics from log files and make them available to the Prometheus monitoring toolkit.
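An invocation sketch (flag name, config file and port per the project documentation, but treat them as assumptions here):

```shell
# grok_exporter reads a YAML config describing the log files and grok
# patterns, and serves the extracted metrics over HTTP; 9144 is the
# port registered for it in the Prometheus default port allocations.
./grok_exporter -config config.yml &
curl -s http://localhost:9144/metrics
```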
]]>$ check_mssql_health --hostname dbsrv1 --port 1433 \
--username sa --password 'Str3ng!g3heim' \
--mode create-monitoring-user \
--name NAGIOS --name2 'ES_Ku_el'
Instead of the user sa, any administrator account can be used. NAGIOS is created in every single database. If new databases are added, you simply repeat the create-monitoring-user command.
]]>$ check_wut_health --hostname dcenv2.de.xxxx --community public --mode sensor-status
OK - return air temperature Unit 1.1 is 21.40C, humidity Unit 1.1 is 49.40%, return air temperature Unit 2.1 is 22.40C, humidity Unit 2.1 is 46.80% | 'temp_Unit 1.1'=21.40;25;28;; 'hum_Unit 1.1'=49.40%;40:60;35:65;0;100 'temp_Unit 2.1'=22.40;25;28;; 'hum_Unit 2.1'=46.80%;40:60;35:65;0;100
Here we see the hard-coded default thresholds 25 and 28 for the temperature, and 40:60 and 35:65 for the humidity.
Until now there were two ways to change these, e.g. to 20 and 30 for the temperatures.
This post shows how to use Packer for automatically executing code snippets from Markdown files on a variety of platforms. Machine images are created directly from the code snippets in the documentation. That way, documentation is guaranteed to be up-to-date and complete, and it can be integrated in an automated delivery pipeline.
]]>If you haven’t read the first article, don’t worry. A quick summary of all the important bits follows shortly below. But before I get to that, let’s talk a little bit about automated integration testing and Citrus.
One of the biggest challenges when testing any application is being able to simulate all endpoints.
]]>A couple of years have passed since we last looked into in-memory caches here at ConSol. In that time a bunch of things have happened:
Probably the most significant thing that happened was that the oldest Java Specification Request, JSR 107, also known as JCache, finally reached ‘Release’ status. This JSR was a long time in the making, taking a whole 13 years since the initial proposal back in 2001.
GridGain’s In-Memory Data Fabric became an open source project and is now available as an Apache Foundation project known as Apache Ignite.
The existing in-memory cache providers, like Hazelcast, have received a whole host of new features, including support for distributed transactions, a new Map-Reduce API, and interceptors for executing business logic when cache entries change, to mention just a few.
Download and check it out sakuli-v0.9.2-installer.jar!
]]>A detailed talk by Philip Griesbacher, the author of Nagflux and Histou, will be given at this year's OSMC.
Activating the complete stack in an OMD site is very easy as of version omd-2.01.20151021-labs-edition from our testing repository. Experienced OMD users use the following commands; for OMD beginners there is a more detailed, illustrated guide further below.
omd config set PNP4NAGIOS off
omd config set GRAFANA on
omd config set INFLUXDB on
omd config set NAGFLUX on
At https://www.it-bei-lidl.com/ you will find a job posting for the area of business process monitoring. I have come to know the technical and human environment at Lidl and can only recommend applying there. What awaits you is a tip-top managed IT landscape covering just about every modern technology. And of course monitoring made by ConSol.
And now for the advertisement…
]]>$ check_nwc_health --hostname 10.37.6.2 --community kaas \
--mode interface-health --name FastEthernet0/0
OK - FastEthernet0/0 is up/up, interface FastEthernet0/0 usage is in:0.01% (12041.88Bits/s) out:0.00% (1435.76Bits/s), interface FastEthernet0/0 errors in:0.00/s out:0.00/s , interface FastEthernet0/0 discards in:0.00/s out:0.00/s | 'FastEthernet0/0_usage_in'=0.01%;80;90;0;100 'FastEthernet0/0_usage_out'=0.00%;80;90;0;100 'FastEthernet0/0_traffic_in'=12041.88;80000000;90000000;0;100000000 'FastEthernet0/0_traffic_out'=1435.76;80000000;90000000;0;100000000 'FastEthernet0/0_errors_in'=0;1;10;; 'FastEthernet0/0_errors_out'=0;1;10;; 'FastEthernet0/0_discards_in'=0;1;10;; 'FastEthernet0/0_discards_out'=0;1;10;;
With check_mailbox_health, even not-quite-trivial, mail-based business processes can be monitored this way.
]]>define command {
command_name check_mssql_health
command_line $USER1$/check_mssql_health --hostname $ARG1$ --username '$ARG2$' --password '$ARG3$' ...
}
wraps all the garbage in single quotes, but what if the password itself contains a single quote?
Current Status:      WARNING (for 0d 0h 6m 3s)
Status Information:  sh: -c: line 0: unexpected EOF while looking for matching `''
                     sh: -c: line 1: syntax error: unexpected end of file
To prevent this from happening, and to spare the Icinga configuration files from special and garbage characters of all kinds, the plugins of the check_*_health family as well as check_hpasm can be supplied with encoded passwords since their latest releases. You then only ever handle [A-Za-z0-9].
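The underlying problem and the idea behind the encoding can be demonstrated in the shell; the hex encoding below is only an illustration of the principle, the actual encoding scheme is plugin-specific:

```shell
# Reproducing the problem: a single quote inside a single-quoted argument
# makes the shell search for a closing quote that never comes.
if sh -c "echo 'pass'word'" 2>/dev/null; then
  echo "unexpectedly succeeded"
else
  echo "syntax error, as expected"
fi
# The idea of encoded passwords: only [A-Za-z0-9] ever reaches the command
# line. A hex encoding, for example, maps every byte to two hex digits:
printf '%s' "pass'word" | od -An -tx1 | tr -d ' \n'; echo
```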
]]>A new episode of the ConSol Monitoring Minutes went online on YouTube today. Matthias Gallinger explains how to set up a Gearman worker in a DMZ without making yourself unpopular with the firewall admin.
]]>Additionally, I introduced my personal workflow for working on small-scale features, let’s say the size of one commit to the main line. Some of my colleagues found this workflow particularly interesting, so I’d like to share it here and discuss its benefits and drawbacks.
]]>wget -qO- https://get.docker.com/ | sudo sh
So in a series of posts I would like to talk about integration testing strategies for Apache Camel projects using the Citrus integration test framework.
]]>The talks and the catering were very well organized. The only drawback was that the WLAN wasn’t working most of the time.
Now let’s go through the talks:
]]>However, this time it is different: At Google’s booth at the exhibition area we got their latest Cardboard gadget. Cardboard is a virtual reality viewer for Android phones and it is absolutely the greatest thing I have ever seen on a phone. The Cardboard app comes with a lot of fancy demos like a virtual reality tour through Versailles, flying around in Google earth and even a short animated 360° movie.
For me Devoxx did not stop when I left the venue this afternoon. Devoxx continued at home when I opened that Cardboard give-away. Infinite possibilities, the motto of this year’s Devoxx, couldn’t fit better. I definitely need to check it out and learn more about it.
Thank you very much for that, Google! (fabian)
See you next year at Devoxx. But before that, let’s have a look at the last day and a very inspiring talk on Android Wear:
]]>Now let’s go through the talks of day 4 in Antwerp.
]]>Devoxx ignite sessions are a great thing: each speaker has 20 slides in 5 minutes, the slides are auto-forwarding, so each slide is up for 15 seconds. During the hour-long ignite session you hear 8 talks. Today, we learned how to make money, ride a mountain bike, do performance tuning, save the planet, be a diabolical developer, share a house, do open source, decode the airspace, and why Stephen Chin’s job sucks.
The format reminds me a bit of TED talks: quick, innovative, and engaging. Sometimes I even felt that the take-away of a five-minute talk is not necessarily less than that of a three-hour university session.
]]>Stephan had some great announcements in his keynote. One of them was welcoming Devoxx Poland as a new family member in Krakow next year, which is indeed great news for our ConSol colleagues in Poland.
]]>So here is the wrap up of Devoxx Day 2:
]]>So be prepared to receive some on site summaries of what we have seen and what inspired us here.
]]># CentOS 7 64bit
rpm -i http://download.fedoraproject.org/pub/epel/7/x86_64/epel-release-7-2.noarch.rpm
# CentOS 6 32bit
rpm -i http://download.fedoraproject.org/pub/epel/6/i386/epel-release-6-8.noarch.rpm
# CentOS 6 64bit
rpm -i http://download.fedoraproject.org/pub/epel/6/x86_64/epel-release-6-8.noarch.rpm
# CentOS 5 32bit
rpm -i http://download.fedoraproject.org/pub/epel/5/i386/epel-release-5-4.noarch.rpm
# CentOS 5 64bit
rpm -i http://download.fedoraproject.org/pub/epel/5/x86_64/epel-release-5-4.noarch.rpm
]]>which we have teamed up under the name “Sakuli” via their common API and published on GitHub.
]]>Update 23.6.14: my patch has meanwhile been merged into the check_mk Git.
Update 26.6.14: into the Netways Git as well.
Update: The two criticized plugins are fixed (as of 26.6.14), so my grumbling no longer has any basis. All is well :-)
]]>
In Part I of this tutorial I introduced the basic concepts and benefits of Citrus as a test driver for ESB projects in general and webMethods in particular. In this second part I want to discuss some Citrus project setup options and provide a quickstart template project for Ant users.
]]>With the new configuration components we give credit to all the users who continuously gave us feedback on the Citrus configuration. With 1.4 our primary goal was to simplify the configuration without losing the great extensibility and customization capabilities of Citrus.
If you are coming from Citrus 1.x we have summarized the configuration changes in this migration sheet.
The old Citrus configuration components are marked as deprecated, so you can continue to use them when upgrading to 1.4 without any changes. However, you should consider upgrading to the new endpoint configuration in order to be ready for the upcoming versions.
Also have a look at the new config sheet to see how the new configuration works for you.
]]>Continuous integration is almost mainstream nowadays. Probably no one wants to argue against the value of having an all-embracing integration test suite in place, which is lightweight enough to be executed on each code change. In this blog series I want to show the interplay between Citrus, the integration test framework written and maintained by ConSol and a commonly used Enterprise Service Bus, the webMethods Integration Server.
]]>The publisher Packt Publishing approached me and asked me to write a review of the newly published book Icinga Network Monitoring by Viranch Mehta.
I actually didn't have the time, but when someone comes at me with "Keeping in mind your knowledge in this subject and having looked at your contributions, I feel you'd make an excellent reviewer of this book.", I naturally go weak.
The book is aimed at readers who have had no prior contact with Icinga (or Nagios, Naemon, or Shinken). Linux knowledge is nevertheless assumed. The author's goal was to create a reproducible (in the sense of: immediately applicable at the computer) and as complete as possible guide with which an Icinga novice (with a bit of brainpower the steps can also be applied to the aforementioned siblings of Icinga) can set up basic monitoring for their IT landscape in a short time.
]]>$line =~ /Fatal: error (\d+) occured/;
$errorcode = $1;
With check_logfiles this can be used to extract the relevant substrings from matching lines and thereby shorten the plugin's output.
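The capture-group mechanics can be tried out from the command line; the following is a generic illustration of the Perl capture shown above (not a check_logfiles configuration — the log line is made up, and the misspelled "occured" is kept to match the original pattern):

```shell
# extract the numeric error code from a matching log line via a capture group
line='Fatal: error 1234 occured'
code=$(printf '%s\n' "$line" | perl -ne 'print $1 if /Fatal: error (\d+) occured/')
echo "errorcode=$code"
```

The plugin output then only needs to carry the extracted code instead of the whole matched line.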
]]>In this blog post we’d like to share the projects we came up with:
There seems to be a lot of interest in building Raspberry Pi clusters for demo projects.
One of the teams took the chance and built our own, with five Pis running an Infinispan distributed cache.
It turns out that having a real hardware cluster yields different results than testing Infinispan locally.
While clean shutdowns and startups are no problem, unplugging and re-plugging network cables is a much greater challenge for the Infinispan infrastructure.
While the Raspberry Pi hardware is sufficient to run embedded Infinispan instances, the JBoss-based distributions don't seem to fit the hardware well.
The Raspberry Pi and a large screen is all that is needed for building an information kiosk.
One of the teams built a kiosk for our entrance hall, showing the current event schedule for our meeting rooms.
Access to the event database was implemented as a Spring application, on the front-end side
HTML5 and JavaScript magic was used to visualize the data.
Ceylon 1.0.0 was released recently, and one of the teams took the chance to make some first experiences with the new programming language.
Ceylon runs on the JVM, and can also be compiled to JavaScript. It comes with an Eclipse-based IDE, which is, however, not very easy to run.
The strong type system enables a lot of tool support, but sometimes also results in errors that are hard to understand for the novice.
The ConSol academy is a company event where employees share their knowledge with their peers. One team used the FedEx day to build prototype hardware for recording academy talks on video, to archive the talks for colleagues who cannot participate.
Like most other projects, the video recording hardware was also based on the Raspberry Pi.
The Pi was equipped with a small camera and a microphone and streamed the data over the network for recording.
The Raspberry Pi is currently the most popular thing
among our developers. It is easy to set up, and provides an open platform for a wide range of projects.
The FedEx day was a great opportunity to experiment with that, and it is also a good way to get together with colleagues who work in other projects.
This version contains lots of updated packages including Thruk 1.80, Mod-Gearman 1.4.14, NagVis 1.8, check_mk 1.2.2p3 and many more.
Using the OMD repository, installation is as simple as apt-get install omd. On an rpm-based system, it's just as simple: yum install omd or zypper install omd.
]]>Slides can be found here: http://rawgithub.com/ConSol/reveal.js/2013-jbossOneDayTalk/index.html.
See you tomorrow!
]]>root@raspberrypi:~# apt-get install omd-1.00
The customer machines we work on day in, day out, running monitoring systems, usually have CPUs and gigabytes in the double digits. So it becomes a test of patience when a build on the Raspberry Pi takes half a day. An ARM11 is simply no Xeon, and SD is not SSD.
]]>Some check_nwc_health users will already know --mode walk. It outputs a list of snmpwalk commands whose results help me with debugging.
Dr. Fabian Stäber gave a talk at JayDay 2013 in which he introduced and compared the leading distributed cache implementations:
Based on a simple example application, the basic functionality is presented, and the specific strengths and weaknesses of the different cache architectures are highlighted and compared.
The results of this ‘shootout’ and an executive summary can be found here at /java-caches and the example application is available from GitHub.
]]>This version contains lots of updated packages including Nagios 3.5.0, Shinken 1.4, Multisite 1.2.2p2, Thruk 1.72, PNP4Nagios 0.6.21, NagVis 1.7.1, check_mk 1.2.2p2 and many more.
Using the OMD repository, installation is as simple as apt-get install omd. On an rpm-based system, it's just as simple: yum install omd or zypper install omd.
For those who weren't using OMD yet, now there is no more reason to hesitate.
]]>The fifth episode of the ConSol Monitoring Minutes, which deals with this topic, was also produced today.
]]>$USER1$/negate --warning=CRITICAL $USER1$/check_vmware_api.pl ....
Unfortunately, something else threw a wrench in my plans: check_vmware_api.pl writes a warning to STDERR:
Subroutine IO::Socket::INET6::sockaddr_in6 redefined at /omd/sites/sagichnicht/lib/perl5/lib/perl5/Exporter.pm line 66. at /usr/lib/perl5/vendor_perl/5.10.0/Socket/INET6.pm line 21
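One generic way to keep such STDERR noise away from wrappers like negate is to redirect it. This is a sketch of the principle (with a simulated plugin), not necessarily the fix used in this case:

```shell
# simulate a plugin that prints a warning on STDERR and its result on STDOUT
noisy_plugin() {
  echo "Subroutine IO::Socket::INET6::sockaddr_in6 redefined ..." >&2
  echo "OK - vmware api responding"
}

# redirecting STDERR discards the warning; STDOUT and the exit code
# pass through unchanged, so a wrapper only sees the real plugin output
out=$(noisy_plugin 2>/dev/null)
echo "$out"
```

In a Nagios command definition the same redirection can be appended to the plugin call before handing it to negate.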
]]>
check_jmx4perl
.
]]>Just released: the second episode of the ConSol Monitoring Minutes. True to our motto "from practice, for practice", we show live how the checks of a large Nagios installation (here: an OMD site) can be delegated to workers with Mod-Gearman.
In the "Swiss Army knife" OMD (available via the ConSol repository), Mod-Gearman is already integrated.
]]>
Blue Coat ProxyNG appliances needed to be monitored, specifically the SG600 model. These appliances are used in Application Delivery Networks (ADN), where they ensure the performant delivery of business applications and protection against web-based threats.
And now on to the monitoring…
On the occasion of the new video series "ConSol Monitoring Minutes", I thought about how to read the view count of a YouTube video with a Nagios plugin and record it with PNP4Nagios. A dedicated plugin would have to download the information, parse the metrics out of the result, print them, and not least react somehow to download errors. With check_logfiles, a small configuration file, and the YouTube API, this is no problem at all.
]]>The first episode of the ConSol "Monitoring Minutes" has just been published on YouTube;
in it we give an overview of the structure and workings of OMD and finally show how OMD can be installed and updated in just a few steps via the ConSol repositories.
Here is the first episode: OMD im Überblick - ConSol Monitoring Minutes
Monday and Tuesday are traditionally the days for the University talks
with in-depth coverage of certain topics.
check_jmx4perl
in particular. Roland Huß, as speaker and author of jmx4perl, and Gerhard Laußer, as moderator, spend 45 minutes explaining how best to get Nagios working with JEE servers:
In normal installations there is an rc script in /etc/init.d/thruk which fakes a request and makes the fastcgi server start.
root@mo:~ #> /etc/init.d/thruk start
Starting thruk.........(10492) OK
In OMD it's even easier: the latest snapshots have so-called 'init-hooks' which are executed after the rc script. You
need to create two files in your site:
One of them can be a symlink, because both files will have the same content:
#!/bin/sh
# check return code of apache start
if [ $4 = 0 ]; then
  ./etc/init.d/thruk start
fi
So whenever your Apache starts or reloads, for example after logfile rotation, Thruk will immediately start too.
]]>From the experience of more than a dozen individual workshops, we have
distilled an intensive course that leaves no questions open regarding the Nagios integration of JEE application servers.
In this training, administrators learn how to get the most out of Jmx4Perl.
Besides theoretical foundations, great emphasis is placed
on practical exercises.
Further details on the content and online registration can be found
at
http://www.consol.de/allgemein/schulung-java-monitoring-mit-nagios/
We are also happy to answer questions about the course here in the comments
or in the forum.
This is how it looks:
13:46:50 sven@tsui:~/projects/Thruk (master) %>
All you need is a simple function in your .bashrc
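A minimal sketch of such a .bashrc helper — the function name and prompt layout are illustrative, not necessarily the exact ones from the post:

```shell
# print the current git branch in parentheses, or nothing outside a repo
parse_git_branch() {
  branch=$(git symbolic-ref --short HEAD 2>/dev/null) || return 0
  printf ' (%s)' "$branch"
}

# embed it into the prompt: time, user@host, working directory, branch
PS1='\t \u@\h:\w$(parse_git_branch) %> '
```

Because PS1 is defined with single quotes, the command substitution is re-evaluated each time the prompt is drawn, so the branch display follows every cd and checkout.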
]]>check_by_ssh --host 10.177.3.39 --logname nagios \
--command "lib/nagios/plugins/check_swap -w 15% -c 8%"
The drawback of this method is extra load on the Nagios server. With every check, an ssh process is forked which has to do a complete handshake with the remote side. With newer ssh implementations it is possible to have a persistent connection which requires only one handshake at startup. All the following ssh connects use the already established connection, which saves a lot of CPU cycles.
Here are the instructions to combine check_by_ssh with such a persistent tunnel.
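The core of such a setup is OpenSSH's connection multiplexing. As a sketch of the relevant ~/.ssh/config entries for the nagios user (the host matches the example above; the socket path and persist time are example values, and ControlPersist requires a reasonably recent OpenSSH):

```
Host 10.177.3.39
    ControlMaster auto
    ControlPath /tmp/ssh_mux_%h_%p_%r
    ControlPersist 1h
```

With this in place, the first check_by_ssh call establishes the master connection and all subsequent checks piggyback on the existing socket.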
That's where recurring downtimes come in handy, and the latest Thruk version includes this new feature.
]]>The addresses of the livestatus backends have to be written into a config file, thruk_local.cfg. Now what if my list of 13 sites were constantly changing? What if new OMD sites were created and others deleted on a daily basis? I would have to edit the config file every time. With the new init-hook feature, OMD will do this automatically for me.
]]>I often get asked if there are any benchmarks for Thruk, so I finally decided to do some tests.
Event: 26 Added: 03/08/2011 21:01
CAUTION: POST Messages - POST Error: 207-Memory initialization error on Processor 1 DIMM 6. The operating system may not have access to all of the memory installed in the system..
check_oracle_health --username <user> --password <pass> --connect <sid>
Of course, the prerequisite is that the SID is present in a directory service or in a tnsnames.ora file.
]]>Citrus 1.2.M2 now works with Spring 3.0, Spring Integration 2.0 and Spring WS 2.0. In addition to that we have some bugfixes and improvements in this release. Check out the reference documentation for the complete changes list on what’s new.
]]>While Jolokia got some minor enhancements, Jmx4Perl now finally got rid of any Java code, relying now completely on a Jolokia agent.
]]>JMX::Jmx4Perl
is examined.
The issue can be ordered online for €6 including shipping. I am happy to answer questions about the article, or jmx4perl in general, here in the comments.
]]><Include href="..." xmlns="http://www.w3.org/2004/08/xop/include"/>
. Validation is quite easy when you’re still mock testing your application because you have full control over what your mock response will look like.
]]>check_jmx4perl
has support for configuration files. JMX Nagios checks are now considerably simpler to configure and multi checks add even more performance and flexibility.
]]>check_jmx4perl
, a new Java client library and the start of a readline based JMX shell j4psh
with syntax highlighting and command line completion.
]]>Now the admins asked whether it would be possible to report all messages (winwarncrit) to Nagios as Warning at first, and then gradually classify specific messages as Critical or discard them entirely (exclude).
Can that be done?
$ check_nagios_external_commands -t 120 -p /usr/local/nagios/var/rw/nagios.cmd \
-l /usr/local/nagios/var/nagios.log
WARNING - command took 23s|command_write=0.85s command_read=22s
Download: ndo2db
]]>But wait, there is more … ;-)
]]>
The latest version now supports multi-threaded performance tests. We recently tested a SOAP WebService regarding performance using Citrus. I will try to add a new post describing how to accomplish performance testing with Citrus as soon as possible.
Download the latest snapshot version of Citrus: Download
New features in first 1.1-SNAPSHOT release:
Testing the latest snapshot version and giving us feedback is now very important to us. Therefore we hope you can switch to the latest snapshot versions. There are still more features to come in version 1.1, so stay tuned, for instance by following Citrus on Twitter (http://twitter.com/citrusframework), where all announcements will reach you right on time.
]]>Add the repos to your project POM. Here’s an example for the release repository:
<repository>
<id>consol-labs-release</id>
<url>http://labs.consol.de/maven/repository/</url>
<snapshots>
<enabled>false</enabled>
</snapshots>
<releases>
<enabled>true</enabled>
</releases>
</repository>
<repository>
<id>consol-labs-snapshots</id>
<url>http://labs.consol.de/maven/snapshots-repository/</url>
<snapshots>
<enabled>true</enabled> <!-- Policy: always, daily, interval:xxx (xxx=#minutes, 60*24*7=10080), never -->
<updatePolicy>interval:10080</updatePolicy>
</snapshots>
<releases>
<enabled>false</enabled>
</releases>
</repository>
Simply add this profile to your project, and activate it when deploying:
<profile>
<id>dist-labs</id>
<distributionManagement>
<repository>
<id>consol-labs-release</id>
<url>scpexe://labs.consol.de/home/maven-repository/www/htdocs/repository</url>
</repository>
<snapshotRepository>
<id>consol-labs-snapshots</id>
<url>scpexe://labs.consol.de/home/maven-repository/www/htdocs/snapshots-repository</url>
</snapshotRepository>
</distributionManagement>
</profile>
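Note that the scpexe:// protocol is not built into Maven's core; a deployment like this typically also needs the external SSH wagon registered as a build extension in the POM (a sketch — the version number is an example and should be checked against current releases):

```xml
<build>
  <extensions>
    <extension>
      <groupId>org.apache.maven.wagon</groupId>
      <artifactId>wagon-ssh-external</artifactId>
      <version>1.0</version>
    </extension>
  </extensions>
</build>
```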
Additionally, you’ll have to modify your $HOME/.m2/settings.xml and configure the user for SSH deployment:
<server>
<id>consol-labs-release</id>
<username>maven-repository</username>
</server>
<server>
<id>consol-labs-snapshots</id>
<username>maven-repository</username>
</server>
Now you can simply deploy using Maven:
mvn clean install deploy -Pdist-labs
Note: We only support SSH transport for now, using SSH authorized keys.
]]>