Bytedeco makes native libraries available to the Java platform by offering ready-to-use bindings generated with the codeveloped JavaCPP technology. This, we hope, is the missing bridge between Java and C/C++, bringing compute-intensive science, multimedia, computer vision, deep learning, etc to the Java platform.
Core Technologies
- JavaCPP [API] – A tool that can not only generate JNI code but also build native wrapper library files from an appropriate interface file written entirely in Java. It can also parse automatically C/C++ header files to produce the required Java interface files.
Prebuilt Java Bindings to C/C++ Libraries
These are part of a project that we call the JavaCPP Presets. Many coexist in the same GitHub repository, and all use JavaCPP to wrap predefined C/C++ libraries from open-source land. The bindings expose almost all of the relevant APIs and make them available in a portable and user-friendly fashion to any Java virtual machine (including Android), as if they were like any other normal Java libraries. We have presets for the following C/C++ libraries:- OpenCV – [sample usage] [API] – More than 2500 optimized computer vision and machine learning algorithms
- FFmpeg – [sample usage] [API] – A complete, cross-platform solution to record, convert and stream audio and video
- FlyCapture – [sample usage] [API] – Image acquisition and camera control software from PGR
- Spinnaker – [sample usage] [API] – Image acquisition and camera control software from FLIR
- libdc1394 – [sample usage] [API] – A high-level API for DCAM/IIDC cameras
- OpenKinect – [sample usage] [API] [API 2] – Open source library to use Kinect for Xbox and for Windows sensors
- librealsense – [sample usage] [API] [API 2] – Cross-platform library for Intel RealSense depth and tracking cameras
- videoInput – [sample usage] [API] – A free Windows video capture library
- ARToolKitPlus – [sample usage] [API] – Marker-based augmented reality tracking library
- Chilitags – [sample usage] [API] – Robust fiducial markers for augmented reality and robotics
- flandmark – [sample usage] [API] – Open-source implementation of facial landmark detector
- Arrow – [sample usage] [API] – A cross-language development platform for in-memory data
- HDF5 – [sample usage] [API] – Makes possible the management of extremely large and complex data collections
- Hyperscan – [sample usage] [API] – High-performance regular expression matching library
- LZ4 – [sample usage] [API] – Extremely fast compression algorithm
- MKL – [sample usage] [API] – The fastest and most-used math library for Intel-based systems
- oneDNN – [sample usage] [API] [API 2] – Intel Math Kernel Library for Deep Neural Networks (DNNL)
- OpenBLAS – [sample usage] [API] – An optimized BLAS library based on GotoBLAS2 1.13 BSD version, plus LAPACK
- ARPACK-NG – [sample usage] [API] – Collection of subroutines designed to solve large scale eigenvalue problems
- CMINPACK – [sample usage] [API] – For solving nonlinear equations and nonlinear least squares problems
- FFTW – [sample usage] [API] – Fast computing of the discrete Fourier transform (DFT) in one or more dimensions
- GSL – [sample usage] [API] – The GNU Scientific Library, a numerical library for C and C++ programmers
- CPython – [sample usage] [API] – The standard runtime of the Python programming language
- NumPy – [sample usage] [API] – Base N-dimensional array package
- SciPy – [sample usage] [API] – Fundamental library for scientific computing
- Gym – [sample usage] [API] – A toolkit for developing and comparing reinforcement learning algorithms
- LLVM – [sample usage] [API] – A collection of modular and reusable compiler and toolchain technologies
- libffi – [sample usage] [API] – A portable foreign-function interface library
- libpostal – [sample usage] [API] – For parsing/normalizing street addresses around the world
- LibRaw – [sample usage] [API] – A simple and unified interface for RAW files generated by digital photo cameras
- Leptonica – [sample usage] [API] – Software useful for image processing and image analysis applications
- Tesseract – [sample usage] [API] – Probably the most accurate open source OCR engine available
- Caffe – [sample usage] [API] – A fast open framework for deep learning
- OpenPose – [sample usage] [API] – Real-time multi-person keypoint detection for body, face, hands, and foot estimation
- CUDA – [sample usage] [API] – Arguably the most popular parallel computing platform for GPUs
- NVIDIA Video Codec SDK – [sample usage] [API] – An API for hardware accelerated video encode and decode
- OpenCL – [sample usage] [API] – Open standard for parallel programming of heterogeneous systems
- MXNet – [sample usage] [API] – Flexible and efficient library for deep learning
- PyTorch – [sample usage] [API] – Tensors and dynamic neural networks with strong GPU acceleration
- SentencePiece – [sample usage] [API] – Unsupervised text tokenizer for neural-network-based text generation
- TensorFlow – [sample usage] [API] – Computation using data flow graphs for scalable machine learning
- TensorFlow Lite – [sample usage] [API] – An open source deep learning framework for on-device inference
- TensorRT – [sample usage] [API] – High-performance deep learning inference optimizer and runtime
- Triton Inference Server – [sample usage] [API] – An optimized cloud and edge inferencing solution
- ALE – [sample usage] [API] – The Arcade Learning Environment to develop AI agents for Atari 2600 games
- DepthAI – [sample usage] [API] – An embedded spatial AI platform built around Intel Myriad X
- ONNX – [sample usage] [API] – Open Neural Network Exchange, an open source format for AI models
- nGraph – [sample usage] [API] – An open source C++ library, compiler, and runtime for deep learning frameworks
- ONNX Runtime – [sample usage] [API] – Cross-platform, high performance scoring engine for ML models
- TVM – [sample usage] [API] – An end to end machine learning compiler framework for CPUs, GPUs and accelerators
- Bullet Physics SDK – [sample usage] [API] – Real-time collision detection and multi-physics simulation
- LiquidFun – [sample usage] [API] – 2D physics engine for games
- Qt – [sample usage] [API] – A cross-platform framework that is usually used as a graphical toolkit
- Skia – [sample usage] [API] – A complete 2D graphic library for drawing text, geometries, and images
- cpu_features – [sample usage] [API] – A cross platform C99 library to get cpu features at runtime
- ModSecurity – [sample usage] [API] – A cross platform web application firewall (WAF) engine for Apache, IIS and Nginx
- Systems – [sample usage] [API] – To call native functions of operating systems (glibc, XNU libc, Win32, etc)
- Add here your favorite C/C++ library, for example: Caffe2, OpenNI, OpenMesh, PCL, etc. Read about how to do that.
We will add more to this list as they are made, including those from outside the bytedeco/javacpp-presets repository.
Projects Leveraging the Presets Bindings
- JavaCV [API] – Library based on the JavaCPP Presets that depends on commonly used native libraries in the field of computer vision to facilitate the development of those applications on the Java platform. It provides easy-to-use interfaces to grab frames from cameras and audio/video streams, process them, and record them back on disk or send them over the network.
- JavaCV Examples – Collection of examples originally written in C++ for the book entitled OpenCV 2 Computer Vision Application Programming Cookbook by Robert Laganière, but ported to JavaCV and written in Scala.
- ProCamCalib – Sample JavaCV application that can perform geometric and photometric calibration of a set of video projectors and color cameras.
- ProCamTracker – Another sample JavaCV application that uses the calibration from ProCamCalib to implement a vision method that tracks a textured planar surface and realizes markerless interactive augmented reality with projection mapping.
More Project Information
Please refer to the contribute and download pages for more information about how to help out or obtain this software.
See the developer site on GitHub for more general information about the Bytedeco projects.
Latest News
Third release at Bytedeco
Today, Bytedeco releases the third major iteration of its software, synchronized at version 0.10. For more information about what changed in each software component, please be sure to check out the repositories on GitHub: JavaCPP, JavaCPP Presets, JavaCV, ProCamCalib, and ProCamTracker.
The most prominent innovation in this release is the addition of the indexer package in JavaCPP, many thanks in no small part to Viliam Durina for the lengthy discussions on this subject and for offering invaluable advice. The classes found in that package allow easy, type-safe, and efficient access to multidimensional arrays, often useful when processing numerical matrices or image data. It is so fast, it can even be used as a way to accelerate some things over what can be done with OpenCV alone – all that from the comfort of Java, without increasing code complexity! For example, OpenCV does not come with any built-in optimized functions to do alpha blending on the CPU. We need to use more primitive arithmetic functions and implement custom processing to this effect, as shown in the sample code below:
import org.bytedeco.javacv.*;
import org.bytedeco.javacpp.*;
import org.bytedeco.javacpp.indexer.*;
import static org.bytedeco.javacpp.opencv_core.*;
public class Blend {
/** Does alpha blending using high-level functions from OpenCV. */
static void blendSlow(final Mat rgbaImg, final Mat bgrImg, final Mat dstImg) {
final MatVector rgba = new MatVector();
final MatVector bgr = new MatVector();
final MatVector dst = new MatVector();
Parallel.run(new Runnable() { public void run() { split(rgbaImg, rgba); }},
new Runnable() { public void run() { split(bgrImg, bgr); }},
new Runnable() { public void run() { split(dstImg, dst); }});
final Mat alpha = rgba.get(3);
final Mat beta = new Mat(alpha.rows(), alpha.cols(), CV_8UC1);
final Mat one = new Mat(new double[] { 255 });
subtract(one, alpha, beta);
Parallel.loop(0, 3, 3, new Parallel.Looper() {
public void loop(int from, int to, int i) {
multiply(rgba.get(2 - i), alpha, rgba.get(2 - i), 1.0/255.0, -1);
multiply(bgr.get(i), beta, bgr.get(i), 1.0/255.0, -1);
add(rgba.get(2 - i), bgr.get(i), dst.get(i));
}
});
merge(dst, dstImg);
}
/** Does alpha blending with high-performance Indexer from JavaCPP. */
static void blendFast(final Mat rgbaImg, final Mat bgrImg, final Mat dstImg) {
final UByteIndexer rgbaIdx = rgbaImg.createIndexer();
final UByteIndexer bgrIdx = bgrImg.createIndexer();
final UByteIndexer dstIdx = dstImg.createIndexer();
final int rows = rgbaImg.rows(), cols = rgbaImg.cols();
Parallel.loop(0, rows, new Parallel.Looper() {
public void loop(int from, int to, int looperID) {
for (int i = from; i < to; i++) {
for (int j = 0; j < cols; j++) {
float a = rgbaIdx.get(i, j, 3) * (1.0f/255.0f);
float b = rgbaIdx.get(i, j, 2) * a + bgrIdx.get(i, j, 0) * (1.0f - a);
float g = rgbaIdx.get(i, j, 1) * a + bgrIdx.get(i, j, 1) * (1.0f - a);
float r = rgbaIdx.get(i, j, 0) * a + bgrIdx.get(i, j, 2) * (1.0f - a);
dstIdx.put(i, j, 0, (byte)b);
dstIdx.put(i, j, 1, (byte)g);
dstIdx.put(i, j, 2, (byte)r);
}
}}});
rgbaIdx.release();
bgrIdx.release();
dstIdx.release();
}
public static void main(String[] args) {
// generate some random source images
Mat rgbaImg = new Mat(480, 640, CV_8UC4);
Mat bgrImg = new Mat(480, 640, CV_8UC3);
Mat dst1Img = new Mat(480, 640, CV_8UC3);
Mat dst2Img = new Mat(480, 640, CV_8UC3);
randu(rgbaImg, new Mat(new int[] { 0, 0, 0, 0 }), new Mat(new int[] { 255, 255, 255, 255 }));
randu(bgrImg, new Mat(new int[] { 0, 0, 0 }), new Mat(new int[] { 255, 255, 255 }));
// warm up
for (int i = 0; i < 100; i++) {
blendSlow(rgbaImg, bgrImg, dst1Img);
}
for (int i = 0; i < 100; i++) {
blendFast(rgbaImg, bgrImg, dst2Img);
}
// benchmark
long time1 = System.nanoTime();
for (int i = 0; i < 100; i++) {
blendSlow(rgbaImg, bgrImg, dst1Img);
}
long time2 = System.nanoTime();
for (int i = 0; i < 100; i++) {
blendFast(rgbaImg, bgrImg, dst2Img);
}
long time3 = System.nanoTime();
System.out.println("blendSlow(): " + (time2 - time1) / 100000000.0f + " ms");
System.out.println("blendFast(): " + (time3 - time2) / 100000000.0f + " ms");
// compare
UByteIndexer dst1Idx = dst1Img.createIndexer();
UByteIndexer dst2Idx = dst2Img.createIndexer();
int rows = bgrImg.rows(), cols = bgrImg.cols(), channels = bgrImg.channels();
exit:
for (int i = 0; i < rows; i++) {
for (int j = 0; j < cols; j++) {
for (int k = 0; k < channels; k++) {
int val1 = dst1Idx.get(i, j, k);
int val2 = dst2Idx.get(i, j, k);
if (Math.abs(val1 - val2) > 1) {
System.out.println(val1 + " != " + val2 + " at (" + i + ", " + j + ")");
break exit;
}
}
}
}
System.exit(0);
}
}
In this case, the blendSlow()
method uses OpenCV functions to perform alpha blending, while blendFast()
makes use of Indexer
objects to obtain similar results. Both exploit multithreading to some extent via the Parallel
class found in JavaCV. However, blendFast()
is much faster. On my machine (Intel Core i7-3632QM CPU @ 2.20GHz, Fedora 20, GCC 4.8.3, and OpenJDK 8) the main()
method above outputs something like the following:
blendSlow(): 5.623936 ms
blendFast(): 1.42263 ms
So, the version using Indexer
objects runs almost 4 times faster! This is because it allows the user to save on memory bandwidth by keeping intermediate values in cache memory, something that cannot be done with the batch operations found in OpenCV.
I hope you find this functionality useful, and as usual, if you have any questions, problems, or would like to contribute, but are unsure how to proceed, please feel free to share your concerns on the mailing list or via “issues” on GitHub. Enjoy and thank you for your continued interest!