java JNI 实现原理 (一)

来源：互联网发布：java布局方式编辑：程序博客网时间：2024/05/18 03:44

一、虚拟机中classloader的JNILibrary

调用JNI的时候，通常我们使用System.loadLibrary(String libname)来load JNI library, 同样也可以使用System.load(String fileName)来load JNI library，两者的区别是一个只需要设置库的名字，比如如果libA.so 只要输入A就可以了，而libA.so的位置可以同过设置 Java.library.path 或者 sun.boot.library.path，后者输入的是完整路经的文件名。

而不论用什么方法，最后JNI 库是通过classloader 来加载的。

[java] view plain copy
static void loadLibrary(Class fromClass, String name,  
                boolean isAbsolute) {}  

每个classloader 对象都有自己的nativeLibrary 数组，一个全局的systemNativeLibrary 数组，一个全局的已经加载过的loadLibraryNames数组，和一个正在加载过程中的记录栈nativeLibraryContext

对同一个classloader 对象可以重复加载相同的库，对不同的classloader只可以加载一次相同的库。

1. 这里定义的相同的库是指相同路经下的同一个文件

2. 这里同样指出的是同一个classloader对象，而不是同一种classloader类型，比如说如果一种classloader类型初始化成2个classloader对象，那么这两个对象就不能重复加载相同的库。

3. 重复加载，并不代表真的重复加载，而是代码中保护

[java] view plain copy
for (int i = 0; i < size; i++) {  
            NativeLibrary lib = (NativeLibrary)libs.elementAt(i);  
        if (name.equals(lib.name)) {  
            return true;  
        }  
        }  

4. 如果加载其他classloader已经加载过的库，会抛出 UnsatisfiedLinkError ERROR

在tomcat上，在不同的war包里，想加载相同的库文件，因为在 tomcat上是使用不同的classloader的对象去加载不同的war包，建议库文件放置在不同的路径通过System.load去加载。

二、Linux 下如何 load JNILibrary

在博客java JNI (一)虚拟机中classloader的JNILibrary 中讨论了Java中的Library 是由classloader 来load的，那我们来看看 classloader是如何去load 一个library的

ClassLoader.c

[cpp] view plain copy
JNIEXPORT void JNICALL   
Java_java_lang_ClassLoader_00024NativeLibrary_load  
  (JNIEnv *env, jobject this, jstring name)  
{  
    const char *cname;  
    jint jniVersion;  
    jthrowable cause;  
    void * handle;  
  
    if (!initIDs(env))  
        return;  
  
    cname = JNU_GetStringPlatformChars(env, name, 0);  
    if (cname == 0)  
        return;  
    handle = JVM_LoadLibrary(cname);  
    if (handle) {  
        const char *onLoadSymbols[] = JNI_ONLOAD_SYMBOLS;  
        JNI_OnLoad_t JNI_OnLoad;  
    int i;  
    for (i = 0; i < sizeof(onLoadSymbols) / sizeof(char *); i++) {  
        JNI_OnLoad = (JNI_OnLoad_t)   
            JVM_FindLibraryEntry(handle, onLoadSymbols[i]);  
        if (JNI_OnLoad) {  
            break;  
        }  
    }  
    if (JNI_OnLoad) {  
        JavaVM *jvm;  
        (*env)->GetJavaVM(env, &jvm);  
        jniVersion = (*JNI_OnLoad)(jvm, NULL);  
    } else {  
        jniVersion = 0x00010001;  
    }  
  
    cause = (*env)->ExceptionOccurred(env);  
    if (cause) {  
        (*env)->ExceptionClear(env);  
        (*env)->Throw(env, cause);  
        JVM_UnloadLibrary(handle);  
        goto done;  
    }  
     
    if (!JVM_IsSupportedJNIVersion(jniVersion)) {  
        char msg[256];  
        jio_snprintf(msg, sizeof(msg),  
             "unsupported JNI version 0x%08X required by %s",  
             jniVersion, cname);  
        JNU_ThrowByName(env, "java/lang/UnsatisfiedLinkError", msg);  
        JVM_UnloadLibrary(handle);  
        goto done;  
    }  
    (*env)->SetIntField(env, this, jniVersionID, jniVersion);  
    } else {  
    cause = (*env)->ExceptionOccurred(env);  
    if (cause) {  
        (*env)->ExceptionClear(env);  
        (*env)->SetLongField(env, this, handleID, (jlong)NULL);  
        (*env)->Throw(env, cause);  
    }  
    goto done;  
    }  
    (*env)->SetLongField(env, this, handleID, ptr_to_jlong(handle));  
  
 done:  
    JNU_ReleaseStringPlatformChars(env, name, cname);  
}  

1. JVM_LoadLibrary

jvm中load library 核心函数，实现也非常简单，在linux下调用了系统函数dlopen去打开库文件，详细可参考方法

[cpp] view plain copy
void * os::dll_load(const char *filename, char *ebuf, int ebuflen)  

2. JVM_FindLibraryEntry

JVM在加载库文件时候，会去尝试查找库中的JNI_ONLOAD方法的地址，而在Linux中调用了dlsym函数通过前面的dlopen加载库的指针去获取方法的地址，而dlsym在glibc2.0是非线程安全的，需要锁的保护，虽然在java中加载库已经有锁的保护，但只是针对同一个classloader对象的细粒度锁。

[cpp] view plain copy
void* os::dll_lookup(void* handle, const char* name) {  
  pthread_mutex_lock(&dl_mutex);  
  void* res = dlsym(handle, name);  
  pthread_mutex_unlock(&dl_mutex);  
  return res;  
}  

3. 方法JNI_OnLoad

JVM提供了一种方式允许你在加载库文件的时候做一些你想做的事情，也就是JNI_OnLoad方法

在2中提到过在加载动态链接库，JVM会去尝试查找JNI_OnLoad方法，同时也会调用该函数，这样你个人可以在函数里做一些初始化的事情，比如register native方法。

[cpp] view plain copy
JNIEXPORT jint JNICALL JNI_OnLoad(JavaVM* vm, void* reserved)  
{}  

JNI_OnLoad中返回的是JNI 的version,在1.6版本的情况下支持如下

[cpp] view plain copy
jboolean Threads::is_supported_jni_version(jint version) {  
  if (version == JNI_VERSION_1_2) return JNI_TRUE;  
  if (version == JNI_VERSION_1_4) return JNI_TRUE;  
  if (version == JNI_VERSION_1_6) return JNI_TRUE;  
  return JNI_FALSE;  
}  

完整的加载过程就是

首先先加载动态链接库，尝试查找JNI_OnLoad方法，并且运行方法，对我们来说从而实现可以自定义的初始化方法。

三、JNI中的RegisterNatives方法

我们常用javah去生成JNI的头文件，然后去实现自己定义的JNI方法，使用这种方式比较传统，我们可以看到定义的格式甚至连名字都必须按照规范

[cpp] view plain copy
JNIEXPORT jint JNICALL Java_test_symlink  
  (JNIEnv *, jobject, jstring, jstring);  

完整的结构是Java_classpath_classname_native method name，这样才能当jvm运行的时候根据这个命名规则去找到对应的native的方法。

实际上jvm也同时提供了直接RegisterNative方法手动的注册native方法

下面是一个代码的例子

[cpp] view plain copy
static JNINativeMethod methods[] = {  
    {"retrieveDirectives",  "()Ljava/lang/AssertionStatusDirectives;", (void *)&JVM_AssertionStatusDirectives}  
};  
  
  
    (*env)->RegisterNatives(env, cls, methods,   
                sizeof(methods)/sizeof(JNINativeMethod));  

RegisterNative 函数中的参数

RegisterNative(JNIEnv, jclass cls, JNINativeMethod *methods, jint number)

1. methods 是一个二维数组，代表着这个class里的每一个native方法所对应的实现的方法，在前面的例子中表示，一个native 方法retrieveDiretives, 返回值为AssertionStatusDirectives, 所对应的执行的本地方法是JVM_AssertionStatusDirectives

2. 后面的number 代表要指定的native的数量

RegisterNative的实现

RegisterNative 的实现非常简单，就是将class里面native的方法的地址+1指向执行的c代码的函数地址也就是上面的&JVM_AssertionStatusDirectives

[cpp] view plain copy
address* native_function_addr() const          { assert(is_native(), "must be native"); return (address*) 

四、初始化JNI方法

这是jvm当初始化类的时候，class的调用层级关系

instanceKlass::initialize()
-> instanceKlass::initialize_impl()
-> instanceKlass::link_class()
-> instanceKlass::link_class_impl()
-> instanceKlass::rewrite_class()
-> Rewriter::rewrite()
-> Rewriter::Rewriter()
-> methodOopDesc::link_method()

在方法methodOopDesc::link_method 设置到了对应的natvive方法的解释entry

[cpp] view plain copy
void methodOopDesc::link_method(methodHandle h_method, TRAPS) {  
  assert(_i2i_entry == NULL, "should only be called once");  
  assert(_adapter == NULL, "init'd to NULL" );  
  assert( _code == NULL, "nothing compiled yet" );  
  
  // Setup interpreter entrypoint  
  assert(this == h_method(), "wrong h_method()" );  
  address entry = Interpreter::entry_for_method(h_method); //找到对应的方法 entry  
  assert(entry != NULL, "interpreter entry must be non-null");  
  // Sets both _i2i_entry and _from_interpreted_entry  
  set_interpreter_entry(entry); //并且把entry设置到了methodoop中  
  ...  
}  

找到对应的方法类型的entry

函数entry_for_method 是从_entry_table数组中找到对应的entry

[cpp] view plain copy
static address    entry_for_method(methodHandle m)            { return _entry_table[method_kind(m)]; }  

在函数中TemplateInterpreterGenerator::generate_all，我们可以看到初始化了_entry_table entry数组,而这是在jvm初始化的时候（jint init_globals()）初始化的。

[cpp] view plain copy
#define method_entry(kind)                                                                    \  
  { CodeletMark cm(_masm, "method entry point (kind = " #kind ")");                    \  
    Interpreter::_entry_table[Interpreter::kind] = generate_method_entry(Interpreter::kind);  \  
  }  
  
  // all non-native method kinds  
  method_entry(zerolocals)  
  method_entry(zerolocals_synchronized)  
  method_entry(empty)  
  method_entry(accessor)  
  method_entry(abstract)  
  method_entry(method_handle)  
  method_entry(java_lang_math_sin  )  
  method_entry(java_lang_math_cos  )  
  method_entry(java_lang_math_tan  )  
  method_entry(java_lang_math_abs  )  
  method_entry(java_lang_math_sqrt )  
  method_entry(java_lang_math_log  )  
  method_entry(java_lang_math_log10)  
  
  // all native method kinds (must be one contiguous block)  
  Interpreter::_native_entry_begin = Interpreter::code()->code_end();  
  method_entry(native)  
  method_entry(native_synchronized)  
  Interpreter::_native_entry_end = Interpreter::code()->code_end();  
  
#undef method_entry  

而对应的不同的方法，使用不同的entry 是在函数generate_method_entry里定义的

[cpp] view plain copy
address AbstractInterpreterGenerator::generate_method_entry(  
                                        AbstractInterpreter::MethodKind kind) {  
  // determine code generation flags  
  bool synchronized = false;  
  address entry_point = NULL;  
  
  switch (kind) {  
  case Interpreter::zerolocals             :                                                                             break;  
  case Interpreter::zerolocals_synchronized: synchronized = true;                                                        break;  
  case Interpreter::native                 : entry_point = ((InterpreterGenerator*) this)->generate_native_entry(false); break;  
  case Interpreter::native_synchronized    : entry_point = ((InterpreterGenerator*) this)->generate_native_entry(true);  break;  
  case Interpreter::empty                  : entry_point = ((InterpreterGenerator*) this)->generate_empty_entry();       break;  
  case Interpreter::accessor               : entry_point = ((InterpreterGenerator*) this)->generate_accessor_entry();    break;  
  case Interpreter::abstract               : entry_point = ((InterpreterGenerator*) this)->generate_abstract_entry();    break;  
  case Interpreter::method_handle          : entry_point = ((InterpreterGenerator*) this)->generate_method_handle_entry();break;  
  
  case Interpreter::java_lang_math_sin     : // fall thru  
  case Interpreter::java_lang_math_cos     : // fall thru  
  case Interpreter::java_lang_math_tan     : // fall thru  
  case Interpreter::java_lang_math_abs     : // fall thru  
  case Interpreter::java_lang_math_log     : // fall thru  
  case Interpreter::java_lang_math_log10   : // fall thru  
  case Interpreter::java_lang_math_sqrt    : entry_point = ((InterpreterGenerator*) this)->generate_math_entry(kind);    break;  
  default                                  : ShouldNotReachHere();                                                       break;  
  }  
  
  if (entry_point) {  
    return entry_point;  
  }  
  
  return ((InterpreterGenerator*) this)->  
                                generate_normal_entry(synchronized);  
}  

我们可以看到在 native,和native_synchronized的情况下，使用了generate_native_entry

在methodoop设置了entry

在方法methodOopDesc::link_method 中，设置了_i2i_entry,和_from_interpreted_entry为entry 也是就在native的情况下设置了generate_native_entry

[cpp] view plain copy
void set_interpreter_entry(address entry)      { _i2i_entry = entry;  _from_interpreted_entry = ent

五、JNI方法解释调用

Hotspot主要有两种解释器，而下面我们主要讨论的是 Template Intepreter也叫asm interprete解释器, 文章下面的介绍基本都是基于template解释器

我们举一个invokespecial的例子，下面是templateTable方法解释invokespecial的代码

[cpp] view plain copy
void TemplateTable::invokespecial(int byte_no) {  
  transition(vtos, vtos);  
  assert(byte_no == f1_byte, "use this argument");  
  prepare_invoke(rbx, noreg, byte_no);  
  // do the call  
  __ verify_oop(rbx);  
  __ profile_call(rax);  
  __ jump_from_interpreted(rbx, rax);  
}  

函数prepare_invoke

函数prepare_invoke的层级调用关系

TemplateTable::prepare_invoke

-> TemplateTable::load_invoke_cp_cache_entry

-> TemplateTable::resolve_cache_and_index

在函数中resolve_cache_and_index可以看到

1. 首先先检查constantpoolcache，是否将方法指针保存到到线程的constantpoolcache里，如果有在方法里会使用jcc跳转到Label resolved去，而Lable resolved 在方法第一次运行结束后bind到函数的末尾。

2. 如果cache里没有那么会尝试用interpreterRuntime:resolve_invoke去找到正确的method, 并保存到constant pool cache 里

[cpp] view plain copy
case Bytecodes::_invokevirtual:  
case Bytecodes::_invokespecial:  
case Bytecodes::_invokestatic:  
case Bytecodes::_invokeinterface:  
  entry = CAST_FROM_FN_PTR(address, InterpreterRuntime::resolve_invoke);  
  break;  

而函数

interpreterRuntime:resolve_invoke

-->LinkResolver::resolve_invoke

-->LinkResolver::resolve_special_call

-->LinkResolver::linktime_resolve_special_method

-->LinkResolver::resolve_method

不论什么调用方式，最后都会调用LinkResolver::resolve_method 找到真实的调用方法，通过runtime_resolve_special_method把method指针作为methodhandle存放到CallInfo 中传回InterperterRuntime::resolve_invoke中，同时在CallInfo::set_common当设置-Xcomp情况下，决定是否需要编译方法。

我们可以看到方法prepare_invoke，已经找到了methodoop指针并且存放到寄存器rbx中

函数jump_from_interpreted

[cpp] view plain copy
void InterpreterMacroAssembler::jump_from_interpreted(Register method, Register temp) {  
  prepare_to_jump_from_interpreted();  
  
  if (JvmtiExport::can_post_interpreter_events()) {  
    Label run_compiled_code;  
    // JVMTI events, such as single-stepping, are implemented partly by avoiding running  
    // compiled code in threads for which the event is enabled.  Check here for  
    // interp_only_mode if these events CAN be enabled.  
    get_thread(temp);  
    // interp_only is an int, on little endian it is sufficient to test the byte only  
    // Is a cmpl faster (ce  
    cmpb(Address(temp, JavaThread::interp_only_mode_offset()), 0);  
    jcc(Assembler::zero, run_compiled_code);  
    jmp(Address(method, methodOopDesc::interpreter_entry_offset()));  
    bind(run_compiled_code);  
  }  
  
  jmp(Address(method, methodOopDesc::from_interpreted_offset()));  
  
}  

我们看到跳转到了methodoop中的_from_interpreted_entry，也就是在前面的博客里（java JNI (四) 初始化JNI方法）说的generate_native_entry 中

在这篇博客并没有太多的涉及到native方法的调用，而是asm解释器在解释一个方法的时候如何link到所对应method，并且找到处理method的entry。

六、调用JNI方法

在前面的博客中已经提到过JNI的entry是generate_native_entry，也就是说方法generate_native_entry才是最终调用的我们自己写的库文件里的方法

针对不同的解释器的类型，会调用不同的generate_native_entry，下面主要讨论的还是以template interpreter为主

如果是X86的，可以参考templateInterpreter_x86_64.cpp

[cpp] view plain copy
address InterpreterGenerator::generate_native_entry(bool synchronized) {  
....  
  {  
    Label L;  
    __ movptr(rax, Address(method, methodOopDesc::native_function_offset()));  
    ExternalAddress unsatisfied(SharedRuntime::native_method_throw_unsatisfied_link_error_entry());  
    __ movptr(rscratch2, unsatisfied.addr());  
    __ cmpptr(rax, rscratch2);  
    __ jcc(Assembler::notEqual, L);  
    __ call_VM(noreg,  
               CAST_FROM_FN_PTR(address,  
                                InterpreterRuntime::prepare_native_call),  
               method);  
    __ get_method(method);  
    __ verify_oop(method);  
    __ movptr(rax, Address(method, methodOopDesc::native_function_offset()));  
    __ bind(L);  
  }  
.....  
  __ call(rax);  
.....  
}  

具体我们来看prepare_native_call方法的实现

[cpp] view plain copy
IRT_ENTRY(void, InterpreterRuntime::prepare_native_call(JavaThread* thread, methodOopDesc* method))  
  methodHandle m(thread, method);  
  assert(m->is_native(), "sanity check");  
  // lookup native function entry point if it doesn't exist  
  bool in_base_library;  
  if (!m->has_native_function()) {  
    NativeLookup::lookup(m, in_base_library, CHECK);  
  }  
  // make sure signature handler is installed  
  SignatureHandlerLibrary::add(m);  
  // The interpreter entry point checks the signature handler first,  
  // before trying to fetch the native entry point and klass mirror.  
  // We must set the signature handler last, so that multiple processors  
  // preparing the same method will be sure to see non-null entry & mirror.  
IRT_END  

在代码中

if (!m->has_native_function()) {
NativeLookup::lookup(m, in_base_library, CHECK);
}

首先先检查一下是不是已经在method里面定义了native的方法，也就是我们前面提到的JNI中的（RegisterNatives方法）中是不是已经单独RegisterNatives注册了native方法

如果没有的话，将按照javah生成的JNI头文件里的方法的名字来绑定，也就是lookup里做的事情，然后设置回method的native的方法中去，保证调用只初始化一次，不是每次调用都去查找一遍

这样在方法prepare_native_call 中，我们可以和前面的博客（java JNI 实现原理 (三) JNI中的RegisterNatives方法）,完整的联系起来。

下面大概的介绍一下，完整的native call的过程

a. 初始化一些参数，把方法的一些信息放到对应的寄存器上

b. 检查是否需要锁，如果需要，锁到对应的object

c. 设置几个handler, signature hanlder, result handler, mirror handler(static 方法)

d. 找到 native 方法的指针，设置到rax寄存器中

e. 传入JNIEnv 对象在native方法第一个参数（因为JNIEnv 对象在Java 代码中的native 方法没有，而在自己定义的native方法里用来取得jni的运行环境的，所以需要在这里额外传入）

f. 设置java栈信息 last_java_frame

g. 设置线程信息为_thread_in_native

h. 运行native 方法

i. 在多核的情况下，使用Membar清楚内存cache

j. 检查是否在safepoint的点

k. 还原java 栈信息 reset_last_java_frame

l. 对结果进行一些处理

m. 处理过程中的异常

n. 释放锁

o. 调用result hanlder 的到结果

0 0