ThreadLocal 源码学习

02 Jan 2021 - 10 minute read

ThreadLocal的作用是提供线程内的局部变量，不同的线程之间不会互相干扰，这种变量在线程的生命周期内起作用，减少通一个线程内多个函数或组件之间一些公共变量传递的复杂度。

总结下来就是：

线程并发：在多线程并发的场景下
传递数据：我们可以通过ThreadLocal在同一线程，不同组件中传递公共变量
线程隔离：每个线程的变量都是独立的，不会互相影响

基本使用

在可能有并发访问的对象中设置ThreadLocal变量，然后通过ThreadLocal的set和get方法进行针对某一个线程的变量进行修改和访问。

并发问题出现：

package threadlocaltest;

public class Content {

    String content;
    public void setContent(String s) {
        this.content = s;
    }
    public String getContent() {
        return content;
    }

    public static void main(String[] args) {
        Content c1 = new Content();
        for(int i = 0; i < 6; i++) {
            Thread t = new Thread(new Runnable() {
                @Override
                public void run() {
                    c1.setContent(Thread.currentThread().getName() + "的内容");
                    System.out.println("------------------------------");
                    System.out.println(Thread.currentThread().getName() + " ----> " + c1.getContent());
                }
            });
            t.setName("Thread " + i);
            t.start();
        }
    }
}

output

Thread 0 —-> Thread 3的内容

Thread 4 —-> Thread 4的内容

Thread 3 —-> Thread 5的内容

Thread 2 —-> Thread 5的内容

Thread 1 —-> Thread 5的内容

Thread 5 —-> Thread 5的内容

通过加入ThreadLocal变量，代码可以改写为

package threadlocaltest;

public class Content {
    ThreadLocal<String> localVal = new ThreadLocal<>();
    String content;
    public void setContent(String s) {
        localVal.set(s);
    }
    public String getContent() {
        return localVal.get();
    }

    public static void main(String[] args) {
        Content c1 = new Content();
        for(int i = 0; i < 6; i++) {
            Thread t = new Thread(new Runnable() {
                @Override
                public void run() {
                    c1.setContent(Thread.currentThread().getName() + "的内容");
//                    System.out.println("------------------------------");
                    System.out.println(Thread.currentThread().getName() + " ----> " + c1.getContent());
                }
            });
            t.setName("Thread " + i);
            t.start();
        }
    }
}

output

Thread 0 —-> Thread 0的内容 Thread 1 —-> Thread 1的内容 Thread 2 —-> Thread 2的内容 Thread 5 —-> Thread 5的内容 Thread 3 —-> Thread 3的内容 Thread 4 —-> Thread 4的内容

ThreadLocal vs Synchronized

	synchronized	ThreadLocal
原理	同步机制采用“以时间换空间“的方式，只提供了一份变量，让不同的线程排队访问	ThreadLocal采用”以空间换时间“的方式，为每一个线程都提供了一份变量的副本，从而实现同时访问而相不干扰
侧重点	多个线程之间访问资源的同步	多线程中让每个线程之间的数据互相隔离

ThreadLocal有两个突出的有事：

传递数据：保存每个线程绑定的数据，在需要的地方可以直接获取，避免参数直接传递带来的代码耦合问题
线程隔离：各线程之间的数据相互隔离却又具备并发性，避免同步方法带来的性能损失。

ThreadLcoal内部结构

在JDK8中ThreadLocal的设计是：每个Thread都维护一个ThreadLocalMap，这个map的key是ThreadLocal实例本身，value才是真正要存储的值Object。

以上是JDK8和之前版本设计的区别，JDK8的具体过程如下：

每个thread内部都有一个map
map里面存储threadlocal对象（key）和线程变量副本（value）
Thread内部的map是由ThreadLocal维护的，由ThreadLocal负责向map获取和设置线程的变量值
对于不同的线程，每次获取副本值时，别的线程并不能获取当前线程的副本值，形成了副本的隔离，互不干扰。

这样设计的好处是：

每个Map存储的Entry数量变少，因为ThreadLocal的数量是远远小于线程数量的
当Thread销毁的时候，ThreadLocalMap也会随之销毁，减少内存使用。

ThreadLocal源码分析

set方法

    /**
     * Sets the current thread's copy of this thread-local variable
     * to the specified value.  Most subclasses will have no need to
     * override this method, relying solely on the {@link #initialValue}
     * method to set the values of thread-locals.
     *
     * @param value the value to be stored in the current thread's copy of
     *        this thread-local.
     */
	public void set(T value) {
        Thread t = Thread.currentThread();
        ThreadLocalMap map = getMap(t);
        if (map != null)
            map.set(this, value);
        else
            createMap(t, value);
    }


    void createMap(Thread t, T firstValue) {
        t.threadLocals = new ThreadLocalMap(this, firstValue);
    }

流程如下：

首先获取当前线程，并故居当前线程获取一个Map
如果获取的map不为空，则将参数设置到Map中（当前ThreadLocal的引用作为key）
如果Map为空，则给该线程创建map，并设置初始值

Thread类里维护了一个ThreadLocal

    /* ThreadLocal values pertaining to this thread. This map is maintained
     * by the ThreadLocal class. */
    ThreadLocal.ThreadLocalMap threadLocals = null;

get方法

    /**
     * Returns the value in the current thread's copy of this
     * thread-local variable.  If the variable has no value for the
     * current thread, it is first initialized to the value returned
     * by an invocation of the {@link #initialValue} method.
     *
     * @return the current thread's value of this thread-local
     */    
	public T get() {
        Thread t = Thread.currentThread();
        ThreadLocalMap map = getMap(t);
        if (map != null) {
            ThreadLocalMap.Entry e = map.getEntry(this);
            if (e != null) {
                @SuppressWarnings("unchecked")
                T result = (T)e.value;
                return result;
            }
        }
        return setInitialValue();
    }

    /**
     * Variant of set() to establish initialValue. Used instead
     * of set() in case user has overridden the set() method.
     *
     * @return the initial value
     */
    private T setInitialValue() {
        // 调用initialValue获取初始化的值
        // 此方法可以被子类重写，如果不重写默认返回null
        T value = initialValue();
        Thread t = Thread.currentThread();
        ThreadLocalMap map = getMap(t);
        if (map != null)
            map.set(this, value);
        else
            // 1.当前线程Thread不存在ThreadLocalMap对象
            // 2.调用createMap进行ThreadLocalMap对象的初始化
            // 3.并将t(当前线程)和value(t对应的值)作为第一个entry存放至ThreadLocalMap中
            createMap(t, value);
        return value;
    }

流程如下：

首先获取当前线程，根据当前线程获取一个Map
如果获取的map不为空，则在map中以threadLocal的引用作为Key在map中获取对应的entry e，否则转到4
如果e不为null，则返回e.value,否则转到4
map为空或者e为空，则通过initialVlaue函数获取初始值value然后用ThreadLocal的引用和value作为firstKey和firstValue创建一个新的Map

先获取当前线程的ThreadLocalMap变量，如果存在则返回值，不存在则创建并返回初始值。

remove方法

    /**
     * Removes the current thread's value for this thread-local
     * variable.  If this thread-local variable is subsequently
     * {@linkplain #get read} by the current thread, its value will be
     * reinitialized by invoking its {@link #initialValue} method,
     * unless its value is {@linkplain #set set} by the current thread
     * in the interim.  This may result in multiple invocations of the
     * {@code initialValue} method in the current thread.
     *
     * @since 1.5
     */
     public void remove() {
         ThreadLocalMap m = getMap(Thread.currentThread());
         if (m != null)
             m.remove(this);
     }

获取当前线程，并根据当前线程获取一个map
如果获取的map不为空，则移除当前ThreadLocal对象对应的entry

InitialValue

    /**
     * Returns the current thread's "initial value" for this
     * thread-local variable.  This method will be invoked the first
     * time a thread accesses the variable with the {@link #get}
     * method, unless the thread previously invoked the {@link #set}
     * method, in which case the {@code initialValue} method will not
     * be invoked for the thread.  Normally, this method is invoked at
     * most once per thread, but it may be invoked again in case of
     * subsequent invocations of {@link #remove} followed by {@link #get}.
     *
     * <p>This implementation simply returns {@code null}; if the
     * programmer desires thread-local variables to have an initial
     * value other than {@code null}, {@code ThreadLocal} must be
     * subclassed, and this method overridden.  Typically, an
     * anonymous inner class will be used.
     *
     * @return the initial value for this thread-local
     */
    protected T initialValue() {
        return null;
    }

这个方法是一个延迟调用方法，从上面的代码我们得知，在set方法还未调用而先调用了get方法时才只能，并且仅执行一次。
这个方法缺省实现直接返回一个null
如果想要以个除null之外的初始值，可以重写此方法。（此方法是一个protected的方法，显然是为了让子类覆盖而设计的）

ThreadLocalMap源码分析

ThreadLocalMap是ThreadLocal的内部类，没有实现Map接口，用独立的方式实现了Map的功能，其内部的Entry也是独立实现的。

成员变量

        /**
         * The initial capacity -- MUST be a power of two.
         */
        private static final int INITIAL_CAPACITY = 16;

        /**
         * The table, resized as necessary.
         * table.length MUST always be a power of two.
         */
        private Entry[] table;

        /**
         * The number of entries in the table.
         */
        private int size = 0;

        /**
         * The next size value at which to resize.
         */
        private int threshold; // Default to 0

        /**
         * Set the resize threshold to maintain at worst a 2/3 load factor.
         */
        private void setThreshold(int len) {
            threshold = len * 2 / 3;
        }

        /**
         * Increment i modulo len.
         */
        private static int nextIndex(int i, int len) {
            return ((i + 1 < len) ? i + 1 : 0);
        }

        /**
         * Decrement i modulo len.
         */
        private static int prevIndex(int i, int len) {
            return ((i - 1 >= 0) ? i - 1 : len - 1);
        }

存储结构-Entry

        /**
         * The entries in this hash map extend WeakReference, using
         * its main ref field as the key (which is always a
         * ThreadLocal object).  Note that null keys (i.e. entry.get()
         * == null) mean that the key is no longer referenced, so the
         * entry can be expunged from table.  Such entries are referred to
         * as "stale entries" in the code that follows.
         */
        static class Entry extends WeakReference<ThreadLocal<?>> {
            /** The value associated with this ThreadLocal. */
            Object value;

            Entry(ThreadLocal<?> k, Object v) {
                super(k);
                value = v;
            }
        }

Entry中的eky只能是ThreadLocal对象，已经在构造方法中写死了。另外，Entry继承了WeakReferrence，也就是key（ThreadLocal）是弱引用，其目的是将ThreadLocal对象的声明周期和线程声明周期解绑。

构造方法

        /**
         * Construct a new map initially containing (firstKey, firstValue).
         * ThreadLocalMaps are constructed lazily, so we only create
         * one when we have at least one entry to put in it.
         */
        ThreadLocalMap(ThreadLocal<?> firstKey, Object firstValue) {
            table = new Entry[INITIAL_CAPACITY];
            int i = firstKey.threadLocalHashCode & (INITIAL_CAPACITY - 1);
            table[i] = new Entry(firstKey, firstValue);
            size = 1;
            setThreshold(INITIAL_CAPACITY);
        }

重点 int i = firstKey.threadLocalHashCode & (INITIAL_CAPACITY - 1);

    /**
     * ThreadLocals rely on per-thread linear-probe hash maps attached
     * to each thread (Thread.threadLocals and
     * inheritableThreadLocals).  The ThreadLocal objects act as keys,
     * searched via threadLocalHashCode.  This is a custom hash code
     * (useful only within ThreadLocalMaps) that eliminates collisions
     * in the common case where consecutively constructed ThreadLocals
     * are used by the same threads, while remaining well-behaved in
     * less common cases.
     */
    private final int threadLocalHashCode = nextHashCode();

    /**
     * Returns the next hash code.
     */
    private static int nextHashCode() {
        return nextHashCode.getAndAdd(HASH_INCREMENT);
    }

    /**
     * The next hash code to be given out. Updated atomically. Starts at
     * zero.
     */
    private static AtomicInteger nextHashCode =
        new AtomicInteger();

    /**
     * The difference between successively generated hash codes - turns
     * implicit sequential thread-local IDs into near-optimally spread
     * multiplicative hash values for power-of-two-sized tables.
     */
    private static final int HASH_INCREMENT = 0x61c88647;

关于firstKey.threadLocalHashCode & (INITIAL_CAPACITY - 1);

这里定义了一个AtomicInteger类型，每次获取当前值并加上HASH_INCREMENT，这个值跟斐波那契数列有关，其主要目的就是为了让哈希码能均匀的分布在2的n此房的数组里，也就是Entry[] table中，这样做可以尽量避免hash冲突

关于& (INITIAL_CAPACITY - 1)

计算hash的时候里面采用了hashCode&(size - 1)的算法，这相当于取模运算hashCode % size的一个更搞笑的实现。正式因为这种算法，我们要求size必须是2的整次幂，这也能保证在索引不越界的前提下，使得hash冲突的次数减小。

set方法

        /**
         * Set the value associated with key.
         *
         * @param key the thread local object
         * @param value the value to be set
         */
        private void set(ThreadLocal<?> key, Object value) {

            // We don't use a fast path as with get() because it is at
            // least as common to use set() to create new entries as
            // it is to replace existing ones, in which case, a fast
            // path would fail more often than not.

            Entry[] tab = table;
            int len = tab.length;
            int i = key.threadLocalHashCode & (len-1);

            for (Entry e = tab[i];
                 e != null;
                 e = tab[i = nextIndex(i, len)]) {
                ThreadLocal<?> k = e.get();
                // 如果存在，覆盖
                if (k == key) {
                    e.value = value;
                    return;
                }

                if (k == null) {
                                replaceStaleEntry(key, value, i);
                    return;
                }
            }

            tab[i] = new Entry(key, value);
            int sz = ++size;
            if (!cleanSomeSlots(i, sz) && sz >= threshold)
                rehash();
        }

        /**
         * Increment i modulo len.
         */
        private static int nextIndex(int i, int len) {
            return ((i + 1 < len) ? i + 1 : 0);
        }

根据key计算出索引i，然后查找位置上的Entry
如果entry已经存在并且key等于传入key，直接覆盖
如果entry存在，但是key为null，调用replaceStaleEntry来更换这个key为空的entry
不断玄幻检测，知道遇到为null的地方，这时候要是还没在循环过程中return，那么久在这个null的位置新建一个Entry并且插入，size+1
最后调用cleanSomeSlots，清理key为null的entry，最后返回是否清理了entry，接下来再来判断sz是否>=thredhold达到rehash的条件，达到的话就调用rehash函数执行一次权标的扫描清理

        /**
         * Replace a stale entry encountered during a set operation
         * with an entry for the specified key.  The value passed in
         * the value parameter is stored in the entry, whether or not
         * an entry already exists for the specified key.
         *
         * As a side effect, this method expunges all stale entries in the
         * "run" containing the stale entry.  (A run is a sequence of entries
         * between two null slots.)
         *
         * @param  key the key
         * @param  value the value to be associated with key
         * @param  staleSlot index of the first stale entry encountered while
         *         searching for key.
         */
        private void replaceStaleEntry(ThreadLocal<?> key, Object value,
                                       int staleSlot) {
            Entry[] tab = table;
            int len = tab.length;
            Entry e;

            // Back up to check for prior stale entry in current run.
            // We clean out whole runs at a time to avoid continual
            // incremental rehashing due to garbage collector freeing
            // up refs in bunches (i.e., whenever the collector runs).
            int slotToExpunge = staleSlot;
            for (int i = prevIndex(staleSlot, len);
                 (e = tab[i]) != null;
                 i = prevIndex(i, len))
                if (e.get() == null)
                    slotToExpunge = i;

            // Find either the key or trailing null slot of run, whichever
            // occurs first
            for (int i = nextIndex(staleSlot, len);
                 (e = tab[i]) != null;
                 i = nextIndex(i, len)) {
                ThreadLocal<?> k = e.get();

                // If we find key, then we need to swap it
                // with the stale entry to maintain hash table order.
                // The newly stale slot, or any other stale slot
                // encountered above it, can then be sent to expungeStaleEntry
                // to remove or rehash all of the other entries in run.
                if (k == key) {
                    e.value = value;

                    tab[i] = tab[staleSlot];
                    tab[staleSlot] = e;

                    // Start expunge at preceding stale entry if it exists
                    if (slotToExpunge == staleSlot)
                        slotToExpunge = i;
                    cleanSomeSlots(expungeStaleEntry(slotToExpunge), len);
                    return;
                }

                // If we didn't find stale entry on backward scan, the
                // first stale entry seen while scanning for key is the
                // first still present in the run.
                if (k == null && slotToExpunge == staleSlot)
                    slotToExpunge = i;
            }

            // If key not found, put new entry in stale slot
            tab[staleSlot].value = null;
            tab[staleSlot] = new Entry(key, value);

            // If there are any other stale entries in run, expunge them
            if (slotToExpunge != staleSlot)
                cleanSomeSlots(expungeStaleEntry(slotToExpunge), len);
        }

ThreadLocalMap使用线性探测法来解决哈希冲突。

该方法一次探测下一个地址，直到有空的地址后插入，弱整个空间都找不到空余的地址，则产生一处。

弱引用和内存泄漏

在使用ThreadLocal的过程中发现有内存泄漏的情况发生，猜测这个内存泄漏跟Entry中使用了弱引用的key有关系。其实这个理解不对。

内存泄漏的相关概念

Memory overflow，没有足够的内存提供申请者使用
Memory leak，指程序中已动态分配的堆内存由于某种原因程序未释放或无法释放，造成系统内存的额浪费，导致运行速度减慢甚至系统崩溃等严重后果。内存泄漏的堆积终将导致内存溢出。

弱引用的相关概念

强引用（Strong Reference），就是我常见的普通对象引用，只要还有强引用指向一个对象，就证明对象还活着，垃圾回收旧不会回收这种对象。

弱引用（Weak Reference），垃圾回收器一旦发现了值具有弱引用的对象，不管当前内存空间足够与否，都会回收它的内存。

如果key使用强引用

假设在业务代码中使用完ThreadLocal，threadLocal Ref被回收了。
但是因为threadLocalMap的Entry强引用了threadLocal，造成了threadLocal无法被回收。
在没有手动删除这个Entry以及CuccrentThread依然运行的前提下，始终有强引用链threadRef->currentThread->threadLocalMap->entry，导致Entry内存泄漏、

也就是说，ThreadLocalMap中的key使用了强引用，是无法完全避免内存泄漏的。

如果Key使用弱引用

还是假设在业务代码中使用完ThreadLocal，threadLocal Ref被回收了
由于THreadLocalMap值持有THreadLocal的弱引用，没没有任何强引用指向threadLocal实例，所以threadLocal就可以顺利地被gc回收，此时Entry中的key = null。
但是在没有手动删除这个Entry以及CurrentThread依然运行的前提下，在有强引用链 threadRef -> currentThread-> threadLocalMap -> entry -> value， value不会被回收，而这块value永远不会被访问到了，导致value 内存泄漏

也就是说，ThreadLocalMap中的key使用了弱引用，也有可能内存泄漏。

需要能回答的问题

Java中的引用类型有哪几种?

每种引用类型的特点是什么?

每种引用类型的应用场景是什么? ThreadLocal你了解吗

ThreadLocal应用在什么地方? Spring事务方面应用到了

ThreadLocal会产生内存泄漏你了解吗?

Duo Space

Welcome to my space!