焦点精选！如何让别人看不懂你的 JS 代码？

2022-07-26 16:03:02来源：code秘密花园

大家好，我是ConardLi。

我们在上周的文章中一种奇特的 JavaScript 编码风格：Get 一种可以用来装逼的 JavaScript 编码风格，引起了广大网友的热议。

这是实际上属于一种代码混淆技术，可以让们的代码更难阅读和逆向，同时也能租网一些恶意爬虫和自动化分析。天我就带大家来看看还有哪些其他能让JavaScript代码变得难以分析的代码混淆技术。

(资料图)

我们以下面这段代码为例：

console.log("ConardLi",666);

通过一些转换，它可以变成下面这个样子：

怎么做到的呢？我们一起来看一下～

十六进制字符串编码

我们尝试去Javascript Obfuscator这个网站，选中Encode Strings复选框，将得到下面的代码：

console["\x6C\x6F\x67"]("\x43\x6F\x6E\x61\x72\x64\x4C\x69\x20"+ 666)

它的原理很简单，就是将字符串的每个ASCII字符转换为十六进制形式（将函数调用改为用括号的形式，例如console.log->console["log"]在代码混淆中也是相当常见的做法），这就是最简单的混淆了，但是只能骗骗小白，我们可以轻易的反解：

这种技术还有一些其他变体，比如用unicode编码替换字符。

https://javascriptobfuscator.com/Javascript-Obfuscator.aspx

字符串数组映射

还是在上面的网站，我们选中Move Strings这个选项，得到的代码是下面这样的：

var _0x8925=["\x43\x6F\x6E\x61\x72\x64\x4C\x69\x20","\x6C\x6F\x67"];console[_0x8925[1]](_0x8925[0]+ 666)

多了个字符串数组，通过在不同索引处引入数组来间接使用这些字符串。

死代码注入

死代码其实指的就是一些无法访问的代码，我们可以在原本的代码上额外注入一些永远无法访问的代码来让代码难以阅读，但是同时也会让代码变得更大。这次我们尝试一下defendjs：

安装：

$ npm install -g https://github.com/alexhorn/defendjs.git

我们尝试创建一个conardli.js并且将上面的代码放入这个文件，执行下面的命令：

$ defendjs --input conardli.js --features dead_code --output .

得到了下面这一大坨代码：

(function () {    function a(a, d) {        var b = new Array(0);;        var c = arguments;        while (true)            try {                switch (a) {                case 21309:                    return;                case 792:                    function e(a, b) {                        return Array.prototype.slice.call(a).concat(Array.prototype.slice.call(b));                    }                    function f() {                        var a = arguments[0], c = Array.prototype.slice.call(arguments, 1);                        var b = function () {                            return a.apply(this, c.concat(Array.prototype.slice.call(arguments)));                        };                        b.prototype = a.prototype;                        return b;                    }                    function g(a, b) {                        return Array.prototype.slice.call(a, b);                    }                    function h(b) {                        var c = {};                        for (var a = 0; a < b.length; a += 2) {                            c[b[a]] = b[a + 1];                        }                        return c;                    }                    function i(a) {                        return a.map(function (a) {                            return String.fromCharCode(a & ~0 >>> 16) + String.fromCharCode(a >> 16);                        }).join("");                    }                    function j() {                        return String.fromCharCode.apply(null, arguments);                    }                    console.log("ConardLi", 666);                    a = 21309;                    break;                }            } catch (b) {                $$defendjs$tobethrown = null;                switch (a) {                default:                    throw b;                }            }    }    a(792, {});}())

代码很大，其实仔细分析就会发现其余插入的代码都是无法运行的：

最顶层包了一个IIFE，然后有一个a函数，a、b两个参数。调用a函数时只传入了第一个参数 792，然后就会发现 a 函数里有个switch语句，只会执行到第二个case，里面是这样的语句：

e、f、g、h、j、i 这几个函数都是没有调用的，所以只会执行最后的console.log("ConardLi", 666);语句...

https://github.com/alexhorn/defendjs

作用域混淆

我们将代码还原回去，重新执行defendjs的scope能力：

$ defendjs --input conardli.js --features scope --output .

(function () {    {        {            function b(a, b) {                return Array.prototype.slice.call(a).concat(Array.prototype.slice.call(b));            }            function c() {                var a = arguments[0], c = Array.prototype.slice.call(arguments, 1);                var b = function () {                    return a.apply(this, c.concat(Array.prototype.slice.call(arguments)));                };                b.prototype = a.prototype;                return b;            }            function d(a, b) {                return Array.prototype.slice.call(a, b);            }            function e(b) {                var c = {};                for (var a = 0; a < b.length; a += 2) {                    c[b[a]] = b[a + 1];                }                return c;            }            function f(a) {                return a.map(function (a) {                    return String.fromCharCode(a & ~0 >>> 16) + String.fromCharCode(a >> 16);                }).join("");            }            function g() {                return String.fromCharCode.apply(null, arguments);            }        }        var a = [];        console.log("ConardLi", 666);    }}())

这个可能看起来像是前面的一个简单版本，但是有一个关键的区别：它引入了多个具有重复标识符的词法作用域。例如，a可能是最内层作用域中第一个函数的参数，也可以是第二个函数中的变量，甚至可以是与我们的conaole.log语句相同作用域中的变量。在这个简单的示例中，很容易看穿，因为最内层范围内的任何函数都不会在任何地方被调用，但是，现实的业务代码往往是很复杂的，混淆后就不那么容易看穿了。

字符编码

还是使用defendjs，对我们的代码执行下面的命令：

$ defendjs --input conardli.js --features literals --output .

得到下面的代码：

(function () {    function c() {        var c = arguments;        var b = [];        b[1] = "";        b[1] += a(67, 111, 110);        b[1] += a(97);        b[1] += a(114, 100);        b[1] += a(76, 105);        return b[1];    }    {        {            function e(a, b) {                return Array.prototype.slice.call(a).concat(Array.prototype.slice.call(b));            }            function d() {                var a = arguments[0], c = Array.prototype.slice.call(arguments, 1);                var b = function () {                    return a.apply(this, c.concat(Array.prototype.slice.call(arguments)));                };                b.prototype = a.prototype;                return b;            }            function f(a, b) {                return Array.prototype.slice.call(a, b);            }            function g(b) {                var c = {};                for (var a = 0; a < b.length; a += 2) {                    c[b[a]] = b[a + 1];                }                return c;            }            function h(a) {                return a.map(function (a) {                    return String.fromCharCode(a & ~0 >>> 16) + String.fromCharCode(a >> 16);                }).join("");            }            function a() {                return String.fromCharCode.apply(null, arguments);            }        }        var b = [];        console.log(d(c, b)(), 666);    }}())

在这种情况下，硬编码会被转换成Unicode然后重新计算，这样直接阅读代码就很难再直接看穿硬编码的字符串了。

变量缩短

Mangling是一种为了优化和混淆目的而缩短变量和属性名称的转换。比如下面的代码：

let sixSixSix = 666;let name = "ConardLi ";console.log(name + sixSixSix);

我们使用DefendJS的mangling功能：

$ defendjs --input conardli.js --features mangle --output .

得到的代码是：

(function () {    var a = 666;    var b = "ConardLi! ";    console.log(b + a);}())

两个变量都被重新命名了，在这个简单的例子下还是很好分析的。但是如果是庞大的业务代码，这会让我们的代码变得非常难以阅读。

代码压缩

下面，综合利用一下几种技术，执行：

defendjs --input conardli.js --output . --features=control_flow,literals,mangle,compress

得到下面的代码：

(function(){function a(d,g){var b=new Array(1);;var e=arguments;while(true)t

关键词：十六进制就会发现来看一下是这样的

焦点精选！如何让别人看不懂你的 JS 代码？

相关新闻

中山外贸展现出较强韧性前三季度全市外贸进出口2187.9亿元

做一个简易的配置中心，顺带还给整合到了SpringCloud

为什么JSON.parse会损坏大数字，如何解决这个问题？

在任期第一年每位CIO都必须完成的12件事

一次服务器非法重启后导致的故障排查记录

如何在Linux中使用xargs命令

聊聊国产数据库TiDB相关知识，你学会了吗？

什么是 CDN 缓存命中率以及如何计算和优化它？

在传统运维监控系统中加入新的预警能力

Kotlin Flow响应式编程，基础知识入门

程序员应如何理解Reactor模式？

一文掌握所有命令行，包括73个“冷门但有用”的技巧｜GitHub 11万标星之作

一文了解云计算的基本指南

LeCun转推，PyTorch GPU内存分配有了火焰图可视化工具

如何提高无线路由器的安全性

聚焦

IT

科技