CHEATSHEET March 19, 2024

RAG CheatSheet

Words count 27k Reading time 25 mins.

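langchain + ollama
Local documents
Common models: SBert
Loading models with ollama
Online [src] https://ollama.com/library
    ollama run gemma:2b
Offline
1. Create the model configuration file. Create a configuration file, for example one named Modelfile, whose content specifies the model file to load:
    FROM ./Meta-Llama-3-8B-Instruct-Q6_K.gguf
2. Build the corresponding Ollama model. Use the following command to build it:
    ollama cre...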

CHEATSHEET March 19, 2024

Cluster Setup CheatSheet

Words count 138k Reading time 2:06

Cluster-setup
BIOS config for Disk
    SystemDisk: RAID 1
    DataDisk: JBOD
OS
    On demand: ubuntu/fedora/centos/suse/redhat
Update root password
    sudo passwd root
Passwordless sudo
    su
    chmod u+w /etc/sudoers
    vi /etc/sudoers # username ALL=(ALL:ALL) NOPASSWD: ALL
    chmod u-w /e...

CHEATSHEET March 19, 2024

Git CheatSheet

Words count 853 Reading time 1 mins.

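Checkout
    git checkout <branch>/<tag>
        Checking out a tag this way leaves HEAD detached (commits made there are not on any branch).
    git checkout -b|-B <new_branch> [<start point>]
        Create a new branch from a remote branch name, tag, or commit.
Branch management
List branches
    Local only: git branch
    All: git branch -a
    Pattern match: git branch -a --list <pattern>
Delete a branch
    git bran...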

CHEATSHEET March 19, 2024

Flink CheatSheet

Words count 15k Reading time 14 mins.

flink
Flink applications: code -> JobGraph -> JobManager -> TaskManagers
Environments
    ExecutionEnvironment
    StreamExecutionEnvironment
    TableEnvironment
Parallelism
    env.setParallelism(3)
Data sources
    env.fromSequence(from, to)
    env.fromElements(elements: _*) // add...

CHEATSHEET March 19, 2024

YARN CheatSheet

Words count 16k Reading time 15 mins.

Port 8088 crypto-mining vulnerability
Launching
Get an appID:
    curl -X POST http://10.33.21.190:8088/ws/v1/cluster/apps/new-application
Create the job description file 1.json (reverse shell):
    {"application-id": "application_1639358619460_0019", "application-name": "get-shell", "am-container-spec": {"commands": {"command": "/bi...

CHEATSHEET March 19, 2024

Apache Arrow CheatSheet

Words count 24k Reading time 22 mins.

Key concepts include ValueVector, Field, Schema, VectorSchemaRoot, and Table.
    <dependency>
      <groupId>org.apache.arrow</groupId>
      <artifactId>arrow-memory-netty</artifactId>
      <version>${arrow.version}</version>
    ...

CHEATSHEET March 19, 2024

ClickHouse CheatSheet

Words count 27k Reading time 24 mins.

Startup
The default bound port 9000 conflicts with HDFS; change tcp_port.
Default configuration file: /etc/clickhouse-server/config.xml
Custom configuration directory: /etc/clickhouse-server/config.d/
    clickhouse start
    clickhouse stop
    clickhouse-client
docker
    docker pull clickhouse/clickhouse-server:22.8.14.53
    docker run -di --name=clickho...

Article February 28, 2024

Oceanbase

Words count 798 Reading time 1 mins.

JDBC connection
Maven Repository: com.oceanbase » oceanbase-client (mvnrepository.com)
Copy oceanbase-client-1.1.10.jar into Spark's jars directory, then connect with Beeline:
    !connect jdbc:oceanbase://ip:2883/DB
Accepted username formats: user@tenant#cluster, cluster:tenant:user, cluster-tenant-user, cluster.tenant.user
obdumper
    --compress --compre...
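
The four username formats listed above can be sketched as a small helper. This is only an illustration: the function name `ob_usernames` and the sample values are hypothetical, and which format a given deployment accepts is not stated in the excerpt.

```python
def ob_usernames(user: str, tenant: str, cluster: str) -> list[str]:
    """Return the four username spellings listed in the excerpt.

    The separators come from the excerpt; the helper itself is hypothetical.
    """
    return [
        f"{user}@{tenant}#{cluster}",   # user@tenant#cluster
        f"{cluster}:{tenant}:{user}",   # cluster:tenant:user
        f"{cluster}-{tenant}-{user}",   # cluster-tenant-user
        f"{cluster}.{tenant}.{user}",   # cluster.tenant.user
    ]


# Sample values are placeholders, not taken from the source.
names = ob_usernames("alice", "t1", "c1")
for name in names:
    print(name)
```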

Article February 01, 2024

sqlite3

Words count 1.6k Reading time 1 mins.

usage
    yum install sqlite

    sqlite3 test.db
    .header on
    create table stu(id int, name char, sex char, score int);
    insert into stu values(1001, 'zhangsan', 'm', 89);
    select * from stu where id=1001;
    delete from stu where id=1...
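
The same create, insert, and select steps can also be run from Python's standard `sqlite3` module. This is a minimal sketch, not part of the original post: it uses an in-memory database instead of test.db (an assumption, to avoid leaving a file behind) and omits the delete statement because it is cut off in the excerpt.

```python
import sqlite3


def run_demo(db_path: str = ":memory:"):
    """Create the stu table, insert one row, and read it back."""
    conn = sqlite3.connect(db_path)
    try:
        conn.execute("create table stu(id int, name char, sex char, score int)")
        # Parameter binding instead of inlined literals; same row as the excerpt.
        conn.execute(
            "insert into stu values (?, ?, ?, ?)",
            (1001, "zhangsan", "m", 89),
        )
        conn.commit()
        return conn.execute("select * from stu where id = ?", (1001,)).fetchall()
    finally:
        conn.close()


rows = run_demo()
print(rows)
```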

Article January 23, 2024

dataTable

Words count 4.1k Reading time 4 mins.

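dim
Using historical versions when backfilling data
Add time information to the table name, or add a partition that carries time information. A function then resolves the table or partition for a given time and returns the latest version by default.
Below is an example that adds time information to the table name:
    def getChannelTable(spark: SparkSession, date: String): String = {
      val db = "dim"
      var result = "chan...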
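
The lookup described above (resolve a versioned table by date, defaulting to the latest version) might look roughly like the following. This is a sketch under stated assumptions, not the author's implementation: it assumes version suffixes are yyyyMMdd strings, that the newest version not later than the requested date should be used, and that the earliest version is the fallback when the date precedes all versions. The name `resolve_table` and the sample table name are hypothetical; only the `dim` database name comes from the excerpt.

```python
from typing import Optional, Sequence


def resolve_table(
    base: str,
    versions: Sequence[str],
    date: Optional[str] = None,
    db: str = "dim",
) -> str:
    """Pick the versioned table name for a date (yyyyMMdd strings assumed)."""
    if not versions:
        raise ValueError("no versions available")
    ordered = sorted(versions)  # yyyyMMdd strings sort chronologically
    if date is None:
        chosen = ordered[-1]  # default: latest version
    else:
        eligible = [v for v in ordered if v <= date]
        # Assumption: fall back to the earliest version if none is old enough.
        chosen = eligible[-1] if eligible else ordered[0]
    return f"{db}.{base}_{chosen}"


# "some_table" and the version list are placeholders for illustration.
latest = resolve_table("some_table", ["20240101", "20240201"])
backfill = resolve_table("some_table", ["20240101", "20240201"], date="20240115")
print(latest, backfill)
```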