Tokenscope

TokenScope是一个面向大型语言模型的令牌感知目录探索工具，提供智能目录结构分析、文件内容令牌感知提取、令牌使用统计和综合报告生成功能。

开发者工具文件系统 #目录分析 #令牌感知 #代码扫描 #LLM工具 .Python

评分 : 2.5分

下载量 : 6.8K

更新时间 : 2025-04-28

打开站点

什么是TokenScope?

TokenScope是一个专为大型语言模型设计的MCP服务器，它能够高效地分析和探索代码目录，帮助您快速了解项目结构并专注于重要的文件。

如何使用TokenScope?

通过简单的命令行操作，您可以轻松扫描目录结构、分析文件内容并生成详细的报告。

适用场景

TokenScope非常适合需要深入理解代码库或文件结构的开发者和研究人员，特别是在处理大型代码库时。

主要功能

目录扫描

以高效的令牌方式扫描目录结构，支持递归深度设置和自定义忽略模式。

文件内容提取

智能提取文件内容，尊重令牌限制并支持多种文件格式。

令牌统计

估算处理整个代码库所需的令牌数量，并按文件扩展名分解统计。

综合报告生成

生成包含目录结构、令牌统计和重要文件样本的Markdown报告。

安全路径验证

确保所有文件操作都在指定的安全目录内进行，防止越权访问。

优势

令牌意识设计，高效处理大目录。

支持自定义忽略模式，灵活适应不同项目需求。

内置安全机制，保障操作合规性。

生成高质量的综合报告，便于团队协作。

局限性

需要Python 3.10及以上版本。

对非常大的目录可能需要较高的计算资源。

初次安装可能需要一些依赖配置经验。

如何使用

安装TokenScope

通过pip安装TokenScope并确保已安装Python 3.10或更高版本。

启动TokenScope

运行TokenScope服务器，并指定基础目录以确保安全性。

配置在Claude Desktop中

将TokenScope添加到Claude Desktop的MCP服务器配置中。

使用案例

目录结构扫描

请求扫描项目目录并关注最重要的文件。

令牌统计分析

分析代码库的总令牌使用情况。

生成目录报告

生成包含目录结构、令牌统计和文件样本的综合报告。

常见问题

TokenScope是否支持自定义忽略模式？

TokenScope是否需要特殊的权限才能运行？

如何查看TokenScope的令牌使用情况？

🚀 令牌范围

TokenScope 是一款用于高效分析代码库并与大语言模型（LLM）交互的工具。它整合了 FastMCP 和 tiktoken，旨在借助大型语言模型对代码库开展高效的上下文分析。

🚀 快速开始

以下是一个简单的使用示例：

from tokenscope import create_client

# 创建客户端
client = create_client()

# 扫描目录结构
structure = client.scan_directory_structure("path/to/directory")

# 提取文件内容
content = client.extract_file_content("path/to/file.py")

# 分析 token 使用情况
analysis = client.analyze_token_usage("path/to/directory")

print(structure)
print(content)
print(analysis)

✨ 主要特性

目录结构扫描：以 token 有效的方式扫描并返回目录结构。
文件内容提取：提取特定文件的内容，考虑 token 限制和格式。
文件搜索：根据指定模式在目录结构中搜索文件。
token 使用分析：分析目录或文件的 token 使用情况，估算 LLM 处理需求。
生成报告：生成包含目录统计信息的综合 Markdown 报告。
文件复制：将文件从源路径复制到目标路径。

📦 安装指南

通过 pip 安装

pip install token-scope

克隆仓库并安装

git clone https://github.com/yourusername/token-scope.git
cd token-scope
python -m pip install .

💻 使用示例

基础用法

from tokenscope import create_client

# 创建客户端
client = create_client()

# 扫描目录结构
structure = client.scan_directory_structure("path/to/directory")

# 提取文件内容
content = client.extract_file_content("path/to/file.py")

# 分析 token 使用情况
analysis = client.analyze_token_usage("path/to/directory")

print(structure)
print(content)
print(analysis)

高级用法

以下展示了使用 TokenScope 各个功能的代码示例：

# 扫描目录结构
scan_directory_structure(
    path: str, 
    depth: int = 3,
    max_tokens: int = 10000,
    ignore_patterns: list[str] | None = None,
    include_gitignore: bool = True,
    include_default_ignores: bool = True
)

# 提取特定文件的内容
extract_file_content(
    file_path: str, 
    max_tokens: int = 10000,
    sample_only: bool = False
)

# 根据指定模式搜索目录中的文件
search_files_by_pattern(
    directory: str,
    patterns: list[str],
    max_depth: int = 5,
    include_content: bool = False,
    max_files: int = 100,
    max_tokens_per_file: int = 1000,
    sample_only: bool = False,
    ignore_patterns: list[str] | None = None,
    include_gitignore: bool = True,
    include_default_ignores: bool = True
)

# 分析指定路径的 token 使用情况
analyze_token_usage(
    path: str,
    include_file_details: bool = False,
    ignore_patterns: list[str] | None = None,
    include_gitignore: bool = True,
    include_default_ignores: bool = True
)

# 生成包含统计信息的 Markdown 报告
generate_directory_report(
    directory: str, 
    depth: int = 3,
    include_file_content: bool = True,
    max_files_with_content: int = 5,
    max_tokens_per_file: int = 1000,
    sample_only: bool = False,
    ignore_patterns: list[str] | None = None,
    include_gitignore: bool = True,
    include_default_ignores: bool = True
)

# 复制文件到指定路径
copy_file_to_destination(
    source_path: str,
    destination_path: str
)

📚 详细文档

`scan_directory_structure`

扫描目录并返回其结构，考虑 token 限制。

scan_directory_structure(
    path: str, 
    depth: int = 3,
    max_tokens: int = 10000,
    ignore_patterns: list[str] | None = None,
    include_gitignore: bool = True,
    include_default_ignores: bool = True
)

`extract_file_content`

提取特定文件的内容，考虑 token 限制和格式。

extract_file_content(
    file_path: str, 
    max_tokens: int = 10000,
    sample_only: bool = False
)

`search_files_by_pattern`

根据指定模式搜索目录中的文件。

search_files_by_pattern(
    directory: str,
    patterns: list[str],
    max_depth: int = 5,
    include_content: bool = False,
    max_files: int = 100,
    max_tokens_per_file: int = 1000,
    sample_only: bool = False,
    ignore_patterns: list[str] | None = None,
    include_gitignore: bool = True,
    include_default_ignores: bool = True
)

`analyze_token_usage`

分析指定路径的 token 使用情况。

analyze_token_usage(
    path: str,
    include_file_details: bool = False,
    ignore_patterns: list[str] | None = None,
    include_gitignore: bool = True,
    include_default_ignores: bool = True
)

`generate_directory_report`

生成包含统计信息的 Markdown 报告。

generate_directory_report(
    directory: str, 
    depth: int = 3,
    include_file_content: bool = True,
    max_files_with_content: int = 5,
    max_tokens_per_file: int = 1000,
    sample_only: bool = False,
    ignore_patterns: list[str] | None = None,
    include_gitignore: bool = True,
    include_default_ignores: bool = True
)

`copy_file_to_destination`

复制文件到指定路径。

copy_file_to_destination(
    source_path: str,
    destination_path: str
)

🔧 技术细节

TokenScope 自动忽略以下常见目录和文件：

DEFAULT_IGNORE_PATTERNS = [
    ".git/",
    ".venv/",
    "venv/",
    "__pycache__/",
    "node_modules/",
    ".pytest_cache/",
    ".ipynb_checkpoints/",
    ".DS_Store",
    "*.pyc",
    "*.pyo",
    "*.pyd",
    "*.so",
    "*.dll",
    "*.class",
    "build/",
    "dist/",
    "*.egg-info/",
    ".tox/",
    ".coverage",
    ".idea/",
    ".vscode/",
    ".mypy_cache/",
]