当前位置：首页 > news >正文

吴恩达MCP课程（2）：research_server

news 2025/7/27 6:10:35

代码

import arxiv
import json
import os
from typing import List
from mcp.server.fastmcp import FastMCPPAPER_DIR = "papers"mcp = FastMCP("research")@mcp.tool()
def search_papers(topic: str, max_results: int = 5) -> List[str]:"""Search for papers on arXiv based on a topic and store their information.Args:topic: The topic to search formax_results: Maximum number of results to retrieve (default: 5)Returns:List of paper IDs found in the search"""# Use arxiv to find the papersclient = arxiv.Client()# Search for the most relevant articles matching the queried topicsearch = arxiv.Search(query = topic,max_results = max_results,sort_by = arxiv.SortCriterion.Relevance)papers = client.results(search)# Create directory for this topicpath = os.path.join(PAPER_DIR, topic.lower().replace(" ", "_"))os.makedirs(path, exist_ok=True)file_path = os.path.join(path, "papers_info.json")# Try to load existing papers infotry:with open(file_path, "r") as json_file:papers_info = json.load(json_file)except (FileNotFoundError, json.JSONDecodeError):papers_info = {}# Process each paper and add to papers_infopaper_ids = []for paper in papers:paper_ids.append(paper.get_short_id())paper_info = {'title': paper.title,'authors': [author.name for author in paper.authors],'summary': paper.summary,'pdf_url': paper.pdf_url,'published': str(paper.published.date())}papers_info[paper.get_short_id()] = paper_info# Save updated papers_info to json filewith open(file_path, "w") as json_file:json.dump(papers_info, json_file, indent=2)print(f"Results are saved in: {file_path}")return paper_ids@mcp.tool()
def extract_info(paper_id: str) -> str:"""Search for information about a specific paper across all topic directories.Args:paper_id: The ID of the paper to look forReturns:JSON string with paper information if found, error message if not found"""for item in os.listdir(PAPER_DIR):item_path = os.path.join(PAPER_DIR, item)if os.path.isdir(item_path):file_path = os.path.join(item_path, "papers_info.json")if os.path.isfile(file_path):try:with open(file_path, "r") as json_file:papers_info = json.load(json_file)if paper_id in papers_info:return json.dumps(papers_info[paper_id], indent=2)except (FileNotFoundError, json.JSONDecodeError) as e:print(f"Error reading {file_path}: {str(e)}")continuereturn f"There's no saved information related to paper {paper_id}."if __name__ == "__main__":mcp.run(transport="stdio")

代码解释

导入模块

import arxiv        # 用于访问arXiv API搜索论文
import json         # 处理JSON数据
import os           # 操作系统功能，如文件路径处理
from typing import List  # 类型提示
from mcp.server.fastmcp import FastMCP  # 导入MCP框架

常量定义

PAPER_DIR = "papers"  # 定义存储论文信息的目录

MCP服务器初始化

mcp = FastMCP("research")  # 创建一个名为"research"的MCP服务器实例

工具函数定义

1. search_papers 函数

@mcp.tool()
def search_papers(topic: str, max_results: int = 5) -> List[str]:

这个函数被注册为MCP工具，用于在arXiv上搜索特定主题的论文并保存信息：

装饰器：@mcp.tool() 将此函数注册为MCP服务的工具
参数：
- topic: 要搜索的主题
- max_results: 最大结果数量（默认5个）
返回值：找到的论文ID列表

功能流程：

创建arXiv客户端
按相关性搜索主题相关论文
为该主题创建目录（如papers/machine_learning）
尝试加载已有的论文信息（如果存在）
处理每篇论文，提取标题、作者、摘要等信息
将论文信息保存到JSON文件中
返回论文ID列表

2. extract_info 函数

@mcp.tool()
def extract_info(paper_id: str) -> str:

这个函数也被注册为MCP工具，用于在所有主题目录中搜索特定论文的信息：

装饰器：@mcp.tool() 将此函数注册为MCP服务的工具
参数：paper_id - 要查找的论文ID
返回值：包含论文信息的JSON字符串（如果找到），否则返回错误信息

功能流程：

遍历papers目录下的所有子目录
在每个子目录中查找papers_info.json文件
如果找到文件，检查是否包含指定的论文ID
如果找到论文信息，返回格式化的JSON字符串
如果未找到，返回未找到的提示信息

主程序

if __name__ == "__main__":mcp.run(transport="stdio")

总结

research_server.py是一个基于MCP框架的研究服务器，提供了两个主要工具：

搜索arXiv上的论文并保存信息
提取已保存的论文信息

这个服务器可以作为AI助手的后端，通过MCP协议与前端交互，提供论文研究相关的功能。

运行示例

可用inspector工具查看，可以参照这个例子MarkItDown-MCP 测试与debug

请添加图片描述

前一节链接：
吴恩达MCP课程（1）：chat_bot

查看全文

http://www.dtcms.com/a/224501.html

Linux系统下安装配置 Nginx

Kanass入门教程- 事项管理

机器视觉2D定位引导-合同要点重度讲解-技术要点及注意事项

Java-Character类静态方法深度剖析

C语言结构体的别名与创建结构体变量

共享内存-systemV

Python 从入门到精通视频下载

各种数据库，行式、列式、文档型、KV、时序、向量、图究竟怎么选？

点云识别模型汇总整理

【Doris基础】Doris中的Replica详解：Replica原理、架构

华为OD机试真题——找出两个整数数组中同时出现的整数（2025A卷：100分）Java/python/JavaScript/C++/C语言/GO六种最佳实现

黄金价格查询接口如何用C#进行调用？

Nacos实战——动态 IP 黑名单过滤

AI书签管理工具开发全记录（七）：页面编写与接口对接

手写HashMap

AE已禁用刷新请释放Caps Lock

现代网络安全攻防技术与发展现状

头歌java课程实验（学习-Java字符串之正则表达式之元字符之判断字符串是否符合规则）

使用Python实现Windows系统垃圾清理

Webug4.0靶场通关笔记16- 第16关MySQL配置文件下载

项目日记 -Qt音乐播放器 -搜索模块

Linux研学-用户解析

【Java笔记】Spring IoC DI

ApiHug 1.3.9 支持 Spring 3.5.0 + Plugin 0.7.4 内置小插件升级！儿童节快乐！！！

新闻数据加载（鸿蒙App开发实战）

flowable候选人及候选人组（Candidate Users 、Candidate Groups）的应用包含拾取、归还、交接

neo4j 5.19.0安装、apoc csv导入导出及相关问题处理

内容中台构建数字化管理新路径

每日c/c++题备战蓝桥杯（P1204 [USACO1.2] 挤牛奶 Milking Cows）

【多线程初阶】死锁的产生如何避免死锁

目录

代码