# 通用基础库
版本: v0.1.9
* 扩展了对json格式文件的读写支持:readjson, readjsonp, savejson, savejsonp
* 增加了MutliTask 多进程任务类
* 优化了计时器类:TimeCount
版本: v0.1.5
* 修改了splitset方法,可用于拆分数据集
* 增加 split_dataframe方法,可对DataFrame进行拆分数据集;
* 增加 分层抽取方法: data_split, save_data_split
版本: v0.1.4
* 修改了TimeCount类
版本: v0.1.1
可对目录下的文件进行以下批量处理:
* 清除空格 空行 按句子分行;
* 删除空文件,找到后改名(改为"原文件名.del") 或者直接删除
* 删除重复的文件: 根据文件的MD5判断文件是否相同,找到后改名(原文件.same)或者直接删除
* 批量重命名: 可按序号进行重命名,默认从1开始,文件名会自动在前面补0,例如"0001.txt"
* 可统计文本文件的行数 [2019/1/18 添加]
* 对数据进行检查;
* 对数据重复数据检查并删除;
* 对数据进行随机抽样;
* 处理参数可以自定义顺序,
Raw data
{
"_id": null,
"home_page": "https://github.com/xmxoxo/baselibs",
"name": "baselibs",
"maintainer": null,
"docs_url": null,
"requires_python": null,
"maintainer_email": null,
"keywords": "baselibs",
"author": "He xi",
"author_email": "xmhexi@qq.com",
"download_url": "https://files.pythonhosted.org/packages/f2/ae/4cc6d7f893e4889e700f63a1d6662bcbde5bbad353d55e9c1ff9a9a72b67/baselibs-0.1.9.tar.gz",
"platform": null,
"description": "#\u00a0\u901a\u7528\u57fa\u7840\u5e93 \r\n\r\n\u7248\u672c: v0.1.9\r\n\r\n* \u6269\u5c55\u4e86\u5bf9json\u683c\u5f0f\u6587\u4ef6\u7684\u8bfb\u5199\u652f\u6301\uff1areadjson, readjsonp, savejson, savejsonp\r\n* \u589e\u52a0\u4e86MutliTask \u591a\u8fdb\u7a0b\u4efb\u52a1\u7c7b\r\n* \u4f18\u5316\u4e86\u8ba1\u65f6\u5668\u7c7b\uff1aTimeCount\r\n\r\n\r\n\u7248\u672c: v0.1.5\r\n\r\n* \u4fee\u6539\u4e86splitset\u65b9\u6cd5\uff0c\u53ef\u7528\u4e8e\u62c6\u5206\u6570\u636e\u96c6\r\n* \u589e\u52a0 split_dataframe\u65b9\u6cd5\uff0c\u53ef\u5bf9DataFrame\u8fdb\u884c\u62c6\u5206\u6570\u636e\u96c6\uff1b\r\n* \u589e\u52a0 \u5206\u5c42\u62bd\u53d6\u65b9\u6cd5: data_split, save_data_split\r\n\r\n\u7248\u672c: v0.1.4\r\n\r\n* \u4fee\u6539\u4e86TimeCount\u7c7b\r\n\r\n\u7248\u672c: v0.1.1\r\n\r\n\u53ef\u5bf9\u76ee\u5f55\u4e0b\u7684\u6587\u4ef6\u8fdb\u884c\u4ee5\u4e0b\u6279\u91cf\u5904\u7406\uff1a\r\n\r\n* \u6e05\u9664\u7a7a\u683c \u7a7a\u884c \u6309\u53e5\u5b50\u5206\u884c\uff1b\r\n* \u5220\u9664\u7a7a\u6587\u4ef6\uff0c\u627e\u5230\u540e\u6539\u540d\uff08\u6539\u4e3a\"\u539f\u6587\u4ef6\u540d.del\") \u6216\u8005\u76f4\u63a5\u5220\u9664\r\n* \u5220\u9664\u91cd\u590d\u7684\u6587\u4ef6: \u6839\u636e\u6587\u4ef6\u7684MD5\u5224\u65ad\u6587\u4ef6\u662f\u5426\u76f8\u540c\uff0c\u627e\u5230\u540e\u6539\u540d\uff08\u539f\u6587\u4ef6.same)\u6216\u8005\u76f4\u63a5\u5220\u9664\r\n* \u6279\u91cf\u91cd\u547d\u540d: \u53ef\u6309\u5e8f\u53f7\u8fdb\u884c\u91cd\u547d\u540d\uff0c\u9ed8\u8ba4\u4ece1\u5f00\u59cb\uff0c\u6587\u4ef6\u540d\u4f1a\u81ea\u52a8\u5728\u524d\u9762\u88650\uff0c\u4f8b\u5982\"0001.txt\"\r\n* \u53ef\u7edf\u8ba1\u6587\u672c\u6587\u4ef6\u7684\u884c\u6570 [2019/1/18 \u6dfb\u52a0]\r\n* \u5bf9\u6570\u636e\u8fdb\u884c\u68c0\u67e5\uff1b\r\n* \u5bf9\u6570\u636e\u91cd\u590d\u6570\u636e\u68c0\u67e5\u5e76\u5220\u9664\uff1b\r\n* \u5bf9\u6570\u636e\u8fdb\u884c\u968f\u673a\u62bd\u6837\uff1b\r\n* \u5904\u7406\u53c2\u6570\u53ef\u4ee5\u81ea\u5b9a\u4e49\u987a\u5e8f\uff0c\r\n",
"bugtrack_url": null,
"license": null,
"summary": "baselibs",
"version": "0.1.9",
"project_urls": {
"Blog": "https://blog.csdn.net/xmxoxo",
"Homepage": "https://github.com/xmxoxo/baselibs"
},
"split_keywords": [
"baselibs"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "2479183ffc2e8be09b3a9f8a50065dcb6b105cb485a1b5775dda6cf6216ac86a",
"md5": "00bf3d12ef7c9cd1f54bb9a4d430db35",
"sha256": "a09b77fece78e779eca4db0669eacbe7bdd6e641287d9650c425a1250764af66"
},
"downloads": -1,
"filename": "baselibs-0.1.9-py3-none-any.whl",
"has_sig": false,
"md5_digest": "00bf3d12ef7c9cd1f54bb9a4d430db35",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": null,
"size": 34897,
"upload_time": "2024-11-25T06:44:56",
"upload_time_iso_8601": "2024-11-25T06:44:56.290672Z",
"url": "https://files.pythonhosted.org/packages/24/79/183ffc2e8be09b3a9f8a50065dcb6b105cb485a1b5775dda6cf6216ac86a/baselibs-0.1.9-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "f2ae4cc6d7f893e4889e700f63a1d6662bcbde5bbad353d55e9c1ff9a9a72b67",
"md5": "311d51da7f808ab1d0dd574592644a63",
"sha256": "59ad8019b760522818dfff1fe69aca152cc120fc105dff281358f09fbedd1afd"
},
"downloads": -1,
"filename": "baselibs-0.1.9.tar.gz",
"has_sig": false,
"md5_digest": "311d51da7f808ab1d0dd574592644a63",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 34731,
"upload_time": "2024-11-25T06:44:58",
"upload_time_iso_8601": "2024-11-25T06:44:58.456200Z",
"url": "https://files.pythonhosted.org/packages/f2/ae/4cc6d7f893e4889e700f63a1d6662bcbde5bbad353d55e9c1ff9a9a72b67/baselibs-0.1.9.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-11-25 06:44:58",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "xmxoxo",
"github_project": "baselibs",
"github_not_found": true,
"lcname": "baselibs"
}