    Repositories list

    • mercury

      Public
      JavaScript
      0020 Updated Jul 30, 2024
    • sol

      Public
      TypeScript
      0020 Updated Jul 29, 2024
    • _rosetta

      Public
      A large-scale, diverse, and linguistically enriched social media corpus of Mandarin in Taiwan.
      TypeScript
      0010 Updated Jul 9, 2024
    • A Python web scraper for extracting post content and comments from the PTT website.
      Python
      Apache License 2.0
      3100 Updated Jun 23, 2024
    • A Python package that asynchronously segments JSON data into TEI XML format.
      Python
      Apache License 2.0
      0000 Updated Apr 29, 2024
    • A repo that demonstrates how to build a BlackLab corpus via Docker and Nginx.
      Shell
      0000 Updated Apr 29, 2024
    • A large-scale, diverse, and linguistically enriched social media corpus of Mandarin in Taiwan.
      TypeScript
      0000 Updated May 17, 2023
    • .github

      Public
      We are building a large-scale, diverse, and linguistically enriched social media corpus of Mandarin in Taiwan.
      0000 Updated Dec 10, 2022
    • scraptt

      Public
      The most comprehensive PTT (踢踢踢) Crawler
      Python
      1300 Updated Sep 2, 2018