# 🎯 COMPACT LIST: 700 articles

Handling 700 scientific articles means working with a large amount of data. Below are the most practical ways to produce compact lists of them.

---

## SOLUTIONS

### **Option 1: Python script (BEST)** ✅

```bash
python3 compact_articles_list.py
```

**Creates:**
- `articles_compact_list.csv` - for Excel
- `articles_compact_list.html` - viewing and sorting in the browser
- `articles_compact_list.md` - Markdown table

**Fields:**
```
| # | Pealkiri | Aasta | Žurnaal | Allikfail | Ülevaade (30 sõna) | Relevants/10 |
```

The columns are: title, year, journal, source file, summary (30 words), and relevance out of 10.

**Advantages:**
- ✅ The HTML output is interactive (sorting, filtering)
- ✅ The CSV opens directly in Excel
- ✅ The Markdown is convenient to read in VS Code
- ✅ Summaries are truncated to 30 words, so the list stays readable

---

### **Option 2: Bash + CURL + jq (FAST)**

```bash
chmod +x compact_curl_list.sh
./compact_curl_list.sh
```

**Creates:**
- `articles_compact.csv`
- `articles_compact.md`

**Advantages:**
- ✅ Fast (no Python startup)
- ✅ Fewer files
- ✅ jq truncates the summaries (20 words)

---

### **Option 3: Simple CURL command (MANUAL)**

```bash
# CSV format
curl -s http://100.80.222.54:9020/v1/graphql \
  -X POST \
  -H "Content-Type: application/json" \
  -d '{
    "query": "{ Get { ScientificArticle(limit: 50) { title year journal abstract_en relevance_score } } }"
  }' \
  | jq -r '.data.Get.ScientificArticle[] | [.title, .year, .journal, .relevance_score] | @csv' \
  > ~/Downloads/articles.csv

# Markdown table
curl -s http://100.80.222.54:9020/v1/graphql \
  -X POST \
  -H "Content-Type: application/json" \
  -d '{
    "query": "{ Get { ScientificArticle(limit: 50) { title year journal relevance_score } } }"
  }' \
  | jq -r '.data.Get.ScientificArticle[] | "| \(.title) | \(.year) | \(.journal) | \(.relevance_score)/10 |"' \
  > ~/Downloads/articles.md
```

---

## FIELDS IN THE COMPACT QUERY

```graphql
{
  Get {
    ScientificArticle(limit: 100) {
      # Core selection - COMPACT
      title            # Article title
      source_file      # Source PDF file
      year             # Publication year
      journal          # Journal name
      abstract_en      # Short summary
      relevance_score  # Relevance (0-10)

      # Optional - increases the response size
      # doi              # DOI
      # authors          # Author list
      # key_concepts     # Keywords
      # processing_date  # Processing date
    }
  }
}
```

---

## ⚡ OPTIMIZATION TIPS

### **1. Increase the limit**

Instead of `limit: 100`, use `limit: 700` (or up to 10,000):

```graphql
ScientificArticle(limit: 700) {
  title
  source_file
  year
}
```

⚠️ **Warning:** Fetching 700+ articles takes roughly 5-15 seconds.

### **2. Filter by keywords**

```graphql
ScientificArticle(
  limit: 100
  where: {
    path: "key_concepts"
    operator: ContainsAny
    valueString: ["transport", "road safety"]
  }
) {
  title
  key_concepts
}
```

### **3. Sort by relevance**

```graphql
ScientificArticle(
  limit: 100
  sort: {path: "relevance_score", order: desc}
) {
  title
  relevance_score
}
```

### **4. Truncation - first 10 words only**

```bash
# jq filter
.abstract_en | split(" ") | .[0:10] | join(" ")
```

---

## PERFORMANCE COMPARISON

| Solution | Outputs | Speed | Convenience |
|----------|-----------|--------|---------|
| Python | CSV, HTML, MD | 3-5s | ⭐⭐⭐⭐⭐ |
| Bash | CSV, MD | 2-3s | ⭐⭐⭐⭐ |
| CURL command | JSON/CSV | 1-2s | ⭐⭐⭐ |

---

## FILE FORMATS

### **CSV** (Excel, LibreOffice)

```
title,year,journal,abstract,relevance
"Article Title",2024,"Nature","Abstract text...",9
```

### **HTML** (sortable in the browser)

```html
| Pealkiri | Aasta | ...
|---|---|
| Article | 2024 | ...