IMicrosoft ibonisa iMarkItDown, isixhobo sokuguqula amaxwebhu akho abe yiMarkdown

MarkItDown

Kwixesha elidlulileyo, kungekudala emva kokupapasha i-a isikhokelo kwiMarkdown, Ndakhangela ulwazi ukufumanisa ukuba iLibreOffice ingasetyenziselwa ukwenza ezi ntlobo zamaxwebhu. Andizange ndifumane nto, ngaphandle kweempendulo ezinjengokuthi "kutheni ufuna ukwenza loo nto?" bala. Ingcinga yayikukudala uxwebhu ngomhleli wohlobo lweLizwi okanye uMbhali kwaye emva koko uyigcine kwi-.md ifomathi, kodwa njengoko benditshilo; Andifumananga nto. Kutshanje, iMicrosoft ikhuphe isixhobo sento efanayo, kwaye igama layo li MarkItDown.

I-MarkItDown yi ilayibrari yepython enokuthi ifakwe kwisistim - hayi kwiLinux ukusukela kwiPython 3.12 - okanye kwindawo ebonakalayo (env). Emva kofakelo, isiseko okanye ukusetyenziswa okukrwada kuya kufuna ukubhala imigca embalwa kwiPython, onayo apha ngezantsi. Kodwa asiyiyo yodwa indlela yokuyisebenzisa.

MarkItDown usebenzisa iPython

I-API yile ilula:

ukusuka kwimarkitdown import MarkitDown markitdown = MarkItDown() result = markitdown.convert("test.xlsx") print(result.text_content)

Ukusuka apha ngasentla, umgca wokuqala ungenisa ngaphandle ithala leencwadi; eyesibini yenza into ehambelanayo; Kweyesithathu yenza ukuguqulwa - kwifayile kumzekelo ebizwa ngokuba yi-text.xlsx - kwaye okwesine iya kuprinta umphumo kwi-console. Ngaphezu koko, njengoko kuchaziwe kwi eyakho iGitHub, inokwenziwa ihambelane neLLM njengeChatGPT, konke oku kukuthanda komthengi kwaye kuxhomekeke kulwazi lomntu ngamnye.

Ukuba ikhowudi ayisiyiyo eyona ilungileyo kuthi, umphuhlisi ogama linguMat Palmer wenze iwebhu ukuququzelela umsebenzi. Nangona ikwisiNgesi, ukusetyenziswa kwayo kulula kakhulu. Emazantsi ebhokisi ibonisa iifayile ezixhaswayo, eziyiPDF, PPTX, DOCX, XLSX, Imifanekiso, Audio, HTML kunye neefayile zombhalo. Ekuphela kwento ekuya kufuneka siyenze kukutsala ifayile kwibhokisi kwaye silinde umlingo ukuba wenzeke, njengoko kubonwa kwi-header screenshot.

Ngexesha lokubhala kukho ingxaki yokukhuphela ifayile, ebonisa umyalezo wephutha endaweni yombhalo. Kuyenzeka, into endingakhange ndiyiqinisekise, ukuba ndiyayibona kuba ndiyenzile ifayile esuka kwiLinux, LibreOffice okanye zombini, kodwa ndiyayibona loo mpazamo xa ukhuphela ifayile. Iyenza kakuhle uguqulo, kwaye ungasoloko ukopa okubhaliweyo okungenanto okuvelisayo, yincamathisele kwifayile yokubhaliweyo kwaye uyigcine ngolwandiso lwe .md.

Ukuyijonga, kwiLinux sinokusebenzisa izixhobo ezinje nge-Okular, iKhowudi yeSitudiyo esiBonakalayo okanye inkqubo ethile. ulungile, phakathi kwabanye.

Ukuqwalasela

Nangona isixhobo senziwe nguMicrosoft, ayizizo zonke izinto eziya kuhlala zihamba kakuhle. Ukuze ufumane iziphumo ezingcono, kufuneka usebenzise iindlela ezichanekileyo. Umzekelo, ukubeka a # Titular o ## Título 2, kufuneka ukhethe oko kwiLizwi okanye kwiinketho zoMbhali. Okufanayo noluhlu olucwangcisiweyo okanye olungacwangciswanga, amakhonkco, imifanekiso ... Ukuba endaweni yokusebenzisa ukhetho oluchanekileyo, sikhetha isicatshulwa kwaye sibeke ifonti engqindilili kunye nenkulu, i-Markdown ayisebenzi ngolo hlobo, kwaye sinokufumana iziphumo ezixubileyo. Ngolwazi oluthe kratya malunga neempawu ezixhaswayo, sibhekisa kuwe kwikhonkco oya kuyifumana kumhlathi wokuqala wale nqaku.

Ngoku, sisixhobo esisemthethweni seMicrosoft, kwaye inokuba lolona khetho lulungileyo lokuguqula iifayile ezixhaswayo zibe yiMarkdown.


Shiya uluvo lwakho

Idilesi yakho ye email aziyi kupapashwa. ezidingekayo ziphawulwe *

*

*

  1. Inoxanduva lwedatha: I-AB Internet Networks 2008 SL
  2. Injongo yedatha: Ulawulo lwe-SPAM, ulawulo lwezimvo.
  3. Umthetho: Imvume yakho
  4. Unxibelelwano lwedatha: Idatha ayizukuhanjiswa kubantu besithathu ngaphandle koxanduva lomthetho.
  5. Ukugcinwa kweenkcukacha
  6. Amalungelo: Ngalo naliphi na ixesha unganciphisa, uphinde uphinde ucime ulwazi lwakho.