معرفی شرکت ها

تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

توضیحات

Dice embedding is an hardware-agnostic framework for large-scale knowledge graph embedding applications

ویژگی	مقدار
سیستم عامل	-
نام فایل	dicee-0.0.3
نام	dicee
نسخه کتابخانه	0.0.3
نگهدارنده	[]
ایمیل نگهدارنده	[]
نویسنده	Caglar Demir
ایمیل نویسنده	caglardemir8@gmail.com
آدرس صفحه اصلی	https://github.com/dice-group/dice-embeddings
آدرس اینترنتی	https://pypi.org/project/dicee/
مجوز	-

# DICE Embeddings: Hardware-agnostic Framework for Large-scale Knowledge Graph Embeddings Knowledge graph embedding research has mainly focused on learning continuous representations of knowledge graphs towards the link prediction problem. Recently developed frameworks can be effectively applied in a wide range of research-related applications. Yet, using these frameworks in real-world applications becomes more challenging as the size of the knowledge graph grows. We developed the DICE Embeddings framework (dicee) to compute embeddings for large-scale knowledge graphs in a hardware-agnostic manner. To achieve this goal, we rely on 1. **[Pandas](https://pandas.pydata.org/) & Co.** to use parallelism at preprocessing a large knowledge graph, 2. **[PyTorch](https://pytorch.org/) & Co.** to learn knowledge graph embeddings via multi-CPUs, GPUs, TPUs or computing cluster, and 3. **[Huggingface](https://huggingface.co/)** to ease the deployment of pre-trained models. **Why [Pandas](https://pandas.pydata.org/) & Co. ?** A large knowledge graph can be read and preprocessed (e.g. removing literals) by pandas, modin, or polars in parallel. Through polars, a knowledge graph having more than 1 billion triples can be read in parallel fashion. Importantly, using these frameworks allow us to perform all necessary computations on a single CPU as well as a cluster of computers. **Why [PyTorch](https://pytorch.org/) & Co. ?** PyTorch is one of the most popular machine learning frameworks available at the time of writing. PytorchLightning facilitates scaling the training procedure of PyTorch without boilerplate. In our framework, we combine [PyTorch](https://pytorch.org/) & [PytorchLightning](https://www.pytorchlightning.ai/). Users can choose the trainer class (e.g., DDP by Pytorch) to train large knowledge graph embedding models with billions of parameters. PytorchLightning allows us to use state-of-the-art model parallelism techniques (e.g. Fully Sharded Training, FairScale, or DeepSpeed) without extra effort. With our framework, practitioners can directly use PytorchLightning for model parallelism to train gigantic embedding models. **Why [Hugging-face Gradio](https://huggingface.co/gradio)?** Deploy a pre-trained embedding model without writing a single line of code. ## Installation ``` pip install dicee ``` or ``` git clone https://github.com/dice-group/dice-embeddings.git conda create -n dice python=3.10 --no-default-packages && conda activate dice pip3 install "pandas>=1.5.1" pip3 install "torch>=2.0.0" pip3 install "polars>=0.16.14" pip3 install "scikit-learn>=1.2.2" pip3 install "pyarrow>=11.0.0" pip3 install "pytest>=7.2.2" pip3 install "gradio>=3.23.0" pip3 install "psutil>=5.9.4" pip3 install "pytorch-lightning==1.6.4" ``` To test the Installation ``` wget https://hobbitdata.informatik.uni-leipzig.de/KG/KGs.zip unzip KGs.zip pytest -p no:warnings -x # it takes circa 15 minutes pytest -p no:warnings --lf # run only the last failed test pytest -p no:warnings --ff # to run the failures first and then the rest of the tests. ``` To see the software architecture, execute the following command ``` pyreverse dicee/ && dot -Tpng -x classes.dot -o dice_software.png && eog dice_software.png # or pyreverse dicee/trainer && dot -Tpng -x classes.dot -o trainer.png && eog trainer.png ``` ## Applications ### Description Logic Concept Learning (soon) ```python from dicee import KGE # (1) Load a pretrained KGE model on KGs/Family pretrained_model = KGE(path='Experiments/2022-12-08 11:46:33.654677') pretrained_model.learn_concepts(pos={''},neg={''},topk=1) ``` ### Conjunctive Query/Question Answering ```python from dicee import KGE # (1) Load a pretrained KGE model on KGs/Family pretrained_model = KGE(path='Experiments/2022-12-08 11:46:33.654677') # (2) Answer the following conjunctive query question: To whom a sibling of F9M167 is married to? # (3) Decompose (2) into two query # (3.1) Who is a sibling of F9M167? => {F9F141,F9M157} # (3.2) To whom a results of (3.1) is married to ? {F9M142, F9F158} pretrained_model.predict_conjunctive_query(entity='<http://www.benchmark.org/family#F9M167>', relations=['<http://www.benchmark.org/family#hasSibling>', '<http://www.benchmark.org/family#married>'], topk=1) ``` ### Triple Classification #### Using pre-trained ConEx on DBpedia 03-2022 ```bash # To download a pretrained ConEx mkdir ConEx && cd ConEx && wget -r -nd -np https://hobbitdata.informatik.uni-leipzig.de/KGE/DBpedia/ConEx/ && cd .. ``` ```python from dicee import KGE # (1) Load a pretrained ConEx on DBpedia pre_trained_kge = KGE(path='ConEx') pre_trained_kge.triple_score(head_entity=["http://dbpedia.org/resource/Albert_Einstein"],relation=["http://dbpedia.org/ontology/birthPlace"],tail_entity=["http://dbpedia.org/resource/Ulm"]) # tensor([0.9309]) pre_trained_kge.triple_score(head_entity=["http://dbpedia.org/resource/Albert_Einstein"],relation=["http://dbpedia.org/ontology/birthPlace"],tail_entity=["http://dbpedia.org/resource/German_Empire"]) # tensor([0.9981]) pre_trained_kge.triple_score(head_entity=["http://dbpedia.org/resource/Albert_Einstein"],relation=["http://dbpedia.org/ontology/birthPlace"],tail_entity=["http://dbpedia.org/resource/Kingdom_of_Württemberg"]) # tensor([0.9994]) pre_trained_kge.triple_score(head_entity=["http://dbpedia.org/resource/Albert_Einstein"],relation=["http://dbpedia.org/ontology/birthPlace"],tail_entity=["http://dbpedia.org/resource/Germany"]) # tensor([0.9498]) pre_trained_kge.triple_score(head_entity=["http://dbpedia.org/resource/Albert_Einstein"],relation=["http://dbpedia.org/ontology/birthPlace"],tail_entity=["http://dbpedia.org/resource/France"]) # very low pre_trained_kge.triple_score(head_entity=["http://dbpedia.org/resource/Albert_Einstein"],relation=["http://dbpedia.org/ontology/birthPlace"],tail_entity=["http://dbpedia.org/resource/Italy"]) # very low ``` ### Relation Prediction ```python from dicee import KGE pre_trained_kge = KGE(path='ConEx') pre_trained_kge.predict_topk(head_entity=["http://dbpedia.org/resource/Albert_Einstein"],tail_entity=["http://dbpedia.org/resource/Ulm"]) ``` ### Entity Prediction ```python from dicee import KGE pre_trained_kge = KGE(path='ConEx') pre_trained_kge.predict_topk(head_entity=["http://dbpedia.org/resource/Albert_Einstein"],relation=["http://dbpedia.org/ontology/birthPlace"]) pre_trained_kge.predict_topk(relation=["http://dbpedia.org/ontology/birthPlace"],tail_entity=["http://dbpedia.org/resource/Albert_Einstein"]) ``` ### Finding Missing Triples ```python from dicee import KGE pre_trained_kge = KGE(path='ConEx') missing_triples = pre_trained_kge.find_missing_triples(confidence=0.95, entities=[''], relations=['']) ``` ## How to Train a KGE model > How to use the framework:`examples`. ## How to Deploy Any pretrained model can be deployed with an ease. By setting ```share=True```, anyone on the internet can use a pretrained model. ```python from dicee import KGE KGE(path='...').deploy(share=True,top_k=10) ``` ![alt text](dicee/figures/deploy_qmult_family.png) ## Pre-trained Models Please contact: ```caglar.demir@upb.de ``` or ```caglardemir8@gmail.com ``` , if you lack hardware resources to obtain embeddings of a specific knowledge Graph. - [DBpedia version: 06-2022 Embeddings](https://hobbitdata.informatik.uni-leipzig.de/KGE/DBpediaQMultEmbeddings_03_07): - Models: ConEx, QMult - [YAGO3-10 ConEx embeddings](https://hobbitdata.informatik.uni-leipzig.de/KGE/conex/YAGO3-10.zip) - [FB15K-237 ConEx embeddings](https://hobbitdata.informatik.uni-leipzig.de/KGE/conex/FB15K-237.zip) - [WN18RR ConEx embeddings](https://hobbitdata.informatik.uni-leipzig.de/KGE/conex/WN18RR.zip) - For more please look at [Hobbit Data](https://hobbitdata.informatik.uni-leipzig.de/KGE/) ### Documentation In documents folder, we explained many details about knowledge graphs, knowledge graph embeddings, training strategies and many more background knowledge. We continuously work on documenting each and every step to increase the readability of our code. ## How to cite Currently, we are working on our manuscript describing our framework. If you really like our work and want to cite it now, feel free to chose one :) ``` # DICE Embedding Framework @article{demir2022hardware, title={Hardware-agnostic computation for large-scale knowledge graph embeddings}, author={Demir, Caglar and Ngomo, Axel-Cyrille Ngonga}, journal={Software Impacts}, year={2022}, publisher={Elsevier} } # KronE @article{demir2022kronecker, title={Kronecker Decomposition for Knowledge Graph Embeddings}, author={Demir, Caglar and Lienen, Julian and Ngomo, Axel-Cyrille Ngonga}, journal={arXiv preprint arXiv:2205.06560}, year={2022} } # QMult, OMult, ConvQ, ConvO @InProceedings{pmlr-v157-demir21a, title = {Convolutional Hypercomplex Embeddings for Link Prediction}, author = {Demir, Caglar and Moussallem, Diego and Heindorf, Stefan and Ngonga Ngomo, Axel-Cyrille}, booktitle = {Proceedings of The 13th Asian Conference on Machine Learning}, pages = {656--671}, year = {2021}, editor = {Balasubramanian, Vineeth N. and Tsang, Ivor}, volume = {157}, series = {Proceedings of Machine Learning Research}, month = {17--19 Nov}, publisher = {PMLR}, pdf = {https://proceedings.mlr.press/v157/demir21a/demir21a.pdf}, url = {https://proceedings.mlr.press/v157/demir21a.html}, } # ConEx @inproceedings{demir2021convolutional, title={Convolutional Complex Knowledge Graph Embeddings}, author={Caglar Demir and Axel-Cyrille Ngonga Ngomo}, booktitle={Eighteenth Extended Semantic Web Conference - Research Track}, year={2021}, url={https://openreview.net/forum?id=6T45-4TFqaX}} # Shallom @inproceedings{demir2021shallow, title={A shallow neural model for relation prediction}, author={Demir, Caglar and Moussallem, Diego and Ngomo, Axel-Cyrille Ngonga}, booktitle={2021 IEEE 15th International Conference on Semantic Computing (ICSC)}, pages={179--182}, year={2021}, organization={IEEE} ``` For any questions or wishes, please contact: ```caglar.demir@upb.de``` or ```caglardemir8@gmail.com```

نیازمندی

مقدار	نام
>=1.5.1	pandas
>=0.16.14	polars
>=11.0.0	pyarrow
>=2.0.0	torch
==1.6.4	pytorch-lightning
>=1.2.2	scikit-learn
>=7.2.2	pytest
>=5.9.4	psutil
>=3.23.0	gradio

زبان مورد نیاز

مقدار	نام
>=3.10	Python

نحوه نصب

نصب پکیج whl dicee-0.0.3:

pip install dicee-0.0.3.whl

نصب پکیج tar.gz dicee-0.0.3:

pip install dicee-0.0.3.tar.gz