Tensor2Tensor
From MaRDI portal
Software:38238
swMATH26507WikidataQ107386858MaRDI QIDQ38238
No author found.
Source code repository: https://github.com/tensorflow/tensor2tensor
Related Items (only showing first 100 items - show all)
Reconstructing a quantum state with a variational autoencoder ⋮ Graph Neural Networks for Natural Language Processing: A Survey ⋮ Can We Automate Scientific Reviewing? ⋮ A Comprehensive Review of Modern Object Segmentation Approaches ⋮ ConViT: improving vision transformers with soft convolutional inductive biases* ⋮ Adaptive and Implicit Regularization for Matrix Completion ⋮ Mixture of Linear Models Co-supervised by Deep Neural Networks ⋮ Graph representation learning for popularity prediction problem: A survey ⋮ Progressive Interpretation Synthesis: Interpreting Task Solving by Quantifying Previously Used and Unused Information ⋮ A Deep Learning Modeling Framework to Capture Mixing Patterns in Reactive-Transport Systems ⋮ Learning Multiple Quantiles With Neural Networks ⋮ CASA: Conversational Aspect Sentiment Analysis for Dialogue Understanding ⋮ Fine-grained Prediction of Political Leaning on Social Media with Unsupervised Deep Learning ⋮ Visually Grounded Models of Spoken Language: A Survey of Datasets, Architectures and Evaluation Techniques ⋮ FFCI: A Framework for Interpretable Automatic Evaluation of Summarization ⋮ Analyzing Firm Reports for Volatility Prediction: A Knowledge-Driven Text-Embedding Approach ⋮ Detecting Product Adoption Intentions via Multiview Deep Learning ⋮ Automated Reinforcement Learning (AutoRL): A Survey and Open Problems ⋮ Out of Context: A New Clue for Context Modeling of Aspect-based Sentiment Analysis ⋮ Supervised Visual Attention for Simultaneous Multimodal Machine Translation ⋮ Crossing the Conversational Chasm: A Primer on Natural Language Processing for Multilingual Task-Oriented Dialogue Systems ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Decreasing the Size of the Restricted Boltzmann Machine ⋮ Multi-Document Summarization with Determinantal Point Process Attention ⋮ Set-to-Sequence Methods in Machine Learning: A Review ⋮ Long Short-Term Memory Networks for Traffic Flow Forecasting: Exploring Input Variables, Time Frames and Multi-Step Approaches ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Exploring the attention mechanism in LSTM-based Hong Kong stock price movement prediction ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Grouping of contracts in insurance using neural networks ⋮ An External Knowledge Enhanced Graph-based Neural Network for Sentence Ordering ⋮ Dynamical Variational Autoencoders: A Comprehensive Review ⋮ Deep double descent: where bigger models and more data hurt* ⋮ Reconstruction of pairwise interactions using energy-based models* ⋮ High generalization performance structured self-attention model for knapsack problem ⋮ Quantum Mathematics in Artificial Intelligence ⋮ Explainable Deep Learning: A Field Guide for the Uninitiated ⋮ Image Captioning as an Assistive Technology: Lessons Learned from VizWiz 2020 Challenge ⋮ Neural Character-Level Syntactic Parsing for Chinese ⋮ Deep neural networks can stably solve high-dimensional, noisy, non-linear inverse problems ⋮ WaveNet-based deep neural networks for the characterization of anomalous diffusion (WADNet) ⋮ LocalGLMnet: interpretable deep learning for tabular data ⋮ A transformer-based synthetic-inflow generator for spatially developing turbulent boundary layers ⋮ Characterization of anomalous diffusion through convolutional transformers ⋮ Robustness of LSTM neural networks for multi-step forecasting of chaotic time series ⋮ Meta-learning pseudo-differential operators with deep neural networks ⋮ LMMS reloaded: transformer-based sense embeddings for disambiguation and beyond ⋮ Inductive logic programming at 30 ⋮ ReliefE: feature ranking in high-dimensional spaces via manifold embeddings ⋮ Persistence in complex systems ⋮ Entity recognition of Chinese medical text based on multi-head self-attention combined with BILSTM-CRF ⋮ End-to-end deep representation learning for time series clustering: a comparative study ⋮ Mesh-Conv: convolution operator with mesh resolution independence for flow field modeling ⋮ Multi-objective CFD-driven development of coupled turbulence closure models ⋮ Probabilistic interpretation of the distillation problem ⋮ IGA-reuse-NET: a deep-learning-based isogeometric analysis-reuse approach with topology-consistent parameterization ⋮ RockGPT: reconstructing three-dimensional digital rocks from single two-dimensional slice with deep learning ⋮ Learning the travelling salesperson problem requires rethinking generalization ⋮ Transformer-based deep neural language modeling for construct-specific automatic item generation ⋮ Hierarchical Bayesian text modeling for the unsupervised joint analysis of latent topics and semantic clusters ⋮ An enriched category theory of language: from syntax to semantics ⋮ Efficient hybrid explicit-implicit learning for multiscale problems ⋮ Improving sequential latent variable models with autoregressive flows ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Data-driven rogue waves and parameters discovery in nearly integrable \(\mathcal{PT}\)-symmetric Gross-Pitaevskii equations via PINNs deep learning ⋮ Neural-network learning of SPOD latent dynamics ⋮ ProTranslator: zero-shot protein function prediction using textual description ⋮ End-to-end neural event coreference resolution ⋮ Quantifying and alleviating political bias in language models ⋮ Comprehensive analysis of embeddings and pre-training in NLP ⋮ A reinforcement learning approach to the orienteering problem with time windows ⋮ Structure-aware shape correspondence network for 3D shape synthesis ⋮ Reinforcement learning for combinatorial optimization: a survey ⋮ A hybrid inference system for improved curvature estimation in the level-set method using machine learning ⋮ Few-shot learning with adaptively initialized task optimizer: a practical meta-learning approach ⋮ An explainable attention network for fraud detection in claims management ⋮ Machine learning applied to asteroid dynamics ⋮ Multi-fidelity surrogate modeling using long short-term memory networks ⋮ Data-driven soliton mappings for integrable fractional nonlinear wave equations via deep learning with Fourier neural operator ⋮ Rationalizing predictions by adversarial information calibration ⋮ BERT-based NLP techniques for classification and severity modeling in basic warranty data study ⋮ Research on spatio-temporal network prediction model of parallel-series traffic flow based on transformer and GCAT ⋮ Deep reinforcement learning for optimal well control in subsurface systems with uncertain geology ⋮ Robust attentional aggregation of deep feature sets for multi-view 3D reconstruction ⋮ Deep learning for generic object detection: a survey ⋮ Unnamed Item ⋮ A physics-informed diffusion model for high-fidelity flow field reconstruction ⋮ Data-driven forward and inverse problems for chaotic and hyperchaotic dynamic systems based on two machine learning architectures ⋮ Graph-based structural knowledge-aware network for diagnosis assistant ⋮ Improving the quality of machine translation using the reverse model ⋮ Neural network stochastic differential equation models with applications to financial data forecasting ⋮ Mixed membership Gaussians
This page was built for software: Tensor2Tensor